We are happy to announce a list of students which will participate in Google Summer Of Code 2012 project with CMUSphinx organization:
Task
Currently sphinx4 can only work with predefined dictionary. It's possible to build phonetic dictionary automatically but it requires both application of machine learning for training and development of decoder module as well as testing. Various language modules needs to be trained as well. This work will be implement letter to sound rules with OpenFST in sphinx4.
Student John Salatas
Task
Implement the simple reading and pronunciation learning system
Students
Current language models are very basic that means they don't really understand what's transcribed. That affects error rate. Create a decoder over the lattices that will select semantically correct path and create a perfectly readable result.
Student
Create language-independent postprocessing framework that will turn ASR results into something readable with punctuation, abbreviations and capitalization.
http://www.makapa.de/Paulik_Sent_ICASSP08.pdf
Student
Write a crawler which can collect text data for language model training on certain topic
Student
We expect great features implemented this summer. Please stay tuned, the news will appear here.
Hello CMUSphinx User and Developers
If you are interested in CMUSphinx history or just want to become more familar with core CMUSphinx developers and listen to them you can now do so. Recently Sourceforge team and Rich Bowen has made a great podcast with the CMUSphinx team
Check it out
Sonalight, which showed off its product at this week’s Y Combinator Demo Day, thinks voice tech is better put to use tackling real issues users have with their mobiles in everyday settings, like texting while driving. Sonalight actually employs Google’s own existing voice recognition tech, in combination with the CMU Sphinx open source software, to achieve its results. This is a great use case for CMUSphinx.
Visit
To try it.
We are pleased to announce that CMUSphinx project is accepted to Google Summer Of Code 2012 program. That will enable us to help several students to start their way in speech recognition, open source development and in CMUSphinx. We are really excited about that.
http://www.google-melange.com/gsoc/org/google/gsoc2012/cmusphinx
If you are interested to participate as a student, an application period will open soon but it’s better to start preparation of your application right now. Feel free to contact us for any questoins! For more details see:
https://cmusphinx.github.io/wiki/summerofcodestudents
If you would like to be a mentor please sign in into gsoc web application and add your ideas to the ideas list:
https://cmusphinx.github.io/wiki/summerofcodeideas
We invite you to participate!