The last update reported an error rate of almost 16% for sufficiently large audio files. Experiments were conducted to pinpoint the source of these errors, and it was then suggested that we align audio without first classifying its speech and non-speech segments. This configuration is now being tested with different grammars covering the various kinds of errors that can occur in the audio and/or the transcription.
Audio files with perfect transcriptions, for utterances up to 20 minutes long, have been aligned with the current state of the aligner, resulting in a word error rate close to 0%. The grammar used for this kind of alignment only allows transitions from a word to its immediate successor in the transcription.
We are currently classifying the different kinds of errors that occur in transcriptions and utterances, and modelling grammars that allow alignment in their presence.
The past days' work addressed the use of the restrictive read_line function in SphinxTrain. All occurrences of read_line were eliminated in favour of line iterators from sphinxbase. In the process, a decision was made to modify the lineiter interface so that it supports the original read_line functionality, e.g. comment skipping and whitespace trimming. The interface now provides the following methods:
Usages of line iterators in sphinxbase were also updated to comply with the interface modifications. These changes were necessary to enable SphinxTrain to train on audio files of unlimited size, and lineiter is meant to become the central input interface of SphinxTrain in the future.
Work on the examination of memory issues is ongoing. It will be followed by the implementation of a memory-optimized Baum-Welch algorithm, as described at https://cmusphinx.github.io/wiki/longaudiotraining, as well as by finding other ways to reduce the excessive memory demands of the current SphinxTrain version.
As indicated in my last update on the Long Audio Alignment project (https://cmusphinx.github.io/2011/05/long-audio-alignment-week-1/), my goal this week was to fix the problem of generating pronunciations for out-of-vocabulary (OOV) words. In the absence of a reliable Java-based automata library, I added an existing phone generator from FreeTTS to Sphinx 4 to generate pronunciation hypotheses for OOV words.
The module that generates these hypotheses is designed to ensure correct pronunciation hypotheses for:
A branch named long-audio-alignment has been created in SVN. The source files for this project can be found there.
With the current state of the aligner model (i.e. with pronunciations for all words in the dictionary), the word error rate (WER) was found to have improved to 0.16, compared to 0.18 without the pronunciation generator.
After a few more quick experiments with grammars, we next aim to model anchors based on a trigram language model.
Here comes the first update on the Long Audio Training project. Its aim is to enable SphinxTrain to train on recordings that are hours long; presently, SphinxTrain can only process files up to approximately 3 minutes.
Full info on the project can be found at https://cmusphinx.github.io/wiki/longaudiotraining.
During the last week, a collection of audio files 5 to 10 minutes long was turned into a CMUSphinx training database in order to determine the issues that arise when training on longer recordings. The first experiments resulted in two main findings:
A branch named long-audio-training was created in the CMUSphinx SVN repository. All work on this project will be committed to this branch.
The first change is a fix for the word-lookup problem, which was caused by the truncation of a transcription sentence and thus of its last word. The idea is to replace all usages of read_line in SphinxTrain with the lineiter_* family of functions from sphinxbase, which impose no such limits on the length of the input.
More updates to come soon, stay tuned!