Overview of the CMUSphinx toolkit

The CMUSphinx toolkit is a leading speech recognition toolkit with various tools used to build speech applications. It contains a number of packages for different tasks and applications, and it can be confusing to decide which one to choose. To shed some light on the parts of the toolkit, here is a list:

  • Pocketsphinx — lightweight recognizer library written in C.
  • Sphinxbase — support library required by Pocketsphinx
  • Sphinx4 — adjustable, modifiable recognizer written in Java
  • Sphinxtrain — acoustic model training tools

We recommend that you use the latest available releases:

Of course, many things are still missing. A phonetic model capable of handling an infinite vocabulary, postprocessing of the decoding result, sense extraction, and other semantic tools should all be added one day. Perhaps you could take one of these on.

The following resources are the main ones for CMUSphinx developers:
