CMUSphinx
Downloads Tutorial C Python Java FAQ Contact Us About

======= Data Sources ======

The possible sources of speech data:

Acoustic Data

http://voxforge.org/

TEDLIUM:

http://www-lium.univ-lemans.fr/en/content/corpus

CSTR VSTK:

http://homepages.inf.ed.ac.uk/jyamagis/page3/page58/page58.html

Librivox:

http://librivox.org

Text data

Project Guttenberg

Wikipedia

Google N-Grams (books and web)

Subtitles

Crawled data

Contact us
  • GitHub Project
  • PocketSphinx Issue Tracker
  • SphinxTrain Issue Tracker
Links
  • AlphaCephei
  • LinkedIn
Feeds
  • Posts
Pixyll theme crafted by John Otander available on GitHub.