======= Data Sources ======
The possible sources of speech data:
Acoustic Data
TEDLIUM:
http://www-lium.univ-lemans.fr/en/content/corpus
CSTR VSTK:
http://homepages.inf.ed.ac.uk/jyamagis/page3/page58/page58.html
Librivox:
Text data
Project Guttenberg
Wikipedia
Google N-Grams (books and web)
Subtitles
Crawled data