Speech and Natural Language Processing

Awesome

A curated list of speech and natural language processing resources. Other lists can be found in this list(link is external). If you want to contribute to this list (please do), send me a pull request. All Sub-caterogires are listed in alphabetical order

Finite State Toolkits and Regular Expressions

Many of the toools in the machine translation section also implement interesting graph and semiring operations.

Language Modelling Toolkits

Speech Recognition

Signal Processing

Text-to-Speech

Speech Data

  • cmudict(link is external) CMUdict (the Carnegie Mellon Pronouncing Dictionary) is a free pronouncing dictionary of English.
  • LibriSpeech ASR corpus(link is external) LibriSpeech is a corpus of approximately 1000 hours of 16kHz read English speech, prepared by Vassil Panayotov with the assistance of Daniel Povey. The data is derived from read audiobooks from the LibriVox project, and has been carefully segmented and aligned.
  • TED-LIUM Corpus(link is external) The TED-LIUM corpus was made from audio talks and their transcriptions available on the TED website.

Machine Translation

Machine Learning

Deep Learning

Natural Language Processing

Applications

Other Tools

  • GraphViz.sty(link is external)
    Really handy tool adding dot languge directly to a LaTex document, useful for
    tweaking the small colorized WFST figure in papers and presentations.

Blogs

Books

Rating

0 out of 5 Stars 0 Review

5 Stars
 
0.00%
4 Stars
 
0.00%
3 Stars
 
0.00%
2 Stars
 
0.00%
1 Star
 
0.00%

About

  • There are no comments yet

Thank you! Review submitted.

Ok