I would like to build a language model for a text corpus. Are there good out-of-the-box toolkits which will alleviate my task? The only toolkit I know off is t开发者_JAVA百科he Statistical Language Modelling(SLM) Toolkit by CMU.
Regards,
NLTK is very powerful, though I've never used it.
The SRILM toolkit is very useful.
http://www.speech.sri.com/projects/srilm/
KenLM is also worth trying. It's fast and uses good default settings. In contrast to SRILM, it offers less options for configuration.
精彩评论