I'm a super-hero in my off-time and I'm trying to build an application that analyzes audio for spoken keywords. (Think emergency/911 calls) If a keyword is "robbery", and that word is spoken within the audio provided I would like to flag that file and possibly translate it to text.
What development libraries or software applications are out there for doing this sort of thing? C++ or Java librar开发者_StackOverflow中文版ies are preferred but not required.
Wiki page here is a good starting point. Of the ones mentioned there I think CMU Sphinx is the most active one.
You could work with Praat http://www.fon.hum.uva.nl/praat/ , it's an excelent program for working with phonetics and it has it's own scripting language. You can also find a lot of scripts in the Praat community. You can also use sendpraat http://www.fon.hum.uva.nl/praat/sendpraat.html to work with the praat functions as a subroutine.
精彩评论