IBM Watson Speech to Text package
This package supports the following audio file formats: flac, mpeg, mp3, ogg, pcm, wav, and webm. The following languages are supported: Arabic, Brazilian Portuguese, Chinese (Mandarin), English (United Kingdom and United States), French, German, Japanese, Korean, and Spanish (Argentinian, Castilian, Chilean, Colombian, Mexican, and Peruvian).
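As a rough illustration of how the listed formats might be handled before a file is submitted, the sketch below maps a file extension to a MIME type. The mapping mirrors the content types accepted by the Watson Speech to Text REST API; whether this package expects the same values is an assumption, and the helper name `content_type_for` and the default PCM sampling rate are hypothetical.

```python
from pathlib import Path

# Assumed mapping from the supported file extensions to MIME types.
# These follow the Watson Speech to Text REST API; the package itself
# may resolve formats differently.
CONTENT_TYPES = {
    ".flac": "audio/flac",
    ".mpeg": "audio/mpeg",
    ".mp3": "audio/mp3",
    ".ogg": "audio/ogg",
    ".pcm": "audio/l16",
    ".wav": "audio/wav",
    ".webm": "audio/webm",
}


def content_type_for(filename, pcm_rate=16000):
    """Return a MIME type for a supported audio file (hypothetical helper).

    Raw PCM carries no header, so the sampling rate is appended to the
    MIME type; 16000 Hz is only an illustrative default.
    """
    suffix = Path(filename).suffix.lower()
    if suffix not in CONTENT_TYPES:
        raise ValueError(f"Unsupported audio format: {suffix or filename!r}")
    mime = CONTENT_TYPES[suffix]
    if suffix == ".pcm":
        mime = f"{mime};rate={pcm_rate}"
    return mime
```

Rejecting unknown extensions early keeps an unsupported file from being uploaded only to fail at the service.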
|Detect speakers||Identifies the individuals in a conversation between multiple people.|
|Keyword spotting||Detects specific strings in the transcript. The output contains the timestamp(s) for each keyword and a confidence score.|
|Smart formatting||Converts the following types of strings into more conventional representations to make the transcript easier to read:
|Profanity filter||Obscures profanity by replacing it with asterisks in the transcript.|
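The features in the table correspond to options on the underlying Watson Speech to Text recognition request. The sketch below is a minimal illustration, assuming the package forwards them as the REST query parameters `speaker_labels`, `keywords`, `keywords_threshold`, `smart_formatting`, and `profanity_filter`, and that keyword matches come back under `keywords_result` in each result. The function names are hypothetical and not part of the package.

```python
from urllib.parse import urlencode


def build_recognize_query(keywords=(), keywords_threshold=0.5,
                          speaker_labels=False, smart_formatting=False,
                          profanity_filter=True):
    """Build a query string for a recognition request (hypothetical helper)."""
    params = {
        "speaker_labels": str(speaker_labels).lower(),
        "smart_formatting": str(smart_formatting).lower(),
        "profanity_filter": str(profanity_filter).lower(),
    }
    if keywords:
        # Keywords are sent as one comma-separated value; a confidence
        # threshold must accompany them.
        params["keywords"] = ",".join(keywords)
        params["keywords_threshold"] = keywords_threshold
    return urlencode(params)


def keyword_hits(response):
    """Collect (keyword, start, end, confidence) tuples from a response dict."""
    hits = []
    for result in response.get("results", []):
        for keyword, matches in result.get("keywords_result", {}).items():
            for match in matches:
                hits.append((keyword,
                             match["start_time"],
                             match["end_time"],
                             match["confidence"]))
    return hits
```

A caller would append the query string to the recognition endpoint of their own service instance, then pass the decoded JSON response to `keyword_hits` to list each keyword with its timestamps and confidence score.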