June 25, 2019
By Pavel Jirik in Blog
Our fastest, 5th generation of the Speech To Text and Keyword Spotting technology has been out since last summer, and we have been gradually extending the engine to support more and more languages. As we added two more languages to the engine last week, it is a perfect time to explore its performance in more detail.
The 5th generation of Phonexia Speech To Text and Keyword Spotting technology is our fastest to date, providing much quicker transcription and keyword spotting than earlier generations.
It currently supports English, Russian, Czech, Slovak, Polish, and Dutch, with more languages coming soon.
The first language upgraded to the 5th generation was Polish, back in July 2018, allowing Polish speech to be transcribed to text seven times faster than real time.
Keyword spotting achieved 29x FtRT (faster than real time) performance. In other words, a single CPU core takes only one hour to search 29 hours of Polish speech for specific keywords.
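To make the FtRT arithmetic concrete, here is a minimal Python sketch that converts an FtRT factor into single-core processing time. The function name `processing_hours` and the constants are illustrative and not part of any Phonexia API; the only figures used are the Polish ones quoted above.

```python
def processing_hours(audio_hours: float, ftrt: float) -> float:
    """Return the single-core hours needed to process `audio_hours` of speech.

    `ftrt` is the faster-than-real-time factor, e.g. 29 means that
    29 hours of audio are processed in 1 hour.
    """
    if ftrt <= 0:
        raise ValueError("ftrt must be positive")
    return audio_hours / ftrt


# Polish figures quoted in this post (illustrative constant names).
POLISH_TRANSCRIPTION_FTRT = 7
POLISH_KEYWORD_SPOTTING_FTRT = 29

audio = 29  # hours of recorded Polish speech

kws_hours = processing_hours(audio, POLISH_KEYWORD_SPOTTING_FTRT)
stt_hours = processing_hours(audio, POLISH_TRANSCRIPTION_FTRT)

print(f"Keyword spotting: {kws_hours:.2f} h")  # Keyword spotting: 1.00 h
print(f"Transcription:    {stt_hours:.2f} h")  # Transcription:    4.14 h
```

The same division works for any language and FtRT factor; it assumes one CPU core and says nothing about how throughput scales across multiple cores.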
We then extended the engine with Czech and Dutch in December 2018. Both languages currently achieve 7x FtRT transcription performance and 29x FtRT keyword spotting performance, on a par with Polish.
Then spring 2019 came, and it was time to release another Slavic language: Slovak. Our engine transcribes it even faster than the previously released languages, delivering 9x FtRT transcription performance. Keyword spotting performance was 27x FtRT.
Last week, we added our latest language extensions: Russian and English. Our engine achieves 8x FtRT transcription performance for both languages and, even more impressively, 40x FtRT for Russian and 47x FtRT for English when keyword spotting!
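Check out this summary table:

Language | Released | Transcription | Keyword spotting
Polish | July 2018 | 7x FtRT | 29x FtRT
Czech | December 2018 | 7x FtRT | 29x FtRT
Dutch | December 2018 | 7x FtRT | 29x FtRT
Slovak | Spring 2019 | 9x FtRT | 27x FtRT
Russian | June 2019 | 8x FtRT | 40x FtRT
English | June 2019 | 8x FtRT | 47x FtRT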
As you can see, Phonexia Speech To Text and Keyword Spotting functionality offers exciting transcription and keyword spotting capabilities ready to be unleashed!
So, what's next?
We are now working on Latin American Spanish and expect to release it this summer. Stay tuned, as more languages are on the way!