Content Discovery
Uncover the full context hidden in speech recordings by transcribing spoken words into plain text automatically so that you can easily search through the speech’s content.
Uncover the full context hidden in speech recordings by transcribing spoken words into plain text automatically so that you can easily search through the speech’s content.
Recognize the topics that are being discussed in the speech recordings and detect emerging trends so that you can respond to them in the most appropriate way.
Ensure that voice bots fully understand a given voice command by transcribing human speech into text that is easy to process and contains all the spoken words.
Why Phonexia?
Offer your customers Speech to Text adapted specifically to their unique environment and use case scenarios.
Combine Speech to Text with other Phonexia technologies easily and enhance your solutions whenever necessary.
Receive support from our in-house, closed loop team of experts supporting speech technology implementations since 2006.
We have been selling Phonexia products in the Asian market for over 3 years now, and we can clearly say that Phonexia products are by far the best ones in the global market. We have encountered some local and global competitors, but we can easily push them off the table by showing how strong and affordable Phonexia products are. Phonexia’s strength comes from a mixture of dedicated genius engineers and down-to-earth, smart business experts who can draw big pictures.
Technology Details
• Trained with an emphasis on spontaneous telephone conversations
• Based on state-of-the-art techniques for acoustic modeling, including discriminative training and neural network-based features
• Applies channel compensation techniques compatible with the broadest range of audio sources possible: GSM/CDMA, 3G, VoIP, landlines, satellite phones, etc.
• Since the fifth generation, supports the addition of other words to the model via the Language Model Customization tool
• New languages can be trained on demand
Home Credit Case Study
Find out how Home Credit streamlined its call center operators’ work and significantly improved the overview of the executed calls using the analytics powered by the Phonexia Speech to Text technology.
Read the Case Study
Audio File:
• WAV or RAW (PCM 8 or 16 bits, IEEE float 32-bit, A-law or Mu-law, ADPCM)
• FLAC
• OPUS
8kHz+ sampling (other audio formats are automatically converted).
Audio Stream:
Raw signed 16-bit PCM (s16le) audio via HTTP streams or WebSockets
XML/JSON format with all results or results files containing:
• One-best transcription (a time-aligned speech transcript)
• N-best transcription (a confusion network with hypotheses for words at each moment)
Arabic (Gulf), Arabic (Levantine), Bengali, Chinese Mandarin, Croatian, Czech, Dutch, English (US), Farsi, French, Georgian, German, Italian, Kazakh, Pashto, Polish, Russian, Serbian, Slovak, Spanish, Swedish, Turkish, Ukrainian, Vietnamese.
Schedule a demo with our experts and see for yourself how Phonexia Speech to Text technology can help your business.
We will show you a free online demo tailored to your needs, showcasing Phonexia speech and voice recognition technologies and the ways you can use them to achieve your project’s goals.