Powered by deep learning and the speech recognition technology, FPT.AI Speech to Text (STT) service offers an easy-to-use cloud-based API for developers to transcribe spoken words into written words. The service can be integrated with various business applications.
Automatic language recognition
Using the latest advanced neural network algorithms, FPT.AI STT delivers accurate and improved audio recognition over time, recognizing linguistic variants based on regional accents, ages, and the use of non-native Vietnamese words.
Automatic proper nouns transcription and punctuation
STT formats specific contextual results and can correctly transcribe proper nouns (such as proper names, names of place) and appropriate language formats (such as dates and phone numbers). Using machine learning technology, STT service automatically punctuates after each break.
Support real-time or pre-recorded audio
Audio input can be received directly from the microphone of the online application or sent from an audio file.
Customization for enterprises
In addition to the multi-purpose voice recognition service, FPT.AI TTS provides a service channel that customize to enterprises’ needs.
Integrated in mobile phone applications to control IoT devices or voice command software
Cost saving and profit optimization
Automate operational activities, enabling businesses to optimize human resource investments and increase profitability.
Absolute security for customers with the powerful server as well as experienced experts available for 24/7/365 support via multiple customer care channels.