Skip to main content
TrustRadius
Azure AI Speech

Azure AI Speech
Formerly Azure Cognitive Speech Services

Overview

What is Azure AI Speech?

The Azure AI Speech service provides a range of speech recognition and generation capabilities including speech transcription, text-to-speech and speech translation. It provides a range of speech recognition and generation capabilities including speech transcription, text-to-speech, speech translation, and speaker recognition.

Read more
Recent Reviews
Read all reviews

Awards

Products that are considered exceptional by their customers based on a variety of criteria win TrustRadius awards. Learn more about the types of TrustRadius awards to make the best purchase decision. More about TrustRadius Awards

Return to navigation

Pricing

View all pricing

Entry-level set up fee?

  • No setup fee
For the latest information on pricing, visithttps://azure.microsoft.com/en…

Offerings

  • Free Trial
  • Free/Freemium Version
  • Premium Consulting/Integration Services

Starting price (does not include set up fee)

  • $1 per month
Return to navigation

Product Details

What is Azure AI Speech?

The Speech service is the unification of speech-to-text, text-to-speech, and speech-translation into a single Azure subscription. It's speech capabilities enable applications, tools, and devices with the Speech CLI, Speech SDK, Speech Devices SDK, Speech Studio, or REST APIs.

Services include:

Speech to Text - Transcribe audio in more than 92 languages and variants. Gain customer insights with call center transcription, improve experiences with voice-enabled assistants, and capture key discussions in meetings.

Text to Speech - Create apps and services that speak conversationally, choosing from more than 215 voices, and 60 languages and variants. Create natural-sounding audio content, improve accessibility with read-aloud functionality, and create custom voice assistants.

Speech Translation - Translate audio from more than 30 languages and customize translations for organization's specific terms in a preferred programming language.

Speaker Recognition - Confirm a person's identity or recognize who's speaking in a meeting by adding speaker verification and identification to an app.

Custom Commands - Users can build a touchless, voice-first experience to improve safety and support back-to-work scenarios.

Custom Keywords - Custom keyword for IoT devices and voice-enabled assistants to set your brand apart—making it more personal, personable, and secure.

Azure AI Speech Technical Details

Deployment TypesSoftware as a Service (SaaS), Cloud, or Web-Based
Operating SystemsUnspecified
Mobile ApplicationNo

Frequently Asked Questions

The Azure AI Speech service provides a range of speech recognition and generation capabilities including speech transcription, text-to-speech and speech translation. It provides a range of speech recognition and generation capabilities including speech transcription, text-to-speech, speech translation, and speaker recognition.

Azure AI Speech starts at $1.

The most common users of Azure AI Speech are from Enterprises (1,001+ employees).
Return to navigation

Comparisons

View all alternatives
Return to navigation

Reviews and Ratings

(16)

Attribute Ratings

Reviews

(1-7 of 7)
Companies can't remove reviews or game the system. Here's why
Score 8 out of 10
Vetted Review
Verified User
Incentivized
This service is well suited for scenarios where you need to integrate text-to-speech and/or speech-to-text into applications. Within our organisation, it is primarily used by students for development purposes to enable said functionality but is also used to provide accessibility to students who have hearing-related issues. Its multi-language support is also beneficial for our international students who have English as a second language and are therefore able to rapidly translate any text or speech that they do not understand.
Johnson Martins | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
Verified User
Ease of the functionalities and the best solution on customer services management and intelligent ability on multiple voice recognition with Azure Cognitive Speech Services are very helpful. Also, the reporting functions manipulation is great and excellent experience managing multiple contacts and the effective data analytics functions offers results in real-time data.
Score 7 out of 10
Vetted Review
Verified User
Incentivized
Azure Cognitive Speech Services can work with many languages and dialects, making it imperative for people working with multi-lingual clients. It also helps to catch speeches while conversing in meetings. The setup is complicated, which is why for a novice user - it is not an easy endeavor to use. The pricing is also high.
Score 10 out of 10
Vetted Review
Verified User
Incentivized
It is well suited for scenarios where there is a requirement to integrate speech-to-text and text-to-speech into user interaction, for example, with chatbots used internally at a large enterprise. We have also investigated the use of Azure Cognitive Speech Services for live captions during meetings and presentations and the additional translation of these captions from English into German.
Score 8 out of 10
Vetted Review
Verified User
Incentivized
Azure Cognitive Speech Services is well suited for scenarios where you need real-time or batch-based data conversion - either from speech to text or text to speech, It can be used to interpret and document customer conversations or employee conversations or to make specific training programs. It can also be used to make and train avatars to read from a text document. It can also be made to use for cases where it can read for employees with special needs.
Score 10 out of 10
Vetted Review
Verified User
Incentivized
Since this is made by Microsoft, integration wouldn't be a huge obstacle with your O365 applications. You just have to check if your apps are available for integration, use the available APIs then you are good to go.
Return to navigation