Google Cloud Speech-to-Text vs. IBM Watson Text to Speech

IBM Watson Text to Speech

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
Google Cloud Speech-to-Text	Score 7.2 out of 10	N/A	Speech-to-Text on Google Cloud is a tool used to convert speech into text using an API powered by Google’s AI technologies. The vendor states users can transcribe content in real time or from stored files; deliver a better user experience in products through voice commands; and, gain insights from customer interactions to improve service.	$0.02 per min
IBM Watson Text to Speech	Score 9.1 out of 10	N/A	IBM Watson Text to Speech is an API cloud service that enables users to convert written text into natural-sounding audio in a variety of languages and voices within an existing application or within Watson Assistant. It can be used to give a brand a voice and interact with users in their native language. Increase accessibility for users with different abilities, provide audio options to avoid distracted driving, or automate customer service interactions to eliminate hold times.	N/A

Pricing

Google Cloud Speech-to-Text

IBM Watson Text to Speech

Editions & Modules

Speech-to-Text V2 API: $0.016
per min
Speech-to-Text V1 API: $0.024
per min

No answers on this topic

Offerings

Pricing Offerings
Google Cloud Speech-to-Text	IBM Watson Text to Speech
Free Trial
Yes	No
Free/Freemium Version
Yes	No
Premium Consulting/Integration Services
No	No

Entry-level Setup Fee

No setup fee

Additional Details

Speech-to-Text V1 API V1 offers data residency for multi region only. Models include short, long, phone call, and video. V1 does not include audit logging. New customers get $300 in free credits and 60 minutes for transcribing and analyzing audio free per month, not charged against your credits. Speech-to-Text V2 API V2 offers data residency for multi and single region. Models include short, long, telephony, video, and Chirp. V2 does include audit logging and support for customer managed encryption keys.

—

More Pricing Information

Pricing Info

Community Pulse
	Google Cloud Speech-to-Text	IBM Watson Text to Speech

Best Alternatives
	Google Cloud Speech-to-Text	IBM Watson Text to Speech
Small Businesses	RingCentral Contact Center Score 8.1 out of 10	No answers on this topic
Medium-sized Companies	Zoom Contact Center Score 7.6 out of 10	No answers on this topic
Enterprises	Verint Speech and Text Analytics Score 8.4 out of 10	No answers on this topic
All Alternatives	View all alternatives	View all alternatives

User Ratings
	Google Cloud Speech-to-Text	IBM Watson Text to Speech
Likelihood to Recommend	6.6 (44 ratings)	9.1 (7 ratings)
Usability	7.3 (25 ratings)	- (0 ratings)

User Testimonials
	Google Cloud Speech-to-Text	IBM Watson Text to Speech
Likelihood to Recommend	Google Real-time meeting notes for the smaller group audience. Strong language coverage of over 125+ languages. Handles mobile phone recordings and environmental noise effectively. Fast transcription turnaround also supports phrases, which improves industry-specific terminology. Generating QA/compliance audit logs. Also builds the sentences with accurate punctuation and sentence boundaries. It has vast global support centers whose primary focus in resolving customer issues and help multinational engineering in building great products Incentivized Shaik Noor Mohammed Sohail Technical Analyst Read full review	IBM I advise Watson for all scenarios where you need AI to speak. I don't use this for this goal, but I suppose that could be a good solution for tourism and travel: companies in the hospitality industry can make it easier for people to get around and offer tours in numerous languages, all at the same time. In telecommunications, could be used to create customized messaging that the caller can use with customers, and it can generate words from a customer’s records that are read to them in a professional and friendly voice. It's very good for English, for Italian is a little "robotic" but the pronunciation is right. Incentivized Andrea Bardone Manager Read full review
Pros	Google So, first of all it gives the answer or translates in real time which is awesome. It has speaker diarization, which detects who spoke each segment. This is a great feature because it can track the number of people as well. It has an automatic punctuation system that detects each punctuation mark, such as a dot and a comma, and places it in the text. Lastly, it offers a variety of language translations, providing a global platform for interaction with people from different countries. Incentivized Satyam Pandey Associate software developer Read full review	IBM Improve customer experience and engagement. Offers both on-premise and cloud deployments. Automate requests and transactions in our agency. Voice recognition. Allows me to choose dialect which help select accent of the selected speaker. Incentivized Jessica Daniels Information Technology Specialist Read full review
Cons	Google Integration outside of the google eco system is challenging here. Google Cloud Speech-to-Text works only with active internet connection if the internet bandwidth is low it effect the transcription process and can lead to data inaccuracy. In terms of the pricing also this is at higher range which all the companies cannot afford like small scale organisation if they would like to use the tool they would look over the price to make the decision. Reducing the price can increase the product usage more Incentivized IS irfan shaik Technical Consultant Read full review	IBM Add sentiment variation. Add parameter in order to change the characteristic of the voice. Use custom voice in order to tune the speaker. Incentivized Andrea Bardone Manager Read full review
Usability	Google The reasoning behind my 10 is that the UI is very intuitive; I didn't require any formal training to use it. Google's speech-to-text is not just a conversion tool; it helps automate mundane tasks, saves time, and has an almost human-like understanding. Incentivized Vaibhav Singh Sr. Analyst Read full review	IBM No answers on this topic
Alternatives Considered	Google Google Cloud Speech-to-Text outperformed its competitors significantly in terms of accuracy, surpassing any other product available. Additionally, its support for multiple languages was unrivaled in the market. Moreover, for clients with robust bandwidth, Google Cloud Speech-to-Text offered real-time transcription capabilities, enabling users to transcribe live audio streams with minimal delay. Incentivized Verified User Anonymous Read full review	IBM I've never been a fan of the robotic voice of text-to-speech tools and I've always thought they sounded flat and bland. Until I tried IBM Watson Text to Speech, that is. This service uses advanced deep learning techniques to synthesize speech output in natural-sounding voices. I was blown away by how good the voices were that I've been using it for personal stuff like journaling and brainstorming sessions. And I'm definitely going to be using this for podcasting and my YouTube channel. Incentivized Siddhant singh Software Engineer Read full review
Return on Investment	Google It reduced our budget for assistants who transcribed files manually It speeds up the process, because we can have a transcriptions straight after the interviews It increased accuracy, because AI makes the transcriptions for every second, and you can find the words which were said at specific time. Incentivized Maria Sergeeva UX and Content Designer Read full review	IBM Enhance rapid and convenient customer interaction. Client issues are easy to solve by allocating critical info in their native language. Automating requests and transactions reduce hold time which improves customer satisfaction. Incentivized Amanda Donaldson Key Account Manager Read full review
ScreenShots	Google Cloud Speech-to-Text Screenshots