Amazon Transcribe vs. Google Cloud Speech-to-Text

Overview
Amazon Transcribe
  • Rating: Score 8.1 out of 10
  • Most Used By: N/A
  • Product Summary: Amazon Transcribe uses a deep learning process called automatic speech recognition (ASR) to convert speech to text quickly and accurately. Amazon Transcribe can be used to transcribe customer service calls, to automate closed captioning and subtitling, and to generate metadata for media assets to create a searchable archive. Amazon Transcribe Medical can be added to provide medical speech-to-text capabilities to clinical documentation applications.
  • Starting Price: $0 per second

Google Cloud Speech-to-Text
  • Rating: Score 8.4 out of 10
  • Most Used By: N/A
  • Product Summary: Speech-to-Text on Google Cloud is a tool used to convert speech into text using an API powered by Google's AI technologies. The vendor states users can transcribe content in real time or from stored files; deliver a better user experience in products through voice commands; and gain insights from customer interactions to improve service.
  • Starting Price: $0.02 per min
Pricing
Editions & Modules

Amazon Transcribe
  • Custom Language Model: $0.0001 per second
  • Standard Pricing: $0.0004 per second
  • Automatic Content Redaction: $0.0004 per second
  • Transcribe Medical: $0.00125 per second

Google Cloud Speech-to-Text
  • Speech-to-Text V2 API: $0.016 per min
  • Speech-to-Text V1 API: $0.024 per min
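
Because Amazon Transcribe is priced per second and Google Cloud Speech-to-Text per minute, the listed rates are easier to compare once they are put on the same unit. The following is a minimal Python sketch of that conversion, using only the list prices shown above. The names `RATES`, `price_per_minute`, and `cost_for` are illustrative, and the sketch assumes plain list-price arithmetic with no rounding rules, billing minimums, free tier, or volume discounts.

```python
from decimal import Decimal

# List prices in USD, copied from the pricing entries above.
# The dictionary keys and the (rate, unit) layout are illustrative choices.
RATES = {
    "Amazon Transcribe: Custom Language Model": (Decimal("0.0001"), "second"),
    "Amazon Transcribe: Standard Pricing": (Decimal("0.0004"), "second"),
    "Amazon Transcribe: Automatic Content Redaction": (Decimal("0.0004"), "second"),
    "Amazon Transcribe: Transcribe Medical": (Decimal("0.00125"), "second"),
    "Google Cloud Speech-to-Text: V2 API": (Decimal("0.016"), "minute"),
    "Google Cloud Speech-to-Text: V1 API": (Decimal("0.024"), "minute"),
}

SECONDS_PER_UNIT = {"second": 1, "minute": 60}


def price_per_minute(rate, unit):
    """Convert a per-second or per-minute rate to a per-minute rate."""
    return rate * 60 / SECONDS_PER_UNIT[unit]


def cost_for(duration_seconds, rate, unit):
    """List-price cost of `duration_seconds` of audio.

    Assumption: no rounding, billing minimums, free tier, or discounts.
    """
    return rate * duration_seconds / SECONDS_PER_UNIT[unit]


if __name__ == "__main__":
    one_hour = 3600  # seconds
    for name, (rate, unit) in RATES.items():
        print(f"{name}: ${price_per_minute(rate, unit):.4f}/min, "
              f"${cost_for(one_hour, rate, unit):.2f}/hour")
```

At these list prices, one hour of audio works out to $1.44 on Amazon Transcribe Standard Pricing and on the Google V1 API, and to $0.96 on the Google V2 API.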
Offerings
Pricing Offerings
Free Trial
  • Amazon Transcribe: No
  • Google Cloud Speech-to-Text: Yes
Free/Freemium Version
  • Amazon Transcribe: No
  • Google Cloud Speech-to-Text: Yes
Premium Consulting/Integration Services
  • Amazon Transcribe: No
  • Google Cloud Speech-to-Text: No
Entry-level Setup Fee
  • Amazon Transcribe: No setup fee
  • Google Cloud Speech-to-Text: No setup fee
Additional Details (Google Cloud Speech-to-Text)
  • Speech-to-Text V1 API: V1 offers data residency for multi-region only. Models include short, long, phone call, and video. V1 does not include audit logging. New customers get $300 in free credits and 60 minutes for transcribing and analyzing audio free per month, not charged against your credits.
  • Speech-to-Text V2 API: V2 offers data residency for multi-region and single region. Models include short, long, telephony, video, and Chirp. V2 does include audit logging and support for customer-managed encryption keys.
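
As a rough illustration of how the free monthly allowance affects a bill, the sketch below estimates a month of Speech-to-Text V1 usage. It assumes the 60 free minutes are simply subtracted from usage before the $0.024 per minute V1 list price applies, and it ignores the $300 in new-customer credits. The function name `estimated_monthly_cost` is hypothetical, and actual billing rules may differ.

```python
from decimal import Decimal

V1_RATE_PER_MINUTE = Decimal("0.024")  # V1 API list price (USD per minute)
FREE_MINUTES_PER_MONTH = 60            # free monthly allowance quoted above


def estimated_monthly_cost(minutes_used):
    """Estimate a month's V1 cost in USD.

    Assumption: free minutes are deducted first, then the remainder is
    billed at the flat V1 list price. Credits and discounts are ignored.
    """
    billable_minutes = max(0, minutes_used - FREE_MINUTES_PER_MONTH)
    return billable_minutes * V1_RATE_PER_MINUTE


if __name__ == "__main__":
    for minutes in (45, 60, 1000):
        print(f"{minutes} min -> ${estimated_monthly_cost(minutes):.2f}")
```

Under these assumptions, usage at or below 60 minutes costs nothing, and 1,000 minutes is billed as 940 minutes.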
Community Pulse
Considered Both Products

Amazon Transcribe
  • Chose Amazon Transcribe: I use Google Cloud Speech-to-Text and Amazon Transcribe. What makes Amazon Transcribe better for me is the accuracy of the audio-to-text conversion. I have found that Amazon Transcribe is better at handling homophones, contractions, abbreviations, and acronyms. Another …

Google Cloud Speech-to-Text
  • Chose Google Cloud Speech-to-Text: The accuracy of Google Cloud Speech-to-Text is much better than any other tool. It has better API integration with third-party tools. Transcription runs in real time with the best efficiency. It has good language support from across the globe. It provides better noise …
  • Chose Google Cloud Speech-to-Text: Google Cloud Speech-to-Text shows an impressive ROI with increased efficiency, time savings, accuracy, speed, productivity, customer satisfaction, and cost-effectiveness.
Best Alternatives
Small Businesses
  • Amazon Transcribe: Dragon Speech Recognition (Score 8.8 out of 10)
  • Google Cloud Speech-to-Text: RingCentral Contact Center (Score 7.9 out of 10)
Medium-sized Companies
  • Amazon Transcribe: No answers on this topic
  • Google Cloud Speech-to-Text: Zoom Contact Center (Score 9.2 out of 10)
Enterprises
  • Amazon Transcribe: Verint Speech Analytics (Score 8.9 out of 10)
  • Google Cloud Speech-to-Text: Verint Speech Analytics (Score 8.9 out of 10)
User Ratings
Likelihood to Recommend
  • Amazon Transcribe: 9.4 (6 ratings)
  • Google Cloud Speech-to-Text: 8.0 (20 ratings)
Usability
  • Amazon Transcribe: - (0 ratings)
  • Google Cloud Speech-to-Text: 7.3 (1 rating)
User Testimonials
Likelihood to Recommend
Amazon AWS
Amazon Transcribe can be an excellent tool for businesses that would benefit from converting speech or audio to text in a searchable and reportable form. For a call center (inbound or outbound), having a rich transcription of each call, and being able to search it for keywords, is an incredibly valuable benefit. For business meetings, being able to turn a 60- or 90-minute call into a readable transcript that you or others can search or use as a refresher is a very large time saver that will help you work more efficiently. The software does offer many deeper integrations, such as being able to track script usage (for call centers), interruptions, deviations, and so on, which would be very valuable to a management team and for training purposes.
Google
Google Cloud Speech-to-Text is best suited when you want to work on live calls and transcribe interviews, meetings, customer service calls, and other audio or video recordings into text format. This helps create searchable archives, generate meeting minutes, and improve accessibility for individuals with hearing impairments. The service can provide real-time captioning for live events, webinars, broadcasts, and presentations. This enhances accessibility for individuals who are deaf or hard of hearing and those viewing content in noisy environments or without sound. It does not work well where internet bandwidth is poor, since it requires a strong internet connection, or where speakers have strong accents, especially in Mandarin.
Pros
Amazon AWS
  • It converts live recordings to text with few errors.
  • It has powerful speech recognition models: it transcribes even low-quality audio well.
Google
  • An amazing tool that helps a lot in meetings.
  • It improves efficiency by saving a lot of typing time, at least 40-50% of our time.
  • Incredible accuracy with multiple accents and multiple languages.
  • It takes punctuation into consideration.
Cons
Amazon AWS
  • It was not easy to bring Amazon Transcribe to life, but kudos to the vendor for the free support they offered.
Google
  • The software is occasionally tripped up by confusing terminology.
  • Its web-based interface can also feel a tad hard to use compared to more appealing desktop apps.
  • I've experienced the occasional technical issue, though the provider's support team is quick to troubleshoot.
Usability
Amazon AWS
No answers on this topic
Google
I can share insights with stakeholders in record time. And robust API connections let me pipe text into my CRM, marketing automation, and other mission-critical systems.
Alternatives Considered
Amazon AWS
I use Google Cloud Speech-to-Text and Amazon Transcribe. What makes Amazon Transcribe better for me is the accuracy of the audio-to-text conversion. I have found that Amazon Transcribe is better at handling homophones, contractions, abbreviations, and acronyms. Another feature that makes Amazon Transcribe my No. 1 choice is its use of punctuation marks. I can also feed my own vocabulary list into Amazon Transcribe to get better results.
Google
The accuracy of Google Cloud Speech-to-Text is much better than any other tool. It has better API integration with third-party tools. Transcription runs in real time with the best efficiency. It has good language support from across the globe. It provides better noise robustness compared to other tools.
Return on Investment
Amazon AWS
  • Cost reductions in KYC from automating our video approvals.
  • Better customer service.
  • A good base for building new products and services around speech to text.
Google
  • Automating the transcription process saved time and resources compared to manual transcription.
  • Speech-to-text enabled us to make audio content accessible to a wider audience, including individuals with disabilities.
  • We gained valuable insights into customer preferences, behaviors, and sentiment by analyzing voice data.
Screenshots

Google Cloud Speech-to-Text Screenshots

  • Screenshot of audio transcription creation: Creating an audio transcription with the Speech-to-Text API from within the Cloud Console takes just a few steps. It can transcribe short, long, and streaming audio.
  • Screenshot of creating subtitles for videos using AI: Transcriptions with captions and subtitles can be added to existing content or in real time to streaming content. Google's video transcription model can be used for indexing or subtitling video and/or multispeaker content and uses machine learning technology similar to what YouTube uses for video captioning.
  • Screenshot of adding Speech-to-Text to apps: The video covers how to add AI to an application without extensive machine learning model experience. The pretrained Speech-to-Text API lets users enable AI for applications.
  • Screenshot of Language, speech, text, and translation with Google Cloud API: The picture displays a section of a Google training course, where learners use the Speech-to-Text API to transcribe an audio file into a text file, translate with the Google Cloud Translation API, and create synthetic speech with Natural Language AI.