Amazon Transcribe vs. Google Cloud Speech-to-Text

Overview
Amazon Transcribe
Rating: 7.4 out of 10
Most Used By: N/A
Product Summary: Amazon Transcribe uses a deep learning process called automatic speech recognition (ASR) to convert speech to text quickly and accurately. Amazon Transcribe can be used to transcribe customer service calls, to automate closed captioning and subtitling, and to generate metadata for media assets to create a searchable archive. Amazon Transcribe Medical can be added to provide medical speech-to-text capabilities to clinical documentation applications.
Starting Price: $0 per second

Google Cloud Speech-to-Text
Rating: 8.3 out of 10
Most Used By: N/A
Product Summary: Speech-to-Text on Google Cloud is a tool used to convert speech into text using an API powered by Google’s AI technologies. The vendor states users can transcribe content in real time or from stored files; deliver a better user experience in products through voice commands; and gain insights from customer interactions to improve service.
Starting Price: $0.02 per min
Pricing
Editions & Modules

Amazon Transcribe
  • Custom Language Model: $0.0001 per second
  • Standard Pricing: $0.0004 per second
  • Automatic Content Redaction: $0.0004 per second
  • Transcribe Medical: $0.00125 per second

Google Cloud Speech-to-Text
  • Speech-to-Text V2 API: $0.016 per min
  • Speech-to-Text V1 API: $0.024 per min
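
Because Amazon Transcribe is priced per second and Google Cloud Speech-to-Text per minute, the listed rates are easier to compare on a common unit. The following is a minimal Python sketch that converts the per-second rates above to per-minute rates. The function names (`per_minute_rates`, `cost`) are illustrative, and the sketch assumes flat list prices with no volume tiers, minimum charges, or stacking of add-on rates, none of which the listing specifies.

```python
from decimal import Decimal

# List prices copied from the pricing section above.
PER_SECOND = {
    "Amazon Transcribe: Custom Language Model": Decimal("0.0001"),
    "Amazon Transcribe: Standard Pricing": Decimal("0.0004"),
    "Amazon Transcribe: Automatic Content Redaction": Decimal("0.0004"),
    "Amazon Transcribe: Transcribe Medical": Decimal("0.00125"),
}
PER_MINUTE = {
    "Google Cloud Speech-to-Text: V2 API": Decimal("0.016"),
    "Google Cloud Speech-to-Text: V1 API": Decimal("0.024"),
}


def per_minute_rates():
    """Return every listed rate expressed in dollars per minute."""
    rates = {name: rate * 60 for name, rate in PER_SECOND.items()}
    rates.update(PER_MINUTE)
    return rates


def cost(name, minutes):
    """Flat-rate cost of `minutes` of audio (assumption: no tiers or minimums)."""
    return per_minute_rates()[name] * Decimal(minutes)


if __name__ == "__main__":
    # Print the rates from cheapest to most expensive.
    for name, rate in sorted(per_minute_rates().items(), key=lambda kv: kv[1]):
        print(f"{name}: ${rate:.4f} per min, ${rate * 60:.2f} per hour")
```

On these list prices, Amazon Transcribe's standard rate works out to $0.024 per minute, the same as the Speech-to-Text V1 API rate.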
Offerings
Pricing Offerings
Offering | Amazon Transcribe | Google Cloud Speech-to-Text
Free Trial | No | Yes
Free/Freemium Version | No | Yes
Premium Consulting/Integration Services | No | No
Entry-level Setup Fee | No setup fee | No setup fee

Additional Details (Google Cloud Speech-to-Text)
  • Speech-to-Text V1 API: V1 offers data residency for multi region only. Models include short, long, phone call, and video. V1 does not include audit logging. New customers get $300 in free credits and 60 minutes for transcribing and analyzing audio free per month, not charged against your credits.
  • Speech-to-Text V2 API: V2 offers data residency for multi and single region. Models include short, long, telephony, video, and Chirp. V2 does include audit logging and support for customer managed encryption keys.
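
The free monthly minutes mentioned above change the effective cost at low volumes. As a rough sketch, the hypothetical helper `monthly_cost` below subtracts a free allowance before applying a per-minute rate. It assumes the 60 free minutes are deducted from monthly usage before billing and ignores the $300 credit. How the allowance applies to each API version is not spelled out here, so the numbers are illustrative only.

```python
from decimal import Decimal


def monthly_cost(minutes_used, rate_per_minute, free_minutes=0):
    """Estimate one month's bill at a flat per-minute rate.

    Assumption: free minutes are subtracted first and never carry over.
    Pass rates as strings or Decimals to avoid float rounding.
    """
    billable = max(Decimal(minutes_used) - Decimal(free_minutes), Decimal(0))
    return billable * Decimal(rate_per_minute)


if __name__ == "__main__":
    # 1,000 minutes on the V1 API rate ($0.024 per min) with 60 free minutes.
    v1 = monthly_cost(1000, "0.024", free_minutes=60)
    # 1,000 minutes on Amazon's standard rate ($0.0004 per second), no free tier listed.
    amazon = monthly_cost(1000, Decimal("0.0004") * 60)
    print(f"Speech-to-Text V1: ${v1:.2f}")
    print(f"Amazon Transcribe standard: ${amazon:.2f}")
```

Usage below the allowance yields a zero bill under this assumption, so the gap between the two services is largest, in relative terms, for small workloads.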
Community Pulse
Amazon Transcribe | Google Cloud Speech-to-Text
Considered Both Products
Amazon Transcribe
Chose Amazon Transcribe
I use Google Cloud Speech-to-Text and Amazon Transcribe. What makes Amazon Transcribe better for me is the accuracy of the audio-to-text conversion. I have found that Amazon Transcribe is better at handling homophones, contractions, abbreviations, and acronyms. Another …
Google Cloud Speech-to-Text
Chose Google Cloud Speech-to-Text
Google Cloud Speech-to-Text is more often recommended by clients, and based on our research, we found it to be the best option for our application.
Chose Google Cloud Speech-to-Text
The accuracy of Google Cloud Speech-to-Text is much better than any other tool. It has better API integration with third-party tools. The transcription runs in real time with the best efficiency. It has good support for languages from across the globe. It provides better noise …
Chose Google Cloud Speech-to-Text
Google Cloud Speech-to-Text shows an impressive ROI with increased efficiency, time savings, accuracy, speed, productivity, customer satisfaction, and cost-effectiveness.
Best Alternatives
Company Size | Amazon Transcribe | Google Cloud Speech-to-Text
Small Businesses | Dragon Speech Recognition (8.4 out of 10) | RingCentral Contact Center (8.1 out of 10)
Medium-sized Companies | Dovetail (8.5 out of 10) | Zoom Contact Center (8.8 out of 10)
Enterprises | Verint Speech and Text Analytics (8.4 out of 10) | Verint Speech and Text Analytics (8.4 out of 10)
User Ratings
Rating | Amazon Transcribe | Google Cloud Speech-to-Text
Likelihood to Recommend | 9.4 (6 ratings) | 8.0 (35 ratings)
Usability | - (0 ratings) | 8.0 (16 ratings)
User Testimonials
Amazon Transcribe | Google Cloud Speech-to-Text
Likelihood to Recommend
Amazon AWS
Amazon Transcribe can be an excellent tool for businesses where being able to convert speech or audio to text, in a searchable and reportable form, would be useful. For a call center (inbound or outbound), the ability to have a rich transcription of each call (and being able to search it for keywords) is an incredibly valuable benefit. For business meetings, being able to turn a 60- or 90-minute call into a readable transcript that you or others can search or use as a refresher is a very large time saver that will help you work more efficiently. The software does offer many deeper integrations, such as being able to track script usage (for call centers) or interruptions, deviations, etc., which would be very valuable to a management team and for training purposes.
Google
Google Cloud Speech-to-Text is extremely useful for capturing insights quickly and efficiently. It is pretty accurate and good at handling differences in volume (I am pretty loud, but I've captured insights from quieter co-workers with no issues). However, it does not work offline, so it is hard to use in a remote area or with limited connectivity.
Pros
Amazon AWS
  • It converts live recordings to text with few errors.
  • It has powerful speech recognition models; it transcribes even low-quality audio well.
Google
  • An amazing tool that helps a lot in meetings.
  • It's an efficient tool for improving efficiency by saving a lot of time typing. It saves at least 40-50% of our time, thus increasing efficiency.
  • Incredible accuracy with multiple accents and multiple languages.
  • It takes punctuation into consideration.
Cons
Amazon AWS
  • It was not easy to bring Amazon Transcribe to life, but kudos to the vendor for the free support they offered.
Google
  • The software does occasionally get confused by confusing terminology.
  • Its web-based interface can also feel a tad hard to use compared to more appealing desktop apps.
  • I've experienced the occasional technical issue, though the provider's support team is quick to troubleshoot.
Usability
Amazon AWS
No answers on this topic
Google
The reasoning behind my 10 is that the UI is very intuitive; I didn't require any formal training to use it. Google's speech-to-text is not just a conversion tool; it helps automate mundane tasks, saves time, and has an almost human-like understanding.
Alternatives Considered
Amazon AWS
I use Google Cloud Speech-to-Text and Amazon Transcribe. What makes Amazon Transcribe better for me is the accuracy of the audio-to-text conversion. I have found that Amazon Transcribe is better at handling homophones, contractions, abbreviations, and acronyms. Another feature that makes Amazon Transcribe my No. 1 choice is its use of punctuation marks. I can also feed my own vocabulary list into Amazon Transcribe to get better results.
Google
Google Cloud Speech-to-Text outperformed its competitors significantly in terms of accuracy, surpassing any other product available. Additionally, its support for multiple languages was unrivaled in the market. Moreover, for clients with robust bandwidth, Google Cloud Speech-to-Text offered real-time transcription capabilities, enabling users to transcribe live audio streams with minimal delay.
Return on Investment
Amazon AWS
  • Cost reductions in KYC from automating our video approvals.
  • Better customer service.
  • A good base for building new products and services around speech to text.
Google
  • The ROI is evident in process automation: having a machine perform repetitive tasks instead of a human pays back fairly quickly, and efficient transcription helps ensure these processes are carried out correctly, which improves their efficiency.
Screenshots

Google Cloud Speech-to-Text Screenshots

  • Screenshot of audio transcription creation: Creating an audio transcription with the Speech-to-Text API from within the Cloud Console takes just a few steps. It can transcribe short, long, and streaming audio.
  • Screenshot of creating subtitles for videos using AI: Transcriptions with captions and subtitles can be added to existing content or in real time to streaming content. Google's video transcription model can be used for indexing or subtitling video and/or multispeaker content and uses machine learning technology similar to what YouTube uses for video captioning.
  • Screenshot of adding Speech-to-Text to apps: The video covers how to add AI to an application without extensive machine learning model experience. The pretrained Speech-to-Text API lets users enable AI for applications.
  • Screenshot of Language, speech, text, and translation with Google Cloud API: The picture shows a section of a Google training course, where learners use the Speech-to-Text API to transcribe an audio file into a text file, translate with the Google Cloud Translation API, and create synthetic speech with Natural Language AI.