TrustRadius: an HG Insights company

What is ReadSpeaker speechCloud API?

The ReadSpeaker speechCloud API is a cloud-based text-to-speech service from ReadSpeaker. According to the vendor, the API is designed to speech-enable desktop, web, and mobile applications as well as Internet-connected devices. It is aimed at companies of all sizes across a range of industries, including app developers, device manufacturers, e-learning platforms, call center solution providers, and accessibility software providers.

Key Features

Dictionary: According to the vendor, the ReadSpeaker speechCloud API includes a built-in, customer-specific dictionary that gives customers control over how specific words are pronounced and read.

Multiple audio formats: The API supports several audio formats and encodings, including A-law, u-law, PCM, WAV, Ogg, and MP3, so developers can choose the one that best suits their application.

Languages: According to the vendor, the ReadSpeaker speechCloud API offers a wide range of languages and voices using state-of-the-art speech synthesis, catering to global applications.

Easy to use: The API comes with sample code in several programming languages, including Java (Android), Objective-C (iOS), PHP, ASP, and Flash/ActionScript, to simplify integration.

Statistics: According to the vendor, users can access a statistics interface via the web and the API to track and analyze usage data and performance metrics.

User experience: The API includes an easy-to-use web interface that contains everything needed to get started and to manage and configure speech generation.

Linux: According to the vendor, the ReadSpeaker speechCloud API provides an AGI script that supports most versions of Asterisk on Linux, simplifying integration with the open-source Asterisk communications platform.

SSML control: The API accepts SSML input, giving users more control over how text is read. SSML markup can insert pauses or breaks, supply phonetic transcriptions, and switch voice or language within the text.
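
To make the SSML features above concrete, the following is a minimal sketch of an SSML document that uses a pause, a phonetic transcription, and a language switch. It relies only on elements defined in the W3C SSML 1.1 standard; which elements and attributes the speechCloud API actually accepts is not stated here, so treat the markup as an assumption to check against the vendor documentation. The function name `build_ssml` is hypothetical, and the sketch deliberately does not show how the document would be submitted to the API.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the W3C SSML specification.
SSML_NS = "http://www.w3.org/2001/10/synthesis"


def build_ssml() -> str:
    """Return a small SSML document (hypothetical helper, not a vendor API)."""
    ssml = (
        f'<speak version="1.1" xmlns="{SSML_NS}" xml:lang="en-US">'
        "Welcome to the demo."
        # Insert a half-second pause.
        '<break time="500ms"/>'
        # Supply a phonetic transcription (IPA) for one word.
        'The word <phoneme alphabet="ipa" ph="təˈmɑːtoʊ">tomato</phoneme> '
        "uses a phonetic transcription. "
        # Switch language for part of the text (SSML 1.1 <lang> element).
        '<lang xml:lang="fr-FR">Bonjour tout le monde.</lang>'
        "</speak>"
    )
    # Parsing raises xml.etree.ElementTree.ParseError if the markup
    # is not well-formed XML, which catches simple typos early.
    ET.fromstring(ssml)
    return ssml


if __name__ == "__main__":
    print(build_ssml())
```

Checking that the markup is well-formed before sending it is a cheap safeguard, since a malformed SSML request would typically be rejected by a speech service.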

Online Payment: The API provides an online payment system for purchasing API credits for default voices.

Request Timing Information: According to the vendor, users can request timing information through the API, which makes it possible to synchronize text highlighting with the generated speech.
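
As an illustration of how timing information can drive highlighting, the sketch below looks up which word should be highlighted at a given playback position. The format in which the speechCloud API returns timing data is not described here, so the `WordTiming` structure (start time in milliseconds plus a character offset and length into the text) and the sample values are assumptions made for the example, not the vendor's format.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class WordTiming:
    """Hypothetical timing record: when a word starts and where it sits in the text."""
    start_ms: int
    text_offset: int
    length: int


def word_at(timings: Sequence[WordTiming], position_ms: int) -> Optional[WordTiming]:
    """Return the word being spoken at position_ms, or None before the first word.

    Assumes timings are sorted by start_ms.
    """
    starts = [t.start_ms for t in timings]
    # Index of the last word whose start time is <= position_ms.
    i = bisect_right(starts, position_ms) - 1
    return timings[i] if i >= 0 else None


def highlight(text: str, timing: Optional[WordTiming]) -> str:
    """Mark the current word with square brackets (stand-in for real UI highlighting)."""
    if timing is None:
        return text
    start = timing.text_offset
    end = start + timing.length
    return text[:start] + "[" + text[start:end] + "]" + text[end:]


if __name__ == "__main__":
    text = "Hello brave new world"
    # Illustrative values only, not real API output.
    timings = [
        WordTiming(0, 0, 5),
        WordTiming(400, 6, 5),
        WordTiming(800, 12, 3),
        WordTiming(1100, 16, 5),
    ]
    print(highlight(text, word_at(timings, 900)))  # Hello brave [new] world
```

In a real player, `word_at` would be called from the audio element's time-update callback, with the bracket marking replaced by whatever styling the interface uses.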

Categories & Use Cases