AI Speech to Text

Oracle Cloud Infrastructure (OCI) Speech is an AI service that applies ASR (Automatic Speech Recognition) technology to transform audio-based content to text. Developers can easily make API calls to integrate OCI Speech’s pretrained models into their applications. OCI Speech can be used for accurate, text-normalized, time-stamped transcription via the Console and REST APIs as well as CLI or SDKs. You can also use OCI Speech in a Data Science notebook session. With OCI Speech, you can filter profanities, get confidence scores for both single words and complete transcriptions, and more.

OCI Speech features

Prebuilt acoustic and language models

OCI Speech uses automatic speech recognition, a deep learning process, to derive accurate transcription from natural conversations. Get started easily by using prebuilt acoustic and language models that don’t require users to have data science experience.

Analyze data from audio and video files

Search, index, and decipher data buried in your audio files. Convert recorded audio conversations to text that you can analyze with other AI services. For example, you can use OCI Language to retrieve the sentiment and OCI Anomaly Detection to identify chances of customer churn.

Native multilingual support

OCI Speech ASR models support English, Spanish, and Portuguese, so you can transcribe your audio files in your preferred languages.

Integrated transcription service

Eliminate reliance on third-party transcription offerings and exercise more control over your data with end-to-end security and compliance.

Easy to integrate

OCI Speech is a versatile service that can be called via REST APIs, the SDKs, and the Oracle CLI. Developers can easily deploy a scalable speech service without having data science or ML expertise.

Purpose-built for security and privacy

Oracle Cloud Infrastructure Speech protects our customers’ privacy. Prebuilt automatic speech recognition models transcribe your content, but do not store any data for training, debugging, or other purposes.

Integrated transcription service

OCI Speech uses proprietary models and an architecture that enable fast conversion of speech into text.

Confidence score per word

We added a word-level confidence score to help you identify words that might have been transcribed incorrectly. Use the word confidence score to determine where to focus when building an application.
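
As a rough illustration of how an application might use word-level confidence, the sketch below flags words that fall under a chosen threshold so they can be reviewed. The token shape (a list of dictionaries with `token` and `confidence` keys), the helper name `low_confidence_words`, and the 0.8 threshold are assumptions made for this example, not a description of the service's documented output.

```python
from typing import Dict, List


def low_confidence_words(tokens: List[Dict], threshold: float = 0.8) -> List[str]:
    """Return the words whose confidence is below the threshold.

    Assumes each token is a dict with a "token" string and a "confidence"
    value between 0 and 1 (hypothetical field names for this sketch).
    """
    return [t["token"] for t in tokens if float(t["confidence"]) < threshold]


# Hand-written sample data, not real service output.
sample_tokens = [
    {"token": "please", "confidence": 0.98},
    {"token": "cancel", "confidence": 0.95},
    {"token": "my", "confidence": 0.99},
    {"token": "subscription", "confidence": 0.62},
]

flagged = low_confidence_words(sample_tokens)
print(flagged)  # ['subscription']
```

A lower threshold flags fewer words; the right value depends on how costly a missed transcription error is for the application.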

Profanity filters

We added prebuilt word filtering using a curated list of profanities. You can mask, remove, or tag profanities.
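
To make the difference between the three modes concrete, here is a minimal sketch of masking, removing, and tagging applied to a list of words. It is not the service's implementation: the function `filter_profanity`, the asterisk mask, and the square-bracket tag format are all assumptions chosen for illustration, and the blocklist is a harmless stand-in.

```python
from typing import Iterable, List

VALID_MODES = {"mask", "remove", "tag"}


def filter_profanity(words: List[str], blocklist: Iterable[str], mode: str = "mask") -> List[str]:
    """Apply one of three illustrative filter modes to blocked words."""
    if mode not in VALID_MODES:
        raise ValueError(f"unknown mode: {mode}")
    blocked = {w.lower() for w in blocklist}
    result = []
    for word in words:
        if word.lower() not in blocked:
            result.append(word)             # clean word, keep unchanged
        elif mode == "mask":
            result.append("*" * len(word))  # replace every character
        elif mode == "tag":
            result.append(f"[{word}]")      # keep the word but mark it
        # mode == "remove": drop the word entirely
    return result


words = ["that", "darn", "printer"]
blocklist = ["darn"]  # stand-in for a curated list

print(filter_profanity(words, blocklist, "mask"))    # ['that', '****', 'printer']
print(filter_profanity(words, blocklist, "remove"))  # ['that', 'printer']
print(filter_profanity(words, blocklist, "tag"))     # ['that', '[darn]', 'printer']
```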

OCI Speech use cases

Customer feedback analytics

Digital Media content search and closed captions

Automatically provide in-workflow closed captions on the OCI platform for all content created and curated by a digital media service. Index your content using OCI Speech for easy search across it.

Call centers, call analytics

OCI Speech can transcribe customer calls for easy search and retrieval of information. Use OCI Language and OCI Anomaly Detection together to detect sentiment and identify customer churn and staff training opportunities.

OCI Speech resources

Documentation

We offer a wide range of documentation for the OCI Speech service. Learn how to create transcription jobs, use developer tools, and more.

View our documentation

FAQ

Get your questions about the OCI Speech service answered via the link below.

View the FAQ

April 27, 2022

Punctuation, Closed Captions and 8kHz models are now available for OCI Speech.

Guy Michaeli, Senior Principal Product Manager

Today, we are happy to announce three new capabilities for the Speech service at no additional cost: native support for 8kHz audio files, support for output in SRT (a closed-caption file format), and automatic punctuation of output text. These new capabilities are now available in all of OCI's commercial regions and are part of our commitment to provide high-quality, affordable transcription for our customers.
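
For readers unfamiliar with SRT, the sketch below shows the general shape of the format: a running cue number, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` time range, the caption text, and a blank line between cues. The helper names `srt_timestamp` and `to_srt` and the sample cues are hypothetical; the service produces SRT output itself, so this only illustrates what such a file contains.

```python
from typing import List, Tuple


def srt_timestamp(seconds: float) -> str:
    """Format a time offset in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(cues: List[Tuple[float, float, str]]) -> str:
    """Render (start_seconds, end_seconds, text) cues as SRT text."""
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
        )
    return "\n".join(blocks)  # one blank line between consecutive cues


# Hand-written sample cues, not real transcription output.
cues = [(0.0, 1.5, "Hello."), (1.5, 3.0, "How can I help?")]
print(to_srt(cues))
```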


Featured OCI Speech blogs

March 12, 2022 Oracle Cloud Infrastructure Speech GA announcement
November 3, 2021 Easily add automatic speech recognition to your apps


OCI Speech related products

OCI Language

Artificial intelligence and machine learning capabilities to detect languages and provide sentiment analysis in your unstructured text.

Read about OCI Language

OCI Anomaly Detection

Incorporate custom-trained, business-specific anomaly detection models into applications.

Learn about OCI Anomaly Detection

Oracle Digital Assistant

Build conversational interfaces for your applications.

Learn about Oracle Digital Assistant

Get started with OCI Speech

Oracle Cloud Free Tier

Build, test, and deploy applications on Oracle Cloud—for free.

Try Oracle Cloud Free Tier

FAQ

Answers to all your questions on OCI Speech.

Read the FAQ
