AI Services
AI Services are a suite of intelligence APIs for common AI workloads. You can use these APIs to access flexible AI-powered features that process media data and create customized solutions.
Scribe API
Scribe API provides speech-to-text transcription for audio. It supports fast synchronous transcription and large-scale batch processing.
This API enables:
-
Faster processing
Transcribes up to 1,000 audio files in a single job
-
Smart formatting
Automatically adds timestamps, punctuation, and speaker separation
-
Common audio formats
Transcribes WAV, M4A, MP3, and MP4 files
Summarizer API
Summarizer API offers conversation and meeting summarization for transcripts from any system. It supports fast synchronous summarization and large-scale batch processing.
This API enables:
-
Conversation-focused summarization
Summarizes meeting, call, chat, and other dialogue-based transcript content
-
Structured outputs
Returns recap, action item, summary, or full summary results for workflows and automation
-
Flexible processing
Supports inline text requests and storage-based batch jobs for transcript files
Translator API
Translator API delivers machine translation for text across a broad range of languages. It supports fast synchronous translation and large-scale batch processing.
This API enables:
-
Fast translation
Translates plain text in a single synchronous API call
-
Batch processing
Processes multiple files asynchronously for large-scale translation workloads
-
Broad language support
Translates across 9 supported languages
Coming soon
AI Services with additional capabilities across speech, vision, and text in future releases.