Prerecorded

The REST API provides speech-to-text transcription for complete audio files. You upload an audio file and receive the full transcript in the response.

How It Works

  1. Authenticate — Include an authentication header in your request.
  2. Configure — Use query parameters to set the audio encoding and which fields to include in the response.
  3. Upload audio — Send the audio file as the request body with Content-Type: application/octet-stream.
  4. Receive transcript — The server processes the entire file and returns the transcript as JSON.

See the API reference for full details on fields and error codes.
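
As a rough illustration, the four steps above might look like the following in Python, using only the standard library. This is a minimal sketch under stated assumptions: the endpoint URL, the `Authorization: Bearer` header scheme, and the function names `build_request` and `transcribe` are hypothetical placeholders, not values taken from this page. Only the `encoding` query parameter and the `application/octet-stream` content type come from the description above; check the API reference for the real endpoint and authentication header.

```python
import json
import urllib.parse
import urllib.request
from typing import Optional

# Hypothetical placeholder: the real endpoint is listed in the API reference.
API_URL = "https://api.example.com/v1/transcribe"


def build_request(audio: bytes, api_key: str,
                  params: Optional[dict] = None) -> urllib.request.Request:
    """Build (but do not send) a transcription request."""
    # Step 2: configure via query parameters; "auto" is the documented default.
    query = urllib.parse.urlencode(params or {"encoding": "auto"})
    headers = {
        # Step 1: authentication header. The Bearer scheme is an assumption.
        "Authorization": f"Bearer {api_key}",
        # Step 3: the audio file is sent as the raw request body.
        "Content-Type": "application/octet-stream",
    }
    return urllib.request.Request(
        f"{API_URL}?{query}", data=audio, headers=headers, method="POST"
    )


def transcribe(path: str, api_key: str) -> dict:
    """Upload a complete audio file and return the parsed JSON transcript."""
    with open(path, "rb") as f:
        audio = f.read()
    # Step 4: the server processes the whole file and responds with JSON.
    with urllib.request.urlopen(build_request(audio, api_key)) as resp:
        return json.load(resp)
```

Separating `build_request` from `transcribe` keeps the request construction easy to inspect without making a network call.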

Audio Format Detection

When the encoding query parameter is set to auto (the default), the server detects the audio format by reading the container headers at the start of the data. Common container formats such as WAV, OGG, FLAC, and WebM are supported. Raw PCM audio has no container headers for the server to read, so when sending it, set the encoding parameter explicitly (e.g., pcm_s16le) and also provide sample_rate and channels.
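
To make the distinction concrete, a client could inspect the first bytes of its audio data to decide whether auto is likely to work or whether explicit PCM parameters are needed. The sketch below is a client-side heuristic only and does not claim to mirror the server's detection logic. The helper names `sniff_container` and `encoding_params` are hypothetical. The magic bytes are the standard signatures for each container, and the parameter names (`encoding`, `sample_rate`, `channels`) and the `pcm_s16le` value come from the paragraph above.

```python
from typing import Optional


def sniff_container(data: bytes) -> Optional[str]:
    """Return a container name if the data starts with a known signature."""
    if data[:4] == b"RIFF" and data[8:12] == b"WAVE":
        return "wav"
    if data[:4] == b"OggS":
        return "ogg"
    if data[:4] == b"fLaC":
        return "flac"
    if data[:4] == b"\x1a\x45\xdf\xa3":  # EBML header (Matroska/WebM)
        return "webm"
    return None


def encoding_params(data: bytes, sample_rate: Optional[int] = None,
                    channels: Optional[int] = None) -> dict:
    """Choose query parameters: auto for containers, explicit for raw PCM."""
    if sniff_container(data) is not None:
        return {"encoding": "auto"}
    # No recognizable header: assume raw 16-bit little-endian PCM, which
    # needs the sample rate and channel count stated explicitly.
    if sample_rate is None or channels is None:
        raise ValueError("raw PCM requires sample_rate and channels")
    return {"encoding": "pcm_s16le", "sample_rate": sample_rate,
            "channels": channels}
```

For example, `encoding_params(raw_bytes, sample_rate=16000, channels=1)` yields explicit PCM parameters for headerless data, while data that starts with a recognized signature yields `{"encoding": "auto"}`.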