Request
This endpoint expects a multipart form containing a file.
modelstringRequired
The transcription model to use.
filefileRequired
The audio file to upload.
languagestringOptional
The language the audio is in.
promptstringOptional
An optional text to guide the model's style or continue a previous audio segment.
temperaturedoubleOptional
The temperature parameter for controlling randomness in transcription. Supports a range of 0.0-2.0.
timestamps_granularities[]string or list of anyOptional
Sets whether timestamps are returned and at what granularity.
diarizationbooleanOptional
Whether to diarize the audio and return speaker turns. Not currently supported in multi-tenant environments.
response_formatstringOptional
The format for the response object. Defaults to “json” and must be set to “verbose_json” when using diarization or timestamp granularities.
Response
Successful response.
textstring
The transcribed audio.
taskstring
The task used in the request.
languagestring
The language of the audio file.
durationdouble
The duration of the audio file in seconds.
segmentslist of objects
An array containing objects with segment level data.
wordslist of objects
An array containing objects with word level data.