Audio Transcription
This endpoint allows you to transcribe audio.
Headers
Authorization
Bearer authentication of the form Bearer <token>
, where token is your auth token.
Toxicity
Whether to check the output for toxicity.
Pii
Whether to check the output for Pii, and what to do if found. Supported values are 'replace' and 'block'.
Replace-Method
Method to replace any found PII. Supported values are 'category', 'fake', 'mask', and 'random'.
Injection
Whether to check the output for a prompt injection.
Request
This endpoint expects a multipart form containing a file.
model
The transcription model to use.
file
The audio file to upload.
language
The language the audio is in.
prompt
An optional text to guide the model's style or continue a previous audio segment.
temperature
The temperature parameter for controlling randomness in transcription.
timestamps_granularities[]
Sets whether timestamps are returned and at what granularity. Not currently supported in multi-tenant environments.
diarization
Whether to diarize the audio and return speaker turns. Not currently supported in multi-tenant environments.
response_format
The format for the response object. Defaults to “json” and must be set to “verbose_json” when using diarization or timestamp granularities.
Response
Successful response.
text
The transcribed audio.
task
The task used in the request
language
The language of the audio file