Audio Transcription

This endpoint allows you to transcribe audio.

This endpoint expects a multipart form containing a file.

modelstringRequired

The transcription model to use.

filefileRequired

The audio file to upload.

languagestringRequired

The language the audio is in.

promptstringRequired

An optional text to guide the model's style or continue a previous audio segment.

temperaturedoubleRequired

The temperature parameter for controlling randomness in transcription.

timestamps_granularities[]stringRequired

Sets whether timestamps are returned and at what granularity. Not currently supported in multi-tenant environments.

diarizationbooleanRequired

Whether to diarize the audio and return speaker turns. Not currently supported in multi-tenant environments.

response_formatstringRequired

The format for the response object. Defaults to “json” and must be set to “verbose_json” when using diarization or timestamp granularities.

Successful response.

textstring or null

The transcribed audio.

taskstring or null

The task used in the request

languagestring or null

The language of the audio file

1	curl -X POST https://api.predictionguard.com/audio/transcriptions \
2	-H "Toxicity: false" \
3	-H "Pii: " \
4	-H "Replace-Method: " \
5	-H "Injection: false" \
6	-H "Authorization: Bearer <token>" \
7	-H "Content-Type: multipart/form-data" \
8	-F model="base" \
9	-F [email protected]