Transcribe - From audio file asynchronously
POST /transcribe-async
Generate a transcript from an audio file. Only audio/* MIME types are supported. The maximum duration is 60 minutes.
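A client may want to reject obviously unsuitable files before submitting them. The following is a minimal pre-flight sketch, assuming the caller knows the file name and its duration. The helper name is_acceptable_audio is hypothetical and not part of the API, and the extension-based MIME guess is only a heuristic: the service may apply its own checks to the actual content type.

```python
import mimetypes

MAX_DURATION_SECONDS = 60 * 60  # documented limit: 60 minutes


def is_acceptable_audio(filename: str, duration_seconds: float) -> bool:
    """Hypothetical client-side check; not part of the API itself."""
    # Guess the MIME type from the file extension (heuristic only).
    mime_type, _ = mimetypes.guess_type(filename)
    if mime_type is None or not mime_type.startswith("audio/"):
        return False
    # Reject empty recordings and anything longer than the documented maximum.
    return 0 < duration_seconds <= MAX_DURATION_SECONDS
```

For example, is_acceptable_audio("visit.mp3", 120) passes, while a .txt file or a 61-minute recording does not.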
Request
- application/json
Body
required
Possible values: [ENGLISH_US, ENGLISH_UK, SPANISH_ES, SPANISH_MX, FRENCH_FR, ARABIC_EG, ARABIC_LB, ARABIC_MA, ARABIC_SA, ARMENIAN_AM, BENGALI_IN, CANTONESE_CN, CROATIAN_HR, FILIPINO_PH, GERMAN_DE, GREEK_GR, GUJARATI_IN, HEBREW_IL, HINDI_IN, ITALIAN_IT, JAPANESE_JP, KHMER_KH, KOREAN_KR, MANDARIN_CN, PERSIAN_IR, POLISH_PL, PORTUGUESE_PT, PUNJABI_IN, RUSSIAN_RU, SERBIAN_RS, TAMIL_IN, TELUGU_IN, THAI_TH, URDU_IN, VIETNAMESE_VN]
The spoken or written locale of the transcript, representing both the language and its specific regional variant.
The URL of the audio file to download and transcribe.
Indicates whether to segment transcription results at sentence boundaries. Default is false, meaning that a single transcript item may encompass multiple sentences, provided they are not delineated by pauses (silence) in the audio.
Possible values: <= 128 characters
An optional unique client-made id for this request. When specified, this id is included in the JSON payload that will be sent asynchronously. This can help clients both guarantee idempotency and correlate asynchronous responses with their requests.
Note: this id must be different for every request. It is recommended to use a randomly generated UUID.
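The body fields described above could be assembled as in the sketch below. The JSON key names used here (locale, audio_url, split_by_sentence, request_id) and the function build_request_body are assumptions for illustration only, since this page does not list the field names; check them against the actual schema before use. The locale list and the 128-character limit are taken from the descriptions above.

```python
import uuid

# Locale values as documented above.
SUPPORTED_LOCALES = frozenset("""
ENGLISH_US ENGLISH_UK SPANISH_ES SPANISH_MX FRENCH_FR ARABIC_EG ARABIC_LB
ARABIC_MA ARABIC_SA ARMENIAN_AM BENGALI_IN CANTONESE_CN CROATIAN_HR
FILIPINO_PH GERMAN_DE GREEK_GR GUJARATI_IN HEBREW_IL HINDI_IN ITALIAN_IT
JAPANESE_JP KHMER_KH KOREAN_KR MANDARIN_CN PERSIAN_IR POLISH_PL
PORTUGUESE_PT PUNJABI_IN RUSSIAN_RU SERBIAN_RS TAMIL_IN TELUGU_IN THAI_TH
URDU_IN VIETNAMESE_VN
""".split())

MAX_REQUEST_ID_LENGTH = 128  # documented limit


def build_request_body(audio_url: str, locale: str,
                       split_by_sentence: bool = False) -> dict:
    """Hypothetical helper; all JSON key names below are assumed."""
    if locale not in SUPPORTED_LOCALES:
        raise ValueError(f"unsupported locale: {locale}")
    # A fresh random UUID per request, as the note above recommends.
    request_id = str(uuid.uuid4())
    if len(request_id) > MAX_REQUEST_ID_LENGTH:
        raise ValueError("request id exceeds 128 characters")
    return {
        "locale": locale,                        # assumed key name
        "audio_url": audio_url,                  # assumed key name
        "split_by_sentence": split_by_sentence,  # assumed key name
        "request_id": request_id,                # assumed key name
    }
```

Keeping the generated id on the client side lets it be matched later against the id echoed in the asynchronous payload.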
Responses
- 200
Request received and processing started.
- application/json
Schema
- ONGOING
- FAILED
- SUCCEEDED
The id of the asynchronous request.
Possible values: <= 128 characters
An optional unique client-made id for this request. When specified, this id is included in the JSON payload that will be sent asynchronously. This can help clients both guarantee idempotency and correlate asynchronous responses with their requests.
Note: this id must be different for every request. It is recommended to use a randomly generated UUID.
Possible values: [ONGOING, FAILED, SUCCEEDED]
The id of the asynchronous request.
Possible values: <= 128 characters
An optional unique client-made id for this request. When specified, this id is included in the JSON payload that will be sent asynchronously. This can help clients both guarantee idempotency and correlate asynchronous responses with their requests.
Note: this id must be different for every request. It is recommended to use a randomly generated UUID.
Possible values: [ONGOING, FAILED, SUCCEEDED]
payload object required
The id of the asynchronous request.
Possible values: <= 128 characters
An optional unique client-made id for this request. When specified, this id is included in the JSON payload that will be sent asynchronously. This can help clients both guarantee idempotency and correlate asynchronous responses with their requests.
Note: this id must be different for every request. It is recommended to use a randomly generated UUID.
Possible values: [ONGOING, FAILED, SUCCEEDED]
payload object
The generated transcript.
transcript object[] required
Transcript items from the audio file.
The transcribed text.
Possible values: [DOCTOR, PATIENT, UNSPECIFIED]
Who said the text in this transcript item.
Start time of this transcription item as the offset, in milliseconds, from the start of the audio file.
End time of this transcription item as the offset, in milliseconds, from the start of the audio file. Equals the start_time_ms plus the duration of the related transcribed audio portion.
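A client receiving this response might branch on the status and, once it is SUCCEEDED, walk the transcript items. The sketch below assumes a decoded JSON dict; payload, transcript and start_time_ms appear on this page, while the keys status, text, speaker and end_time_ms, and the function summarize_response itself, are assumed names used for illustration only.

```python
def summarize_response(response: dict) -> list:
    """Hypothetical helper: one formatted line per transcript item."""
    status = response["status"]  # assumed key name
    if status == "ONGOING":
        return []  # nothing to show yet; check again later
    if status == "FAILED":
        raise RuntimeError("transcription failed")
    if status != "SUCCEEDED":
        raise ValueError(f"unexpected status: {status}")

    lines = []
    for item in response["payload"]["transcript"]:
        start_ms = item["start_time_ms"]
        # Per the description above, end = start + duration of the audio portion.
        duration_ms = item["end_time_ms"] - start_ms  # end_time_ms: assumed key
        # speaker and text are assumed key names.
        lines.append(
            f"[{start_ms} ms, +{duration_ms} ms] {item['speaker']}: {item['text']}"
        )
    return lines
```

Subtracting the start offset from the end offset recovers each item's duration, which is handy for aligning the text with the audio.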