Transcribe or Translate

1 var client = new RestClient("https://api.reka.ai/v1/transcription_or_translation"); 2 var request = new RestRequest(Method.POST); 3 request.AddHeader("X-Api-Key", "<apiKey>"); 4 request.AddHeader("Content-Type", "application/json"); 5 request.AddParameter("application/json", "{\n \"audio_url\": \"data:audio/wav;base64,<base64_encoded_audio>\",\n \"sampling_rate\": 16000,\n \"temperature\": 0,\n \"max_tokens\": 1024\n}", ParameterType.RequestBody); 6 IRestResponse response = client.Execute(request);

1 { 2 "transcript": "Example transcribed text from the audio", 3 "transcript_translation_with_timestamp": [ 4 { 5 "start": 0, 6 "end": 0.5, 7 "transcript": "Example" 8 }, 9 { 10 "start": 0.5, 11 "end": 1.2, 12 "transcript": "transcribed" 13 } 14 ] 15 }

Transcribe audio to text, translate speech to another language, or generate translated audio output.

This endpoint supports multiple modes:

Transcription only: Convert speech to text with optional word-level timestamps
Translation: Translate audio from one language to another
Speech-to-speech translation: Generate translated audio output

Transcribe audio to text, translate speech to another language, or generate translated audio output. This endpoint supports multiple modes: - Transcription only: Convert speech to text with optional word-level timestamps - Translation: Translate audio from one language to another - Speech-to-speech translation: Generate translated audio output

Authentication

X-Api-Keystring

API key for authentication

Request

This endpoint expects an object.

audio_urlstringRequired

URL to the audio file or base64-encoded audio as data URI (data:audio/wav;base64,…)

sampling_rateintegerRequiredDefaults to 16000

Audio sampling rate in Hz

target_languageenumOptional

Target language for translation

is_translatebooleanOptionalDefaults to false

Set to true to indicate translation request

return_translation_audiobooleanOptionalDefaults to false

If true, returns base64-encoded audio of the translated speech

temperaturedoubleOptional

Controls randomness in generation. Use 0.0 for deterministic output

max_tokensintegerOptional>=1Defaults to 1024

Maximum number of tokens to generate

Response

Successful transcription or translation

transcriptstring or null

Transcribed text in the original language

translationstring or null

Translated text in the target language (only if target_language is specified)

transcript_translation_with_timestamplist of objects or null

Word-level timestamps

audio_base64string or null

Base64-encoded WAV audio of the translated speech (only if return_translation_audio is true)

Authentication

Request

Response

Errors