Voice API
Szenarios (Non-Streaming) | Request data | Response data |
---|---|---|
Full voice request Audio + text_data → Audio + text_data | Mandatory:
Optional:
| Mandatory:
Optional:
|
Speech-to-text (STT) Audio (+ text_data) → text_data | Mandatory:
Optional:
| Mandatory:
Optional:
|
LLM-Response text_data → text_data | Mandatory:
Optional:
| Mandatory:
Optional:
|
Text-to-speech (TTS) text_data → Audio | Mandatory:
| Mandatory:
|
Data-Types:
Audio-type: .wav/.mp3
Language: Enumeration with a predefined list of available languages
Response-voice id/name: Enumeration with a predefined list of available voices
LLM-Model: Enumeration with a predefined list of available models
Temperature: float between and including 0 and 1