...
Scenario (non-streaming) | Description/Notes | Request data | Response data |
---|---|---|---|
Full voice request | Audio + text_data → Audio + text_data | Mandatory:<br>Optional: | Mandatory:<br>Optional: |
Speech-to-text (STT) | Audio (+ text_data) → text_data | Mandatory:<br>Optional: | Mandatory:<br>Optional: |
LLM response | text_data → text_data | Mandatory:<br>Optional: | Mandatory:<br>Optional: |
Text-to-speech (TTS) | text_data → Audio | Mandatory: | Mandatory: |

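
The four request/response flows in the table can be written down as data, which makes it easy to check that a request carries the inputs its scenario expects. The following is only a minimal sketch: the scenario keys (`full_voice`, `stt`, `llm`, `tts`), the `Flow` type, and `check_request` are hypothetical names, and it assumes that the parenthesised "(+ text_data)" in the STT row means the text data is optional there.

```python
from typing import NamedTuple


class Flow(NamedTuple):
    """Input and output kinds of one scenario (hypothetical helper type)."""
    required_in: frozenset
    optional_in: frozenset
    out: frozenset


# Flows copied from the scenario table; the dictionary keys are illustrative.
SCENARIOS = {
    "full_voice": Flow(frozenset({"audio", "text_data"}), frozenset(),
                       frozenset({"audio", "text_data"})),
    # Assumption: "(+ text_data)" marks text_data as optional input for STT.
    "stt": Flow(frozenset({"audio"}), frozenset({"text_data"}),
                frozenset({"text_data"})),
    "llm": Flow(frozenset({"text_data"}), frozenset(),
                frozenset({"text_data"})),
    "tts": Flow(frozenset({"text_data"}), frozenset(),
                frozenset({"audio"})),
}


def check_request(scenario: str, provided: set) -> list:
    """Return a list of problems; an empty list means the inputs fit the flow."""
    flow = SCENARIOS[scenario]
    problems = []
    missing = flow.required_in - provided
    unexpected = provided - flow.required_in - flow.optional_in
    if missing:
        problems.append(f"missing: {sorted(missing)}")
    if unexpected:
        problems.append(f"unexpected: {sorted(unexpected)}")
    return problems
```

For example, `check_request("stt", {"audio"})` returns an empty list, while sending only audio to the TTS scenario reports both the missing text data and the unexpected audio.
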
Data types:
- Audio type: .wav or .mp3
- Language: enumeration with a predefined list of available languages
- Response voice ID/name: enumeration with a predefined list of available voices
- LLM model: enumeration with a predefined list of available models
- Temperature: float between 0 and 1, inclusive
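
As an illustration of how these data types might be enforced on the client or server side, here is a minimal sketch. The enumeration members (`EN`, `DE`, `VOICE_A`, `MODEL_A`), the `RequestOptions` class, and its field names are all placeholders, since the actual lists of languages, voices, and models are not given here; only the allowed audio suffixes and the temperature range come from the list above.

```python
from dataclasses import dataclass
from enum import Enum
from pathlib import PurePath


class Language(Enum):
    # Placeholder members; the real predefined list is not specified here.
    EN = "en"
    DE = "de"


class Voice(Enum):
    VOICE_A = "voice_a"  # placeholder


class LlmModel(Enum):
    MODEL_A = "model_a"  # placeholder


ALLOWED_AUDIO_SUFFIXES = {".wav", ".mp3"}


@dataclass(frozen=True)
class RequestOptions:
    """Hypothetical container for the typed fields of a request."""
    audio_filename: str
    language: Language
    voice: Voice
    model: LlmModel
    temperature: float

    def __post_init__(self):
        # Audio type: only .wav and .mp3 are accepted (case-insensitive here).
        suffix = PurePath(self.audio_filename).suffix.lower()
        if suffix not in ALLOWED_AUDIO_SUFFIXES:
            raise ValueError(f"unsupported audio type: {suffix!r}")
        # Temperature: closed interval, so 0 and 1 themselves are valid.
        if not 0.0 <= self.temperature <= 1.0:
            raise ValueError("temperature must be between 0 and 1, inclusive")
```

Modelling the enumerations with `Enum` means an unknown value such as `Language("xx")` fails with a `ValueError` at parse time instead of reaching the backend.
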