Voice API

Scenarios (non-streaming)	Request data	Response data
Full voice request	Mandatory: audio recording of the user's voice; response voice ID/name; personality/prompt. Optional: LLM model to use (GPT3.5, GPT4, GPT4-turbo, Mistral-7B,…); audio type (.wav/.mp3); language; LLM temperature.	Mandatory: response audio data; response transcription (text). Optional: detected language.
Speech-to-text (STT)	Mandatory: audio recording of the user's voice. Optional: source audio type (.wav/.mp3); language.	Mandatory: response transcription (text). Optional: detected language.
LLM response	Mandatory: personality/prompt; request text. Optional: LLM model to use (GPT3.5, GPT4, GPT4-turbo, Mistral-7B,…); language; LLM temperature.	Mandatory: response text. Optional: detected language.
Text-to-speech (TTS)	Mandatory: text to transform; response voice ID/name.	Mandatory: response audio data.
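
The mandatory/optional split in the table can be expressed as a small request check. The following is a minimal sketch only: the scenario keys (full_voice, stt, llm, tts), the field names (audio, voice, prompt, text, model, audio_type, language, temperature) and the function check_request are hypothetical names chosen for illustration, since the table does not define a wire format.

```python
from typing import Dict, List, Set

# Mandatory request fields per scenario, taken from the table above.
# All keys and field names are illustrative assumptions.
MANDATORY: Dict[str, Set[str]] = {
    "full_voice": {"audio", "voice", "prompt"},
    "stt": {"audio"},
    "llm": {"prompt", "text"},
    "tts": {"text", "voice"},
}

# Optional request fields per scenario, also from the table.
OPTIONAL: Dict[str, Set[str]] = {
    "full_voice": {"model", "audio_type", "language", "temperature"},
    "stt": {"audio_type", "language"},
    "llm": {"model", "language", "temperature"},
    "tts": set(),
}


def check_request(scenario: str, payload: dict) -> List[str]:
    """Return a list of problems; an empty list means the payload passes."""
    problems = []
    mandatory = MANDATORY[scenario]
    allowed = mandatory | OPTIONAL[scenario]

    # Every mandatory field must be present and non-empty.
    for name in sorted(mandatory):
        if payload.get(name) in (None, "", b""):
            problems.append(f"missing mandatory field: {name}")

    # Fields the table does not list for this scenario are rejected.
    for name in sorted(set(payload) - allowed):
        problems.append(f"unknown field: {name}")

    # The table names only .wav and .mp3 as audio types.
    if payload.get("audio_type") not in (None, ".wav", ".mp3"):
        problems.append("audio_type must be .wav or .mp3")

    return problems
```

For example, check_request("tts", {"text": "Hello", "voice": "anna"}) returns an empty list, while an LLM request without request text is reported as missing the text field. Rejecting unlisted fields is a design choice of this sketch, not something the table requires.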