/

Voice API

Voice API

Owned by Jürgen Baier

Last updated: Feb 05, 2024

1 min read

Szenarios (Non-Streaming)	Request data	Response data

Szenarios (Non-Streaming)

Request data

Response data

Full voice request

Audio + text_data → Audio + text_data

Mandatory:

Audio-recording of user-voice
Response-voice id/name
Personality/Prompt

Optional:

LLM-Model to use (GPT3.5, GPT4, GPT4-turbo, Mistral-7B,…)
Audio-type: .wav/.mp3
Language
LLM-Temperature

Mandatory:

Response audio-data
Response transscription (text)

Optional:

Detected language

Speech-to-text (STT)

Audio (+ text_data) → text_data

Mandatory:

Audio-recording of user-voice

Optional:

Audio-type source (.wav/.mp3)
Language

Mandatory:

Response transscription (text)

Optional:

Detected language

LLM-Response

text_data → text_data

Mandatory:

Personality/prompt
Request text

Optional:

LLM-Model to use (GPT3.5, GPT4, GPT4-turbo, Mistral-7B,…)
Language
LLM-Temperature

Mandatory:

Response text

Optional:

Detected language

Text-to-speech (TTS)

text_data → Audio

Mandatory:

Text to transform
Response-voice id/name

Mandatory:

Response audio-data

Data-Types:

Audio-type: .wav/.mp3
Language: Enumeration with a predefined list of available languages
Response-voice id/name: Enumeration with a predefined list of available voices
LLM-Model: Enumeration with a predefined list of available models
Temperature: float between and including 0 and 1

Related content

Voice recording

Voice recording

More like this

Power consumption in pib

Power consumption in pib

Read with this

pib kb test

More like this

Generate code from blockly-json

Generate code from blockly-json

Read with this

Example Python Script - Face Detection

Example Python Script - Face Detection

More like this

Development Software Setup

Development Software Setup

Read with this