Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 3 Next »

Szenario

Description

Request data

Response data

Full voice request

Mandatory:

  • Audio-recording of user-voice

  • Response-voice id/name

  • Personality/Prompt

Optional:

  • LLM-Model to use (GPT3.5, GPT4, GPT4-turbo, Mistral-7B,…)

  • Audio-type: .wav/.mp3

  • Language

  • LLM-Temperature

Mandatory:

  • Response audio-data

  • Response transscription (text)

Optional:

  • Detected language

Speech-to-text (STT)

Mandatory:

  • Audio-recording of user-voice

Optional:

  • Audio-type source (.wav/.mp3)

  • Language

Mandatory:

  • Response transscription (text)

Optional:

  • Detected language

LLM-Response

Mandatory:

  • Personality/prompt

  • Request text

Optional:

  • LLM-Model to use (GPT3.5, GPT4, GPT4-turbo, Mistral-7B,…)

  • Language

  • LLM-Temperature

Mandatory:

  • Response text

Optional:

  • Detected language

Text-to-speech (TTS)

Mandatory:

  • Text to transform

  • Response-voice id/name

Mandatory:

  • Response audio-data

  • No labels