Retell AI connects to the user's audio source and reads and sends audio bytes through this WebSocket.
wss://api.retellai.com/audio-websocket/{call_id}
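As a minimal sketch of how a client might fill in the `{call_id}` placeholder and open the socket: the helper names `audio_websocket_url` and `send_audio_chunk` are hypothetical, the third-party `websockets` package is an assumed choice of client library, and the audio encoding and framing are not specified here, so the chunk is treated as opaque bytes.

```python
from urllib.parse import quote

# Endpoint taken from the line above; only {call_id} varies per call.
AUDIO_WS_BASE = "wss://api.retellai.com/audio-websocket"


def audio_websocket_url(call_id: str) -> str:
    """Build the audio WebSocket URL for one call (hypothetical helper)."""
    if not call_id:
        raise ValueError("call_id must be a non-empty string")
    # Percent-encode defensively so the ID cannot alter the URL path.
    return f"{AUDIO_WS_BASE}/{quote(call_id, safe='')}"


async def send_audio_chunk(call_id: str, chunk: bytes) -> None:
    """Open the socket and send one chunk of audio bytes (sketch only)."""
    import websockets  # third-party package, assumed to be installed

    async with websockets.connect(audio_websocket_url(call_id)) as ws:
        # A bytes payload is sent as a binary WebSocket frame.
        await ws.send(chunk)
```

A caller would run something like `asyncio.run(send_audio_chunk(call_id, chunk))` once it has a call ID. A real client would keep the connection open and read incoming frames as well.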
Web
Protocol.
Commonly used to render a live transcript in a web frontend.
close event when closed on error.
enable_audio_alignment is false.
enable_update is true.
agent_turn, user_turn
enable_audio_alignment is true.
metadata
media.payload.
register-call on your server to get the call ID
When do I have to write audio capture and playback code?
web to send, receive, and play audio bytes.
Do I have to deal with audio bytes myself?
Can I still mutate the audio bytes when making a phone call?