Step 1: Configure global settings
Agent Global Settings
Click on empty canvas and click setting to access global setting. Here’s where you set a lot of agent level settings.
Configure Voice Settings
- Open the voice selection dropdown menu:
- Listen to the available voice samples and select the voice you want to use for the agent:
Custom Voices: You can also add voices from the ElevenLabs community or clone voices by clicking “Add custom voice”. Learn more in our voice configuration guide.
- You can also adjust a couple voice settings:
- voice temperature to make the voice more variant or stable.
- voice speed to make the agent speak faster or slower.
- voice volume to make the agent speak louder or quieter.
- voice model (if applicable): when using certain voice providers, you can choose between different models that support. Check out dashboard for detailed nuances of each models.
Select Language of Agent
Here you can select what language the agent will understand (the language of user audio). You need to write in prompt something like respond in Spanish
for the agent to respond in a specific language. If the user audio can be both Spanish and English, you can select the option multi
to allow agent to understand both.
Select a Language Model
Select the model you want to use for the agent. Please note that you can override this within individual nodes. Optionally you can tune the LLM temperature to make answers more variant or more stable.
We recommend starting with GPT-4o, which offers an optimal balance of:
- Response quality
- Latency
- Cost-effectiveness
Write Global Prompt
Here’s where you specify the agent’s persona, identity, guardrails, etc. This set of text will be available in every node, and will influence all response generation.
Configure Knowledge Base
Here’s where you can supply contexts to agent via documents, urls, texts. Read more at Knowledge Base Guide.
Configure Speech Settings
Here’s a lot of options that allow you to finetune how your agent interacts with user.
- Background sound: select a background sound that plays throughout the whole call to mimic an environment like call center, making the conversation more humanlike and engaging.
- Responsiveness: how responsive the agent is. Set it lower if you want agent to respond slower, which can be useful when talking to folks like elderlys.
- Interruption Sensitivity: how fast the agent gets interrupted by user interruptions. Set it lower if you want agent to be more resilient to background speech.
- Backchanneling: Set up how often and what words the agent uses to acknowledge users.
- Boosted Keywords: Provides some biases towards certain words, making it easier to get recognized. Common ones are brand names, people’s names, etc.
- Speech Normalization: convert entities like date, currency, numbers into plain words, which can help prevent issues where audio generated was not pronuncing those right.
- Disable Transcript Formatting: return transcript with entities in plain words, not formatted to timestamps, numbers, etc. Can prevent issues that are caused by incorrect transcript formatting.
- Reminder frequency: how often the agent will remind the user when user is inactive.
- Pronunciation: set up pronunciation guide for specific words.
Configure Call Settings
Here’s a couple of settings that’s more call operation related.
- Voicemail related settings: set up voicemail detection and what to do when voicemail is detected. See more at Handle Voicemail.
- End call on silence: set up if user is active for a certain amount of time, the call will be ended.
- Call duration: set up maximum duration of the call.
- Pause before speaking: For the beginning of the call, if agent speaks first, it will wait for the configured duration before speaking, useful to handle scenarios when user is still picking up the phone.
Configure Post Call Analysis
Probably set up later, read more at Post Call Analysis Guide.
Configure Privacy & Webhook
Here’s where you can set up whether to opt out sensitive data storage, and configure webhook settings for receiving call related events.
Configure Who Speaks First
Click on begin
icon, and you can select who speaks first in the call.