We enforce a few constraints and limits to keep your agents running smoothly and to prevent misuse of the service. These limits can be adjusted on a case-by-case basis, depending on your operational needs.

Concurrency

Pay-As-You-Go users are allocated a quota of 20 concurrent calls. If you need additional concurrency, please reach out to our team at support@retellai.com to discuss options for scaling up.

You can check your current number of concurrent calls and your concurrency limit by calling the Get Concurrency API.

  • Handling Multiple Calls per Agent: You don’t need to create multiple agents to manage multiple calls concurrently. Each agent within your plan is capable of handling an unlimited number of calls, provided that the total concurrency remains within your designated quota. This means you can efficiently manage your workload without unnecessary agent duplication.
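As a minimal sketch of how a client might check the quota before placing new calls: the code below assumes the Get Concurrency API response has already been parsed into a dictionary with `current_concurrency` and `concurrency_limit` fields. Those field names, and the helpers `parse_concurrency`, `available_slots`, and `can_start_calls`, are illustrative assumptions rather than part of the documented API; check the API reference for the actual response shape.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Concurrency:
    current: int  # calls in progress right now
    limit: int    # total quota, e.g. 20 on Pay-As-You-Go


def parse_concurrency(payload: dict) -> Concurrency:
    # Field names are assumed for illustration; verify them against
    # the Get Concurrency API reference.
    return Concurrency(
        current=int(payload["current_concurrency"]),
        limit=int(payload["concurrency_limit"]),
    )


def available_slots(c: Concurrency) -> int:
    # Never report a negative number of free slots.
    return max(c.limit - c.current, 0)


def can_start_calls(c: Concurrency, requested: int) -> bool:
    # The quota counts total concurrent calls, so the same check applies
    # whether the new calls use one agent or several.
    return 0 < requested <= available_slots(c)
```

With 18 calls in progress against a limit of 20, `can_start_calls` accepts a batch of 2 and rejects a batch of 3, regardless of how many agents handle them.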

Max Call Duration

The maximum duration of a call is 1 hour by default; a call that reaches this limit ends automatically. Should your operational needs require longer calls, please reach out to our team at support@retellai.com to discuss options.
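If your application needs to wrap up a conversation before the automatic cutoff, it can track elapsed time on its own side. The following is a minimal sketch that assumes you record each call's start time yourself; `time_remaining`, `should_wrap_up`, and the 5-minute warning window are hypothetical choices for illustration, not part of the service.

```python
from datetime import datetime, timedelta, timezone

# Default limit stated above; adjust if your account has a different one.
DEFAULT_MAX_CALL_DURATION = timedelta(hours=1)


def time_remaining(
    started_at: datetime,
    now: datetime,
    max_duration: timedelta = DEFAULT_MAX_CALL_DURATION,
) -> timedelta:
    # Time left before the call is ended automatically, floored at zero.
    return max(started_at + max_duration - now, timedelta(0))


def should_wrap_up(
    started_at: datetime,
    now: datetime,
    warn_before: timedelta = timedelta(minutes=5),
    max_duration: timedelta = DEFAULT_MAX_CALL_DURATION,
) -> bool:
    # True once the call is within `warn_before` of the cutoff.
    return time_remaining(started_at, now, max_duration) <= warn_before


if __name__ == "__main__":
    start = datetime.now(timezone.utc) - timedelta(minutes=57)
    print(should_wrap_up(start, datetime.now(timezone.utc)))
```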

Max Prompt Token Length

The maximum prompt length when using the Retell LLM framework is 8192 tokens by default, and longer prompts are rejected when creating or updating the LLM. Should your operational needs require a longer context, please reach out to our team at support@retellai.com to discuss options.
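To catch an oversized prompt before the create or update request is rejected, you can run a local pre-check. This sketch makes several assumptions: this page does not say which tokenizer the service uses, so `rough_token_count` falls back to the common rule of thumb of about 4 characters per token for English text, which is only an estimate. Pass your own `count_tokens` function if you know the tokenizer. All function names here are hypothetical.

```python
from typing import Callable

# Default limit stated above; adjust if your account has a different one.
DEFAULT_MAX_PROMPT_TOKENS = 8192


def rough_token_count(text: str) -> int:
    # Crude estimate (about 4 characters per token, rounded up).
    # This is NOT the service's tokenizer; treat the result as approximate.
    return (len(text) + 3) // 4


def check_prompt_length(
    prompt: str,
    max_tokens: int = DEFAULT_MAX_PROMPT_TOKENS,
    count_tokens: Callable[[str], int] = rough_token_count,
) -> int:
    # Return the token count, or raise if the prompt exceeds the limit.
    tokens = count_tokens(prompt)
    if tokens > max_tokens:
        raise ValueError(
            f"Prompt is about {tokens} tokens; the limit is {max_tokens}."
        )
    return tokens
```

Because the estimate can differ from the real count, leave some headroom below the limit rather than relying on an exact match.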