You can monitor the latency of individual calls in the “Call History” section.

Understanding Latency Metrics

End-to-end latency measures the total time from when the user stops speaking until the AI agent begins responding. This includes processing time, network delays, and model inference time.
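As a rough illustration of the definition above, the latency of a single turn can be sketched as the gap between two timestamps. The `Turn` class and its field names are hypothetical and are not part of the product's API. They assume both events are recorded in seconds on the same clock.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    """One user/agent exchange. Field names are hypothetical, for illustration only."""
    user_stopped_at: float   # seconds: when the user stopped speaking
    agent_started_at: float  # seconds: when the AI agent began responding


def end_to_end_latency_ms(turn: Turn) -> float:
    """Return the time between end of user speech and start of agent response, in ms."""
    if turn.agent_started_at < turn.user_stopped_at:
        raise ValueError("agent response cannot start before the user stops speaking")
    return (turn.agent_started_at - turn.user_stopped_at) * 1000.0


# The user stops at 12.50 s and the agent starts at 13.25 s.
latency = end_to_end_latency_ms(Turn(user_stopped_at=12.50, agent_started_at=13.25))
print(latency)  # 750.0
```

Processing time, network delays, and model inference time all fall inside this single interval, so the sketch does not break them out separately.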

Key Metrics Explained

  • P90 (90th Percentile): 90% of calls have latency below this value.
  • Median (50th Percentile): Half of the calls have latency below this value.
  • Min: The fastest response time achieved in any call.
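To make the three metrics concrete, here is a minimal sketch that computes them from a list of per-call latencies. The function name `summarize_latencies` and the sample values are made up for illustration. The P90 uses the nearest-rank method, which is an assumption: the dashboard may use a different percentile method (for example, interpolation) and report slightly different numbers.

```python
import statistics


def summarize_latencies(latencies_ms):
    """Return min, median, and P90 for a list of per-call latencies in milliseconds.

    P90 uses the nearest-rank method (an assumption, not necessarily what
    the dashboard uses): the smallest value with at least 90% of calls
    at or below it.
    """
    if not latencies_ms:
        raise ValueError("need at least one latency value")
    ordered = sorted(latencies_ms)
    n = len(ordered)
    # ceil(0.90 * n) computed with integers to avoid floating-point rounding.
    rank = (90 * n + 99) // 100
    return {
        "min": ordered[0],
        "median": statistics.median(ordered),
        "p90": ordered[rank - 1],
    }


# Made-up latencies for ten calls, in milliseconds.
calls = [420, 510, 380, 950, 610, 470, 1200, 530, 490, 560]
print(summarize_latencies(calls))
# {'min': 380, 'median': 520.0, 'p90': 950}
```

In this sample the slowest call (1200 ms) does not affect the P90, while a plain average would be pulled up by it. That is why P90 and the median are usually more useful than the mean for judging typical responsiveness.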