Voice agents overview

Learn how to build voice agents with Speechmatics integrations and the Voice SDK.

Our Voice SDK provides features optimized for conversational AI, and we use it to build our integrations. Our integration partners are the quickest way to get a production voice agent up and running.

Features

Speechmatics provides building blocks you can use through integrations and the Voice SDK.

These include:

  • Turn detection: detect when a speaker has finished talking.
  • Intelligent segmentation: group partial transcripts into clean, speaker-attributed segments.
  • Diarization: identify and label different speakers.
  • Speaker focus: focus on or ignore specific speakers in multi-speaker scenarios.
  • Preset configurations: start quickly with ready-to-use settings.
  • Structured events: work with clean segments instead of raw word-level events.
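To make the ideas of segmentation, turn boundaries, and speaker focus more concrete, here is a minimal conceptual sketch in plain Python. It is not the Voice SDK API: the names Word, Segment, group_into_segments, max_gap, and focus_speakers are hypothetical and exist only for illustration, and a fixed silence gap is used as a crude stand-in for real turn detection.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Set


@dataclass
class Word:
    """Hypothetical word-level event: text, speaker label, timing in seconds."""
    text: str
    speaker: str
    start: float
    end: float


@dataclass
class Segment:
    """Hypothetical speaker-attributed segment built from consecutive words."""
    speaker: str
    text: str
    start: float
    end: float


def group_into_segments(
    words: Iterable[Word],
    max_gap: float = 0.7,
    focus_speakers: Optional[Set[str]] = None,
) -> List[Segment]:
    """Group word events into segments.

    A new segment starts when the speaker changes or when the silence since
    the previous kept word exceeds max_gap (an assumed, simplistic turn
    boundary). If focus_speakers is given, words from other speakers are
    ignored entirely.
    """
    segments: List[Segment] = []
    for word in words:
        # Speaker focus: skip anyone outside the focus set.
        if focus_speakers is not None and word.speaker not in focus_speakers:
            continue
        last = segments[-1] if segments else None
        same_turn = (
            last is not None
            and last.speaker == word.speaker
            and word.start - last.end <= max_gap
        )
        if same_turn:
            # Extend the current segment with this word.
            last.text += " " + word.text
            last.end = word.end
        else:
            segments.append(Segment(word.speaker, word.text, word.start, word.end))
    return segments


# Illustrative input: two speakers, with a pause before S1 speaks again.
words = [
    Word("Hello", "S1", 0.0, 0.4),
    Word("there", "S1", 0.45, 0.8),
    Word("Hi", "S2", 1.0, 1.2),
    Word("How", "S1", 2.5, 2.7),
    Word("are", "S1", 2.75, 2.9),
    Word("you", "S1", 2.95, 3.2),
]

for segment in group_into_segments(words):
    print(f"{segment.speaker} [{segment.start:.2f}-{segment.end:.2f}]: {segment.text}")
```

Passing focus_speakers={"S1"} to the same function drops the S2 words before grouping, which is the general idea behind focusing on or ignoring specific speakers. A production system would rely on the features listed above rather than a hand-rolled silence threshold.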

Integrations

Use an integration to handle audio transport and wiring, so you can focus on your agent logic.

Voice SDK

Use the Voice SDK to handle turn detection, group transcripts into clean segments, and apply diarization for LLM workflows.

See Voice SDK for getting started, presets, and configuration.

If you’re building an integration and want to work with us, contact support.