Speechmatics STT + multi-speaker conversations #2036

sam-s10s · 2025-06-20T14:45:47Z

This is an early development of the Speechmatics STT plugin for pipecat. It not only takes advantage of our fast STT engine, but also introduces the speaker diarization - this means the agent knows who is speaking in multi-speaker conversations.

User: <S1>Hello, it's John here!</S1> <S2>And Emma!</S2>
Assistant: Hello both of you!
User: <S2>Who is speaking now?</S2>
Assistant: That's Emma!

By providing instructions in the system context, it is possible for larger LLMs to be able to deduce from the speaker tags who is who. With Speechmatics, you can also pass in known speaker voice-prints and this will mean the STT and LLM will know who is speaking by name and not just an index.

This is a work in progress, as there are challenges in how the speaker information is used within other LiveKit pipeline modules, such as end of turn detection. At the moment the Speechmatics STT will do speech-based endpointing and this is used for end of utterance.

The README in the module directory is for reference as we develop!

sam-s10s added 15 commits June 19, 2025 15:35

initial config

7209bbd

skeleton

62d3e8d

Added a README (to be added to).

991ddea

Payloads coming from the ASR.

198542c

doc update

6ce7177

handle the partials and finals

8e75c1e

enable diarization in the example

ca9b2e0

support sending messages to pipecat pipeline

7e82110

requirements fix in README

51fb918

updated example (with amusement)

ed75f73

Merge branch 'main' into speechmatics-stt

282157e

updated example to match master

2834d5f

updated docs

0e041fb

support for diarization tags

3e6205f

logic fix for wrapper

69aa872

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Speechmatics STT + multi-speaker conversations #2036

Speechmatics STT + multi-speaker conversations #2036

sam-s10s commented Jun 20, 2025 •

edited

Loading

Uh oh!

Uh oh!

Speechmatics STT + multi-speaker conversations #2036

Are you sure you want to change the base?

Speechmatics STT + multi-speaker conversations #2036

Conversation

sam-s10s commented Jun 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

sam-s10s commented Jun 20, 2025 •

edited

Loading