Channels

Telephony

How the Bivariant platform handles voice calls and SMS through SIP trunks, phone number management, and LiveKit-powered real-time audio.

The telephony channel connects your platform workspace to the public phone network. It supports inbound and outbound voice calls, SMS messaging, and integration with AI agents for automated voice interactions.

Architecture

Telephony on the platform is built on:

ComponentRole
SIP TrunksConnect the platform to telecom carriers
Phone NumbersInbound numbers mapped to spaces and agents
LiveKitReal-time audio/video infrastructure
AgentsAI-powered voice agents that handle calls

Phone numbers

Phone numbers are the entry point for telephony. Each number is associated with a space and can be routed to an agent or flow.

Provisioning

Phone numbers can be:

  • Purchased through a supported carrier integration
  • Ported from an existing provider by configuring SIP trunk routing

Routing

When a call arrives at a phone number, the platform:

  1. Identifies the associated space and channel configuration
  2. Creates or resumes a conversation
  3. Routes the call to the configured agent or flow
  4. Streams audio in real time via LiveKit

SIP trunks

SIP trunks provide the connection between the platform and telecom carriers.

Configuration

To configure a SIP trunk:

  1. Navigate to Channels → Telephony → SIP Trunks
  2. Enter the trunk credentials:
    • SIP server address
    • Authentication credentials (username/password or IP allowlist)
    • Codec preferences
  3. Map phone numbers to the trunk

Supported features

FeatureSupport
Inbound calls
Outbound calls
SMS (inbound)
SMS (outbound)
Call transfer
DTMF tones
Call recording

Voice agents

AI agents can handle voice calls through the telephony channel. When connected to a phone number, the agent:

  1. Receives the audio stream — real-time audio from the caller
  2. Transcribes speech — converts audio to text using the configured ASR provider
  3. Processes the message — runs the agent's response pipeline (context, knowledge base, LLM)
  4. Synthesizes speech — converts the response text to audio using TTS
  5. Streams the response — delivers audio back to the caller in real time

This creates a natural conversational experience over the phone, powered by the same agent configuration used in chat channels.

Call lifecycle

PhaseWhat happens
RingingIncoming call arrives; platform identifies the channel
ConnectedCall is answered; audio streams are established
ActiveConversation in progress; messages exchanged in real time
Hold / TransferCall can be placed on hold or transferred
EndedCall terminates; conversation is closed or paused

Recording and transcription

Calls can be recorded and transcribed automatically:

  • Recordings are stored as assets in the space's file storage
  • Transcriptions are attached to the conversation as messages
  • Both are subject to the space's retention and access policies