Schemas

Last updated April 16, 2026

AMDParams

FieldTypeDescription
initial_silence_timeoutintegeroptionalMax milliseconds of silence before declaring no_speech 2500
greeting_durationintegeroptionalSpeech duration threshold (ms) above which answerer is classified as machine 1500
after_greeting_silenceintegeroptionalSilence duration (ms) after initial speech to declare human 800
total_analysis_timeintegeroptionalMax analysis window in milliseconds 5000
minimum_word_lengthintegeroptionalMinimum speech burst duration (ms) to count as a word 100
beep_timeoutintegeroptionalMax time (ms) to wait for the voicemail beep after machine detection. 0 or omitted = disabled. 0

AddLegRequest

FieldTypeDescription
leg_idstringrequiredID of the leg to add
mutebooleanoptionalIf set, apply this mute state to the leg atomically before it joins the mixer (no race where un-muted audio enters the mix). Omit to leave current state untouched (useful when moving between rooms).
deafbooleanoptionalIf set, apply this deaf state to the leg atomically before it joins the mixer. Omit to leave current state untouched.
accept_dtmfbooleanoptionalIf set, control whether this leg receives DTMF digits broadcast from other legs in the same room. Omit to leave current state untouched (default for new legs is true).

AgentMessageRequest

FieldTypeDescription
messagestringrequiredContext or instruction to inject into the running agent session

CreateLegRequest

FieldTypeDescription
typeenumrequiredLeg type
Values: sip
uristringrequiredSIP URI to dial
fromstringoptionalCaller ID — sets the user part of the SIP From header (e.g. "+15551234567", "alice")
privacystringoptionalSIP Privacy header value (e.g. "id", "none")
ring_timeoutintegeroptionalSeconds to wait for answer; 0 = no timeout 0
max_durationintegeroptionalMaximum call duration in seconds after connect. Automatically hung up when reached. 0 or omitted = no limit. 0
codecsarray[enum]optionalCodec preference order
headersobjectoptionalCustom SIP headers to include in the outbound INVITE (e.g. X-Correlation-ID)
room_idstringoptionalRoom ID to auto-add the leg to once media is ready (early_media or connected). If the room does not exist, it is automatically created.
authanyoptionalSIP digest authentication credentials. If the remote challenges with 401/407, sipgo will retry with these credentials.
webhook_urlstring(uri)optionalRoute all events for this leg exclusively to this URL instead of global webhooks.
webhook_secretstringoptionalHMAC-SHA256 signing secret for the per-leg webhook.
amdanyoptionalEnable Answering Machine Detection on outbound calls. Include the object (even empty) to enable with defaults; omit to disable.
accept_dtmfbooleanoptionalIf false, this leg will not receive DTMF digits broadcast from other legs in the same room. Defaults to true. true

DTMFRequest

FieldTypeDescription
digitsstringrequiredDTMF digits to send (0-9, *, #)

DeepgramAgentRequest

FieldTypeDescription
settingsobjectoptionalFull Deepgram agent settings object (agent.listen, agent.think, agent.speak, etc.). When omitted, sensible defaults are used (nova-3 STT, gpt-4o-mini LLM, aura-2-asteria-en TTS).
greetingstringoptionalAgent greeting message
languagestringoptionalLanguage code (e.g. "en", "es")
api_keystringoptionalAPI key override (falls back to DEEPGRAM_API_KEY env var)

ElevenLabsAgentRequest

FieldTypeDescription
agent_idstringrequiredElevenLabs agent ID
first_messagestringoptionalOverride the agent's first message
languagestringoptionalLanguage code (e.g. "en", "es")
dynamic_variablesobjectoptionalKey-value pairs passed to the agent as dynamic variables
api_keystringoptionalAPI key override (falls back to ELEVENLABS_API_KEY env var)

Error

FieldTypeDescription
instance_idstringoptionalInstance identifier
errorstringrequiredError message

ICECandidateInit

FieldTypeDescription
candidatestringrequiredICE candidate string
sdpMidstringoptionalMedia stream identification tag
sdpMLineIndexintegeroptionalIndex of the media description

Leg

FieldTypeDescription
instance_idstringoptionalInstance identifier
idstringrequiredUnique leg identifier (UUID)
typeenumrequiredLeg type
Values: sip_inbound, sip_outbound, webrtc
stateenumrequiredLeg state
Values: ringing, early_media, connected, held, hung_up
room_idstringoptionalRoom ID if the leg is in a room, empty otherwise
mutedbooleanrequiredWhether the leg is muted (cannot be heard by others)
deafbooleanrequiredWhether the leg is deaf (cannot hear others)
accept_dtmfbooleanrequiredWhether the leg receives DTMF digits broadcast from other legs in the same room. Defaults to true.
heldbooleanrequiredWhether the call is on hold (SIP legs only)
sip_headersobjectoptionalX-* headers from the inbound INVITE. Only present on sip_inbound legs.

PipecatAgentRequest

FieldTypeDescription
websocket_urlstring(uri)requiredWebSocket URL of the Pipecat bot (e.g. ws://my-bot:8765)

PlaybackRequest

FieldTypeDescription
urlstring(uri)requiredURL of the audio file (mutually exclusive with tone)
tonestringrequiredBuilt-in telephone tone name. Format: {country}_{type} or bare {type} (defaults to US). Types: ringback, busy, dial, congestion. Countries: us, gb, de, fr, au, jp, it, in, br, pl, ru. Examples: us_ringback, gb_busy, dial.
mime_typestringrequiredMIME type (e.g. audio/wav). Required when using url.
repeatintegerrequiredNumber of times to repeat playback (url only) 0
volumeintegerrequiredVolume adjustment in dB (-8 to 8) 0

RecordingRequest

FieldTypeDescription
storageenumrequired"file" (default) — local disk, "s3" — upload to S3 after recording stops
Values: file, s3
multi_channelbooleanrequiredWhen true, record each participant to a separate mono WAV file in addition to the full mix. Only applies to room recordings. false
s3_bucketstringrequiredS3 bucket name. Overrides S3_BUCKET env var. Required if env var is not set.
s3_regionstringrequiredAWS region. Overrides S3_REGION env var. Default us-east-1.
s3_endpointstringrequiredCustom S3 endpoint (MinIO, etc.). Overrides S3_ENDPOINT env var.
s3_prefixstringrequiredKey prefix (e.g. recordings/). Overrides S3_PREFIX env var.
s3_access_keystringrequiredAWS access key ID. Overrides default credential chain.
s3_secret_keystringrequiredAWS secret access key. Must be set together with s3_access_key.

Room

FieldTypeDescription
instance_idstringoptionalInstance identifier
idstringrequiredRoom identifier
participantsarray[object]requiredLegs currently in this room

RoomCreateRequest

FieldTypeDescription
idstringrequiredCustom room ID (auto-generated UUID if omitted)
webhook_urlstring(uri)optionalRoute all events for this room exclusively to this URL instead of global webhooks.
webhook_secretstringoptionalHMAC-SHA256 signing secret for the per-room webhook.

SIPAuth

FieldTypeDescription
usernamestringrequiredSIP auth username
passwordstringrequiredSIP auth password

STTRequest

FieldTypeDescription
languagestringrequiredLanguage code (e.g. "en", "es")
partialbooleanrequiredEmit partial (non-final) transcripts false
providerenumoptionalSTT provider: "elevenlabs" (default) or "deepgram"
Values: elevenlabs, deepgram
api_keystringoptionalAPI key override (falls back to ELEVENLABS_API_KEY or DEEPGRAM_API_KEY env var depending on provider)

StatusResponse

FieldTypeDescription
instance_idstringoptionalInstance identifier
statusstringrequired

TTSRequest

FieldTypeDescription
textstringrequiredText to synthesize
voicestringrequiredProvider-specific voice identifier. ElevenLabs: voice name or ID. AWS Polly: voice ID (e.g. Joanna, Matthew). Google Cloud: voice name — either full format (e.g. en-US-Neural2-F) or short name for Gemini models (e.g. Achernar, Kore). Deepgram: model name (e.g. aura-2-asteria-en).
model_idstringrequiredProvider-specific model/engine. ElevenLabs: model ID. AWS Polly: engine (standard, neural, long-form, generative; default neural). Google Cloud: model name (e.g. gemini-2.5-pro-tts, chirp3-hd).
languagestringoptionalLanguage code (e.g. "en-US", "pl-pl"). Required for Google Gemini TTS voices that use short names (e.g. Achernar). Auto-extracted from full voice names like en-US-Neural2-F.
promptstringoptionalStyle/tone instruction for promptable voice models (Google Gemini TTS only). E.g. "Read aloud in a warm, welcoming tone."
volumeintegerrequiredVolume adjustment in dB (-8 to 8) 0
providerenumoptionalTTS provider: "elevenlabs" (default), "aws", "google", or "deepgram"
Values: elevenlabs, aws, google, deepgram
api_keystringoptionalElevenLabs: API key override (falls back to ELEVENLABS_API_KEY env var). AWS: optional ACCESS_KEY:SECRET_KEY override (falls back to default AWS credential chain). Google Cloud: optional API key override (falls back to Application Default Credentials). Deepgram: API key override (falls back to DEEPGRAM_API_KEY env var).

TransferRequest

FieldTypeDescription
targetstringrequiredSIP URI to transfer the call to (e.g. "sip:bob@example.com").
replaces_leg_idstringoptionalID of an existing connected SIP leg whose dialog should be replaced (attended transfer). Omit for blind transfer.

VAPIAgentRequest

FieldTypeDescription
assistant_idstringrequiredVAPI assistant ID
first_messagestringoptionalOverride the agent's first message
variable_valuesobjectoptionalKey-value pairs passed as VAPI variable values (assistantOverrides.variableValues)
api_keystringoptionalAPI key override (falls back to VAPI_API_KEY env var)

VolumeRequest

FieldTypeDescription
volumeintegerrequiredVolume adjustment (-8 to 8, ~3dB per step, 0 = unchanged)

WebRTCOfferRequest

FieldTypeDescription
sdpstringrequiredSDP offer from the browser

WebhookEvent

Event envelope delivered via HTTP POST to registered webhook URLs. Event-specific fields are flattened into the top-level object (no "data" wrapper). Includes X-Signature-256 header when a secret is configured.

FieldTypeDescription
typeenumrequired
Values: leg.ringing, leg.early_media, leg.connected, leg.disconnected, leg.joined_room, leg.left_room, leg.muted, leg.unmuted, leg.hold, leg.unhold, dtmf.received, speaking.started, speaking.stopped, playback.started, playback.finished, playback.error, tts.started, tts.finished, tts.error, recording.started, recording.finished, recording.paused, recording.resumed, leg.transfer_initiated, leg.transfer_requested, leg.transfer_progress, leg.transfer_completed, leg.transfer_failed, room.created, room.deleted, stt.text, agent.connected, agent.disconnected, agent.user_transcript, agent.agent_response
timestampstring(date-time)required
instance_idstringoptionalInstance identifier