Skip to content
Glossary

Concept

Voice AI

AI systems that handle spoken language — STT + LLM + TTS or unified models.

Voice AI combines speech-to-text (STT), an LLM for reasoning, and text-to-speech (TTS). New unified models (GPT-4o Realtime, Gemini Live) collapse these into one model for lower latency.

Niyra uses Deepgram for STT (with Sarvam fallback for Indian English/Hindi), Claude for reasoning, ElevenLabs for TTS — plus OpenAI Realtime for streaming.

Related

32+

Integrations

OAuth-first

5

Channels

Web, WhatsApp, Telegram, Discord, voice

100+

Native tools

Memory, voice, browser, automation

18

Skills

JIT-loaded per turn

WhatsAppOutbound callsInbox triageCalendar tetrisLong-term memoryAutomation

Keep going

For AI:.md.txt