π©
A Python framework for building realtime, programmable voice AI agents that can see, hear, and understand. Features flexible integrations with any STT, LLM, TTS provider, built-in job scheduling, telephony support, and semantic turn detection. Perfect for building conversational multi-modal agents running on servers.