Lightweight full-duplex voice model offering the same real-time conversational capabilities as the full GPT-4o Realtime at significantly reduced cost and latency. Ideal for high-volume voice agent deployments.
Talk naturally with gpt-4o-mini-realtime
Start a conversation and speak freely. The AI will listen and respond naturally — no buttons between messages.
Demo Mode · Voice: Browser
Up to 80% cost reduction versus the full Realtime model
Faster time-to-first-token for latency-sensitive voice agents
Same API surface enables seamless model swapping
Deploy AI voice agents that handle customer inquiries with natural conversation flow and real-time responses.
Build always-on voice assistants for enterprise applications with full-duplex capabilities.
Enable voice-first healthcare consultations with HIPAA-compliant conversational AI.
Replace traditional IVR menus with natural language voice agents that understand intent.
// gpt-4o-mini-realtime — Conversational Voice Session
import { VoiceSession } from "@arkitekton/voice";
const session = await VoiceSession.create({
model: "vm-oai-002",
vendor: "openai",
config: {
fullDuplex: true,
language: "en-US",
turnDetection: "server_vad",
},
});
session.on("speech_started", () => {
console.log("Agent is speaking...");
});
session.on("transcript", (text) => {
console.log("User said:", text);
});
// Connect to audio stream
const mic = await navigator.mediaDevices.getUserMedia({ audio: true });
session.connect(mic);Foundation models for real-time voice and transcription