Gemini 3.1 Flash: Natural Audio AI Revolutionizes Real-Time Conversations

Gemini 3.1 Flash Live is Google's new, high-quality audio model designed for real-time, natural-sounding AI conversations. It reduces latency, filters background noise, and understands acoustic nuances to create more human-like interactions. This model powers Gemini Live and Search Live, expanding access to real-time multimodal AI assistance globally.

Gemini 3.1 Flash Live improves AI conversations by reducing the delay between speaking and hearing a response, making interactions more fluid. It also filters out environmental distractions like traffic or television noise, focusing on relevant speech. The model scored 90.8% on ComplexFuncBench Audio, demonstrating its ability to handle multi-step function calls with constraints.

SynthID is an imperceptible audio watermark integrated into Gemini 3.1 Flash Live's audio output to detect AI-generated content. This watermark helps prevent the spread of misinformation by allowing reliable identification of AI-generated audio. Google acknowledges the challenge of distinguishing between human and AI interaction and uses SynthID to address it.

Gemini 3.1 Flash Live is available globally in over 200 countries through Gemini Live and Search Live. Developers can access it in preview via the Gemini Live API in Google AI Studio to build voice agents. Enterprises can also use it in Gemini Enterprise for Customer Experience.

Gemini 3.1 Flash Live excels in complex audio tasks, achieving a score of 90.8% on the ComplexFuncBench Audio benchmark. It also scored 36.1% on Scale AI’s Audio MultiChallenge, which tests complex instruction following and long-horizon reasoning amid typical human interruptions and hesitations. This demonstrates its ability to handle real-world conversational scenarios effectively.

Gemini 3.1 Flash Live: Making audio AI more natural and reliable

AI Overview

Why Google's Latest Audio AI Changes Everything for Real-Time Interaction

What This Means For You

FAQFrequently Asked Questions

Related Articles

Figma Make: Master Builds with Context & Control

Ace BI Engineering: 30 AI Era Interview Questions

A Developer Cut Claude's Token Use by 75% — With Broken English

Gemma 4 Powers Agentic AI at the Edge

Beat Claude Caps: 4 Habits for Limitless AI Use

Microsoft Unleashes VibeVoice: Open-Source Frontier Voice AI

Mercor Eyes Your Past Work to Train AI

Windows 11 Deploys Widespread Haptic Feedback