Google AI developer relations lead Logan Kilpatrick announced on April 15 the release of Gemini 3.1 Flash TTS—the latest text-to-speech model from Google. This model supports 70 languages, fine-grained control at the level of scene direction and speakers, and audio tags. It is now available for use in the audio playground in Google AI Studio and in the Gemini API.
Four core features
Gemini 3.1 Flash TTS comes with four notable upgrades compared with its predecessor:
Scene Direction — You can set a context for the voice, such as “speaking softly in a noisy café” or “excitedly announcing good news,” and the model will adjust tone, speaking pace, and emotion based on the scene
Speaker-Level Specificity — In multi-role conversations, you can set different voice characteristics for each character
Audio Tags — Supports inserting sound-effect instructions into text to control details like pauses and tone changes
Support for 70 languages — Significantly expands multilingual coverage, including Chinese
More natural, more expressive voices
Google emphasized improvements in voice naturalness with this model. Traditional TTS models are often criticized for output that “sounds like AI.” Gemini 3.1 Flash TTS aims to narrow the gap with human speech through richer prosody variations and emotional expression. Kilpatrick noted that progress from Gemini 2.5 to 3.1 is “very significant.”
How developers can use it
Developers can use it in two ways:
Google AI Studio Audio Playground — Test and preview voice effects directly in the web interface
Gemini API — Integrate into applications for scenarios such as voice assistants, audiobooks, automatic Podcast generation, and multilingual customer service
Gemini product line keeps expanding
Flash TTS is part of the recent flurry of releases in the Gemini 3.1 series. Previously, Google rolled out Gemini Robotics ER 1.6 (robot vision reasoning), Tab Tab Tab (Vibe Coding prompt completion), and design preview features. Google is expanding Gemini from a “chat model” into a full-modal AI platform spanning text, speech, vision, and robotics.
This article Google releases Gemini 3.1 Flash TTS: Supports 70 languages and scene direction, for more natural AI voices first appeared on Liannews ABMedia.
Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to
Disclaimer.
Related Articles
Marvell teams up with Google to develop an AI MPU chip, and the stock price jumps 6.3% on the news
Google is discussing collaboration with Marvell to develop dedicated memory processing units (MPU) and tensor processing units (TPU) to address memory bottlenecks. If successful, the design will be completed in 2027. The collaboration is intended to strengthen Google’s competitiveness in the custom ASIC market, and Marvell’s operating performance has been strong, which has pushed the stock price up.
ChainNewsAbmedia2h ago
Nvidia Stock Touches $199.86 as Google, Startups Challenge Its AI Chip Dominance
Nvidia's stock fell to $199.48 amid increased competition in the AI chip market, particularly with Google launching new TPUs focused on inference. AI chip startups raised $8.3 billion in 2026, signaling a robust sector, with Rebellions raising substantial funding to target U.S. customers.
GateNews2h ago
a16z latest report: Why blockchain is the missing infrastructure piece that AI agents need?
a16z crypto’s latest report says that AI agents are evolving from support tools into economic actors, yet there are still major gaps in core infrastructure such as identity, payments, and cross-platform collaboration. The report emphasizes that as AI becomes involved in governance and transactions, verification mechanisms become the key to trust, and blockchain technology can provide verifiable infrastructure to address these challenges. The future will require cryptographic mechanisms to ensure that AI agents truly represent users’ intent and to change traditional payment systems.
ChainNewsAbmedia4h ago
Moonshot AI Releases Kimi K2.6 with Enhanced Coding and Agent Capabilities
Moonshot AI has released Kimi K2.6, featuring chat and Agent modes on kimi.com. It excels in coding benchmarks, supports 4,000 tool invocations, and upgraded parallel functionality for autonomous scenarios.
GateNews6h ago
Optiver Takes Equity Stake in Crypto and AI-Focused VC Firm Eden Block
Optiver Holding BV has invested in Eden Block, a venture capital firm focusing on cryptocurrency and AI. This move aims to enhance Optiver's exposure to innovative companies in these sectors, as both technologies could transform trading and capital markets.
GateNews7h ago
Cerebras Refiles for Nasdaq IPO After Clearing National Security Review Over UAE Ties
Cerebras Systems is reviving its Nasdaq IPO plans after passing a national security review. The AI chipmaker has diversified its revenue and reported significant growth while securing major partnerships, positioning itself as a competitor to Nvidia.
GateNews7h ago