Google launches Gemini 3.1 Flash TTS: Supports 70 languages and scene direction, making AI voices more natural

ChainNewsAbmedia

Google AI developer relations lead Logan Kilpatrick announced on April 15 the release of Gemini 3.1 Flash TTS, Google's latest text-to-speech model. The model supports 70 languages, fine-grained control through scene direction and per-speaker settings, and audio tags. It is available now in the audio playground in Google AI Studio and through the Gemini API.

Four core features

Gemini 3.1 Flash TTS brings four notable upgrades over its predecessor:

Scene Direction: You can set a context for the voice, such as “speaking softly in a noisy café” or “excitedly announcing good news,” and the model adjusts tone, pace, and emotion to fit the scene

Speaker-Level Specificity: In multi-speaker conversations, you can set different voice characteristics for each character

Audio Tags: Tags inserted into the text control details such as pauses and changes in tone

Support for 70 languages: Multilingual coverage is significantly expanded and includes Chinese
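
As a rough illustration, the first three features above could be combined in a single text prompt: a scene description, a style note per speaker, and a transcript with inline tags. The sketch below only assembles such a prompt as a string. The square-bracket tag syntax (`[pause]`, `[excited]`), the "Scene:" prefix, and the names `Line` and `build_tts_prompt` are assumptions made for illustration; the article does not specify the format the model actually expects.

```python
from dataclasses import dataclass


@dataclass
class Line:
    speaker: str  # speaker label, e.g. "Host"
    text: str     # what the speaker says; may contain inline tags


def build_tts_prompt(scene: str, styles: dict[str, str], lines: list[Line]) -> str:
    """Compose one prompt: scene direction, per-speaker style notes, then the transcript.

    The layout is hypothetical; it is not a documented Gemini prompt format.
    """
    # Every speaker in the transcript needs a style note.
    unknown = {ln.speaker for ln in lines} - set(styles)
    if unknown:
        raise ValueError(f"no style defined for speakers: {sorted(unknown)}")

    parts = [f"Scene: {scene}", ""]
    parts += [f"{name} speaks {style}." for name, style in styles.items()]
    parts.append("")
    parts += [f"{ln.speaker}: {ln.text}" for ln in lines]
    return "\n".join(parts)


example = build_tts_prompt(
    scene="two friends talking softly in a noisy café",
    styles={"Host": "in a warm, unhurried tone", "Guest": "quickly and excitedly"},
    lines=[
        Line("Host", "Welcome back. [pause] Any news?"),
        Line("Guest", "[excited] We shipped it!"),
    ],
)
print(example)
```

Keeping the scene, speaker styles, and transcript as separate inputs makes it easy to reuse one script with different direction, which is the kind of control the feature list describes.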

More natural, more expressive voices

Google emphasized this model's improvements in voice naturalness. Traditional TTS models are often criticized for output that “sounds like AI.” Gemini 3.1 Flash TTS aims to narrow the gap with human speech through richer prosodic variation and emotional expression. Kilpatrick noted that the progress from Gemini 2.5 to 3.1 is “very significant.”

How developers can use it

Developers can access the model in two ways:

Google AI Studio Audio Playground: Test and preview voice output directly in the web interface

Gemini API: Integrate the model into applications such as voice assistants, audiobooks, automated podcast generation, and multilingual customer service
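
For the API route, a minimal offline sketch of the two pieces an integration typically needs: a JSON request body that asks for audio output, and a helper that wraps the returned audio in a WAV container. The field names are modelled on the request and response shape of the earlier Gemini 2.5 TTS preview; whether Gemini 3.1 Flash TTS accepts the same fields is an assumption. The 24 kHz, 16-bit mono PCM defaults are likewise assumed, and the article does not give the model's API identifier, so no endpoint or model name appears here. The helper names are hypothetical.

```python
import base64
import io
import wave


def build_request(prompt: str, voices: dict[str, str]) -> dict:
    """Build a generateContent-style JSON body requesting audio.

    `voices` maps speaker labels used in the prompt to prebuilt voice names.
    Field names follow the Gemini 2.5 TTS preview (assumed to carry over).
    """
    speakers = [
        {"speaker": name, "voiceConfig": {"prebuiltVoiceConfig": {"voiceName": voice}}}
        for name, voice in voices.items()
    ]
    return {
        "contents": [{"parts": [{"text": prompt}]}],
        "generationConfig": {
            "responseModalities": ["AUDIO"],
            "speechConfig": {
                "multiSpeakerVoiceConfig": {"speakerVoiceConfigs": speakers}
            },
        },
    }


def pcm_to_wav(pcm: bytes, rate: int = 24000, channels: int = 1, width: int = 2) -> bytes:
    """Wrap raw PCM samples in a WAV container (defaults: 24 kHz, 16-bit, mono)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wf:
        wf.setnchannels(channels)
        wf.setsampwidth(width)  # bytes per sample
        wf.setframerate(rate)
        wf.writeframes(pcm)
    return buf.getvalue()


def audio_from_response(response: dict) -> bytes:
    """Decode base64 audio from a response shaped like the 2.5 TTS preview."""
    data = response["candidates"][0]["content"]["parts"][0]["inlineData"]["data"]
    return pcm_to_wav(base64.b64decode(data))
```

The body returned by `build_request` would be sent as JSON to the model's endpoint with an API key, and the bytes from `audio_from_response` can be written straight to a `.wav` file. Both steps should be checked against the current Gemini API documentation before use.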

Gemini product line keeps expanding

Flash TTS is part of a recent flurry of releases in the Gemini 3.1 series. Google earlier rolled out Gemini Robotics ER 1.6 (robot vision reasoning), Tab Tab Tab (Vibe Coding prompt completion), and design preview features. Google is expanding Gemini from a “chat model” into a fully multimodal AI platform spanning text, speech, vision, and robotics.

This article, “Google releases Gemini 3.1 Flash TTS: Supports 70 languages and scene direction, for more natural AI voices,” first appeared on ChainNews ABMedia.

