Google Launches Gemini Robotics ER 1.6: SOTA Robot Model, Excelling in Visual and Spatial Reasoning

ChainNewsAbmedia

Google DeepMind has released a new robotics foundation model, Gemini Robotics ER 1.6, where “ER” stands for Embodied Reasoning. The model achieves state-of-the-art (SOTA) performance in visual and spatial reasoning and is already available through the Gemini API. Logan Kilpatrick, Head of Developer Relations at Google AI, announced the release on social media. (Source)

What is Embodied Reasoning?

Embodied reasoning refers to an AI model’s ability to understand and reason about the physical world. Unlike traditional language models, embodied reasoning models must account for the positions, shapes, and materials of objects in three-dimensional space, as well as how those objects physically interact. Gemini Robotics ER 1.6 is specifically optimized for these kinds of tasks, enabling robots to understand their surroundings more accurately and choose appropriate actions.

Core capabilities

The main advantages of Gemini Robotics ER 1.6 focus on two areas:

Capability | Description
Visual reasoning | Identifies objects in images and videos, understands the structure of the scene, and makes decisions accordingly
Spatial reasoning | Understands the relative positions, distances, and directions of objects in three-dimensional space, supporting complex operation planning

The combination of these two capabilities allows robots to handle more complex real-world tasks. In a warehouse, for example, a robot must identify objects of different shapes and, at the same time, work out the best grasp angle and placement position for each one. This is exactly the kind of scenario Gemini Robotics ER 1.6 excels at.
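To make the spatial quantities mentioned above concrete (relative position, distance, and direction), here is a minimal Python sketch. It is purely illustrative and says nothing about how Gemini Robotics ER 1.6 works internally; the `Position` class, the function names, and the coordinate frame (metres, with bearings measured in the x-y plane) are all assumptions made for this example.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Position:
    """Hypothetical 3D position in metres, in an arbitrary robot frame."""
    x: float
    y: float
    z: float


def distance(a: Position, b: Position) -> float:
    """Straight-line (Euclidean) distance between two positions."""
    return math.dist((a.x, a.y, a.z), (b.x, b.y, b.z))


def planar_bearing_deg(a: Position, b: Position) -> float:
    """Direction from a to b in the x-y plane.

    Measured counterclockwise from the +x axis, in the range [0, 360).
    """
    return math.degrees(math.atan2(b.y - a.y, b.x - a.x)) % 360.0


# Example: a gripper at the origin and a box 3 m along x and 4 m along y.
gripper = Position(0.0, 0.0, 0.0)
box = Position(3.0, 4.0, 0.0)
print(distance(gripper, box))
print(planar_bearing_deg(gripper, box))
```

A real grasp planner would also need object orientation, collision checks, and gripper geometry, none of which are covered here.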

Using the Gemini API

Unlike many past robot models that only existed at the paper stage, Gemini Robotics ER 1.6 is already accessible via the Gemini API. This means developers and hardware vendors can integrate this model directly into their own robotic systems, without having to train the model from scratch.
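As a rough idea of what integration could involve, the sketch below post-processes a pointing-style answer from the model. It makes several assumptions that the article does not confirm: that the model is prompted to return a JSON list of objects with a `point` in `[y, x]` order and a `label`, and that coordinates are normalized to a 0 to 1000 range (a convention Google has documented for earlier Gemini Robotics-ER models; whether it applies unchanged to ER 1.6 is not verified here). The function name `points_to_pixels` and the sample response are hypothetical, and the actual API call is deliberately left out.

```python
import json


def points_to_pixels(response_text: str, width: int, height: int) -> list[dict]:
    """Convert normalized [y, x] points into pixel coordinates.

    Assumes response_text is a JSON list such as
    [{"point": [y, x], "label": "..."}] with y and x in the range 0..1000.
    """
    items = json.loads(response_text)
    results = []
    for item in items:
        y_norm, x_norm = item["point"]  # assumed order: y first, then x
        results.append(
            {
                "label": item["label"],
                "x_px": round(x_norm / 1000 * width),
                "y_px": round(y_norm / 1000 * height),
            }
        )
    return results


# Hypothetical model output for a 640x480 camera frame.
sample_response = '[{"point": [500, 250], "label": "cardboard box"}]'
for detection in points_to_pixels(sample_response, width=640, height=480):
    print(detection)
```

Keeping this conversion in a small, separately testable function means the robot-side code only deals with pixel coordinates, regardless of how the request to the model is made.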

Opening up the API also lowers the barrier to building robot AI. In the past, a robot system with visual and spatial reasoning capabilities required extensive data collection and model training. Now, developers can focus on hardware design and application scenarios, leaving the underlying reasoning to Gemini Robotics ER 1.6.

Google’s robotics AI roadmap

Gemini Robotics ER 1.6 is the latest achievement by Google DeepMind in the field of robotics. From the early RT-2 to the present Gemini Robotics series, Google has continued extending the capabilities of large language models into interactions with the physical world. The ER 1.6 version further improves reasoning accuracy on top of its predecessors, performing especially well in scenarios that require precise operations.

As the robotics industry enters a new growth cycle, foundation models with strong visual and spatial reasoning capabilities will become key infrastructure. To learn more about the development of the Gemini ecosystem, you can refer to the complete Gemini guide.

This article, “Google launches Gemini Robotics ER 1.6: SOTA robot model, strong in visual and spatial reasoning,” was first published on Chain News ABMedia.

