Google Launches Gemini Robotics ER 1.6: SOTA Robot Model, Excelling in Visual and Spatial Reasoning

ChainNewsAbmedia

Google DeepMind has released a brand-new robotics foundation model, Gemini Robotics ER 1.6, where “ER” stands for Embodied Reasoning (embodied reasoning). This model achieves the current best performance (SOTA) in visual and spatial reasoning, and is already available through the Gemini API. Logan Kilpatrick, the Head of Developer Relations at Google AI, announced this on social media. (Source)

What is Embodied Reasoning?

Embodied Reasoning refers to an AI model’s ability to understand and reason about the physical world. Unlike traditional language models, embodied reasoning models must process the positions, shapes, materials, and physical interaction relationships of objects in three-dimensional space. Gemini Robotics ER 1.6 is specifically optimized for these kinds of tasks, enabling robots to understand their surroundings more accurately and make appropriate action decisions.

Core capabilities

The main advantages of Gemini Robotics ER 1.6 focus on two areas:

Capability Description Visual reasoning Able to identify objects from images and videos, understand the structure of the scene, and make decisions accordingly Spatial reasoning Understand the relative positions, distances, and directions of objects in three-dimensional space, supporting complex operation planning

The combination of these two capabilities allows robots to handle more complex real-world tasks. For example, in a warehouse environment, robots need to identify objects of different shapes at the same time and calculate the best grasp angle and placement position—this is exactly the kind of scenario Gemini Robotics ER 1.6 excels at.

Using the Gemini API

Unlike many past robot models that only existed at the paper stage, Gemini Robotics ER 1.6 is already accessible via the Gemini API. This means developers and hardware vendors can integrate this model directly into their own robotic systems, without having to train the model from scratch.

Opening up the API also lowers the development barrier for robot AI. In the past, building a robot system with visual and spatial reasoning capabilities required a large amount of data collection and model training work. Now, developers can focus on developing hardware design and application scenarios, leaving the underlying reasoning capabilities to Gemini Robotics ER 1.6.

Google’s robotics AI roadmap

Gemini Robotics ER 1.6 is the latest achievement by Google DeepMind in the field of robotics. From the early RT-2 to the present Gemini Robotics series, Google has continued extending the capabilities of large language models into interactions with the physical world. The ER 1.6 version further improves reasoning accuracy on top of its predecessors, performing especially well in scenarios that require precise operations.

As the robotics industry enters a new growth cycle, foundation models with strong visual and spatial reasoning capabilities will become key infrastructure. To learn more about the development of the Gemini ecosystem, you can refer to the complete Gemini guide.

This article Google launches Gemini Robotics ER 1.6: SOTA robot model, strong in visual and spatial reasoning was first published on Chain News ABMedia.

Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to Disclaimer.

Related Articles

Anthropic Deploys Election Safeguards for Claude Ahead of 2026 Midterms

Anthropic announced Friday a set of election integrity measures designed to prevent its Claude AI chatbot from being weaponized to spread misinformation or manipulate voters ahead of the 2026 U.S. midterm elections and other major contests around the world this year. The San Francisco-based

CryptoFrontier5h ago

DeepRoute.ai Advanced Driver Assistance System breakthrough: over 300k vehicles deployed. 2026 target: 1 million City NOA fleet.

DeepRoute.ai announced that its advanced driver-assistance system has been deployed in China for a cumulative total of more than 300k vehicles. In the past year, it helped avoid more than 180k potential incidents. Its 2026 goal is for its city NOA vehicle fleet to reach 1 million vehicles, with utilization exceeding 50%, and it is seen as a key step toward large-scale commercial deployment of Robotaxis. This move shows that autonomous driving in China has entered routine usage, while also creating a divergence from the United States’ vertical integration pathway, affecting the timing of the Asia-Pacific supply chain.

ChainNewsAbmedia6h ago

DeepSeek Releases V4-Pro and V4-Flash Models at 98% Lower Cost Than OpenAI's GPT-5.5 Pro

Gate News message, April 25 — DeepSeek released preview versions of V4-Pro and V4-Flash on April 24, both open-weight models with one million token context windows. V4-Pro features 1.6 trillion total parameters but activates only 49 billion per inference pass using a Mixture-of-Experts architecture.

GateNews11h ago

Judge Dismisses Fraud Claims in Elon Musk's OpenAI Lawsuit; Case Advances to Trial with Two Remaining Allegations

Gate News message, April 24 — A federal judge has dismissed fraud claims from Elon Musk's lawsuit against OpenAI, Sam Altman, Greg Brockman, and Microsoft, clearing the way for the case to proceed to trial on two remaining allegations: breach of charitable trust and unjust enrichment. U.S.

GateNews14h ago

OpenAI CEO Sam Altman Apologizes for Failing to Report School Shooter's Banned Account to Police

Gate News message, April 25 — OpenAI Chief Executive Officer Sam Altman apologized to the Tamborine community in Canada for the company's failure to notify police about a banned account linked to Jesse Van Rootselaar, who killed eight people at a school in February before taking his own life. OpenAI

GateNews15h ago

UAE Announces Shift Toward AI Government Model in the Next Two Years

His Highness Sheikh Mohammed bin Rashid Al Maktoum stated that the goal was for 50% of government sectors to operate through autonomous agentic AI. The transition will also include the training of federal employees to “master AI” and will be overseen by Sheikh Mansour bin Zayed. Key Takeaways:

Coinpedia15h ago
Comment
0/400
No comments