OpenAI partners with Paradigm to launch "EVMbench," testing whether AI can become the ultimate protector of smart contracts

ETH-5,06%

At the rapid intersection of artificial intelligence and blockchain technology, OpenAI led by Sam Altman has partnered with crypto investment giant Paradigm to officially launch EVMbench. This new benchmarking tool aims to rigorously evaluate whether AI agents can effectively detect, patch, and even simulate high-risk vulnerabilities in Ethereum smart contracts, safeguarding digital assets worth hundreds of billions of dollars.
(Background: Are cryptocurrencies never designed for humans? Dragonfly partner: The true users are AI agents.)
(Additional context: Sam Altman personally recruited! OpenClaw founder joins OpenAI, with personal AI agents “soon becoming core products.”)

Table of Contents

  • The Growing Security Challenges of Smart Contracts
  • How Does EVMbench Work?
  • Why Is OpenAI Investing in Blockchain Security?
  • Future Outlook: Can AI Become the Gatekeeper of Blockchain?

As AI technology advances rapidly, OpenAI recently announced a collaboration with crypto investment firm Paradigm to launch the new benchmark tool “EVMbench.” This tool is specifically designed to assess the performance of AI agents in the field of blockchain smart contract security. OpenAI states that this move aims to establish clearer AI evaluation standards for blockchain security while addressing the increasing need to protect assets in the decentralized finance (DeFi) sector.

The Growing Security Challenges of Smart Contracts

Smart contracts are self-executing code deployed on Ethereum Virtual Machine (EVM)-compatible blockchains and have become the core infrastructure supporting decentralized exchanges, lending platforms, and stablecoin payments. Currently, the total value of open-source crypto assets protected by these contracts often exceeds $100 billion. Since these contracts are usually immutable once on-chain, any vulnerabilities can lead to massive fund losses. Several high-profile attacks have occurred over the past years. Therefore, effective auditing and strengthening of smart contract security have become one of the most urgent issues in the blockchain industry.

How Does EVMbench Work?

EVMbench is based on real-world cases, collecting 120 severe vulnerabilities from 40 audit projects, most of which come from public code audit competitions like Code4rena, and additionally incorporating Paradigm-supported vulnerabilities related to Tempo blockchain payments. The test covers three core capabilities:

  • Detection: AI agents review smart contract code to identify known vulnerabilities, scoring based on severity and audit rewards.
  • Patch: AI must modify vulnerable contracts to eliminate exploitable risks while retaining original functionality, verifying effectiveness through automated testing and attack simulations.
  • Exploit: In a sandbox blockchain environment, AI agents execute full-scale fund theft attacks, with the system programmatically verifying attack success.

Through these three aspects, EVMbench provides a percentage-based overall performance score, allowing researchers and developers to clearly compare different AI models’ capabilities in smart contract security tasks.

Why Is OpenAI Investing in Blockchain Security?

OpenAI emphasizes in its official blog that as AI agents’ abilities to read, write, and execute code continue to improve, their role in highly valuable environments will become increasingly critical for defense. EVMbench is not only a test of AI limits but also aims to encourage the industry to apply AI to proactive auditing and reinforcement of deployed contracts, thereby reducing overall risk.

OpenAI also notes that this benchmark aligns closely with the “Preparedness Framework” describing high-risk network scenarios, demonstrating its comprehensive approach to AI security governance.

Future Outlook: Can AI Become the Gatekeeper of Blockchain?

The launch of EVMbench marks AI technology’s transition from general applications to highly specialized blockchain security. As DeFi and stablecoin payments continue to grow, reliable AI performance in detecting and patching vulnerabilities could significantly enhance the entire ecosystem’s security. However, the benchmark also reminds us that AI’s ability to exploit vulnerabilities must be strictly regulated to prevent malicious use. As AI models advance, EVMbench may become an important indicator of whether AI is capable of safeguarding digital assets.

View Original
Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to Disclaimer.

Related Articles

F2Pool Co-founder Wang Chun: ETH rebounded from $1,386 to $4,956 within 4 months, and investors should not be swayed by short-term panic emotions.

F2Pool Co-founder Wang Chun pointed out that the cryptocurrency market is cyclical and warned investors not to panic over short-term fluctuations. He emphasized that Bitcoin's mining mechanism is crucial, with miners playing a central role in network governance and security, and mentioned the key role miners have played in past controversies.

GateNewsBot1h ago

"Strategy Opponent Position" closes BTC and ETH short positions for profit, and reverses to build a $12 million BTC long position

According to BlockBeats, the whale address "Strategy Opponent" recently closed large-scale BTC and ETH short positions. After securing substantial profits, it established a 40x leveraged BTC long position, with a position size of $11.97 million, earning $2.85 million in the past 7 days. This address is considered a market participant opposite to MicroStrategy.

GateNewsBot2h ago

Gate Ventures: Increased volatility in mainstream assets, continuous development of industry infrastructure

Recently, market risk aversion has increased, the US dollar is strong, long-term government bond yields are rising, and gold has hit new highs. Cryptocurrency assets have declined, with significant net outflows from BTC and ETH, and market sentiment is extremely fearful. Traditional derivatives institutions and mining companies are adjusting their strategies, and market funds are becoming more cautious.

GateNewsBot2h ago
Comment
0/400
No comments
Trade Crypto Anywhere Anytime
qrCode
Scan to download Gate App
Community
  • 简体中文
  • English
  • Tiếng Việt
  • 繁體中文
  • Español
  • Русский
  • Français (Afrique)
  • Português (Portugal)
  • Bahasa Indonesia
  • 日本語
  • بالعربية
  • Українська
  • Português (Brasil)