OpenAI unveils EVMbench to test AI for smart contract security

OpenAI has launched EVMbench, a new framework developed with Paradigm, to evaluate whether artificial intelligence can effectively secure smart contracts on blockchains like Ethereum. The tool assesses AI's ability to identify, exploit, and fix vulnerabilities in these self-executing codes. This initiative aims to set standards for AI in blockchain security amid growing stakes in decentralized finance.

OpenAI is expanding its involvement in cryptocurrency security through the introduction of EVMbench, a benchmark designed to gauge the effectiveness of modern AI systems in handling smart contract vulnerabilities. Smart contracts, which are self-executing programs on blockchains such as Ethereum, power decentralized exchanges, lending protocols, and various onchain financial services. Once deployed, these contracts are generally immutable, making any security flaws potentially severe.

Collaborating with crypto investment firm Paradigm, OpenAI created EVMbench using real-world vulnerabilities identified in past audits and security competitions. The framework evaluates AI performance in three key areas: detecting security bugs, exploiting them in a controlled setting, and repairing the code without disrupting functionality.

The benchmark seeks to provide a standardized method for assessing AI's role in blockchain security, particularly as decentralized finance manages billions in user funds. OpenAI emphasized the importance of this work in a blog post, stating: “Smart contracts routinely secure $100B+ in open-source crypto assets. As AI agents improve at reading, writing, and executing code, it becomes increasingly important to measure their capabilities in economically meaningful environments, and to encourage the use of AI systems defensively to audit and strengthen deployed contracts.”

This launch highlights ongoing efforts to integrate AI defensively into crypto ecosystems, where the value at risk continues to grow.

Liittyvät artikkelit

Illustration of MetaMask's new AI agent wallet with security features for DeFi trades.
AI:n luoma kuva

MetaMask launches self-custodial AI agent wallet

Raportoinut AI AI:n luoma kuva

MetaMask has introduced a new wallet designed for AI agents to conduct decentralized finance trades while maintaining user oversight. The launch was announced Monday by the Consensys-owned provider. It includes built-in security measures such as transaction simulations and spending controls.

The Shanghai Futures Exchange is designing derivative contracts on AI tokens, while CME Group and Intercontinental Exchange announce futures for GPU rentals.

Raportoinut AI

Nearly 1,000 developers gathered in Miami Beach for the EasyA Hackathon at Consensus Miami 2026, building AI-native projects focused on autonomous payments, drones, and consumer apps. The event highlighted a shift toward practical AI and blockchain applications.

The Linux Foundation has launched a new initiative using Anthropic's Claude Mythos preview for defensive cybersecurity in open source software. Partners include AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorgan, Microsoft, NVIDIA, and Palo Alto Networks. The effort aims to secure critical software amid the rise of AI for open source maintainers.

Raportoinut AI

Anthropic's latest AI model Claude Mythos has leaked despite being deemed too dangerous for public release. Financial institutions now face advanced AI-powered attacks capable of exploiting unknown vulnerabilities.

Tämä verkkosivusto käyttää evästeitä

Käytämme evästeitä analyysiä varten parantaaksemme sivustoamme. Lue tietosuojakäytäntömme tietosuojakäytäntö lisätietoja varten.
Hylkää