Feb 19, 2026

OpenAI pits AI agents against each other to detect smart contract flaws

OpenAI said it is becoming increasingly important to evaluate the performance of AI agents in “economically meaningful environments” as their adoption grows.

OpenAI has launched a new benchmark that evaluates how well different AI models detect, patch and even exploit security vulnerabilities found in crypto smart contracts.

OpenAI released the “EVMbench: Evaluating AI Agents on Smart Contract Security” paper on Wednesday, in collaboration with crypto investment firm Paradigm and crypto security firm OtterSec, to evaluate how much the AI agents could theoretically exploit from 120 smart contract vulnerabilities.

Anthropic’s Claude Opus 4.6 came out on top with an average “detect award” of $37,824, followed by OpenAI’s OC-GPT-5.2 and Google’s Gemini 3 Pro at $31,623 and $25,112, respectively.

Source: Cointelegraph →

Trending Category

$ADA $ARB $BITF $BLSH $BMNR $BONK $DBKSF $ENA $LUXFF $MATH $MTT token $NIGHT $PUMP $RAY $S $SEA $SOL $SPACE token $SPTZ $SPX

OpenAI pits AI agents against each other to detect smart contract flaws

Related News

Ethereum Foundation starts staking ETH as client diversity concerns persist

‘Bitcoin scarcity is dead’: Crypto executives push back on viral claim

Solo Bitcoin miner bags over $200K block reward using rented hashrate

Vitalik sells 17K ETH in one month after earmarking $45M for privacy

Stablecoin stagnation, tariffs a headwind for Bitcoin prices, analysts say

Trending Category

Latest News

Bitcoin Faces $3.4B Long Liquidation Risk Near $66.5K Zone

DeFi User Loses $50.4M in One Swap as MEV Bots and Protocol Failures Collide

Ethereum Futures Volume Surpasses Spot Trading Sixfold as Macro Pressures Mount

Bitcoin Whale Activity Hits Six-Year High as Retail Participation Stays Near Cyc...

HYPE Token Shows Net Daily Emission as HyperCore Buybacks Fall Short of Rewards

CLARITY Act risks handing crypto to centralized players: Gnosis exec

Is Bittensor (TAO) the Next Big Crypto Move? Investors Point to Revenue, Scarcit...

You Can Control an AI Agent's Crypto Spending With Ledger Hardware Wallets and M...

BIS Warns Stablecoins Can Depeg Even with Full Reserves: Here’s Why

Vitalik Buterin: Proof-of-Stake Is More Secure and Resilient Than Proof-of-Work

Legal

Crypto News

OpenAI pits AI agents against each other to detect smart contract flaws

Related News

Trending Category

Latest News