Bpay News
OpenZeppelin: EVMbench Dataset Breaches Trust in Crypto Security

By Bpay News | 9 hours ago | Updated: March 3, 2026 | 3 Mins Read

Blockchain security firm OpenZeppelin says it has found methodological flaws and data contamination in its audit of OpenAI’s new artificial intelligence benchmark for blockchain security, EVMbench.

OpenAI launched EVMbench in mid-February in partnership with crypto investment firm Paradigm. It was built to evaluate how well different artificial intelligence models can identify, patch, and exploit smart contract vulnerabilities.

In an X post on Monday, OpenZeppelin said it welcomed the initiative but recently decided to put EVMbench “through the same scrutiny” it applies to all the protocols it helps secure, including the likes of decentralized finance heavyweights Aave, Lido and Uniswap.

In its audit, OpenZeppelin found two key problems: training data contamination and invalid classifications of several high-severity vulnerabilities.

“We reviewed the dataset and identified methodological flaws and invalid vulnerability classifications, including at least four issues labeled high severity that are not exploitable in practice,” OpenZeppelin said.

EVMbench's release included an evaluation of how well AI agents could theoretically exploit smart contract vulnerabilities. Anthropic’s Claude Opus 4.6 topped the list, followed by OpenAI’s OC-GPT-5.2 and Google’s Gemini 3 Pro.

EVMbench testing may need revising

On the first issue, data contamination, OpenZeppelin said the most important capability in “AI security is finding novel vulnerabilities in code the model has never seen before.”

However, OpenZeppelin said that all of the highest-scoring AI agents in EVMbench’s testing had “likely been exposed to the benchmark’s vulnerability reports during pretraining.”

During EVMbench testing, internet access was cut off for the AI agents, meaning they couldn’t simply search for solutions to problems. However, the benchmark was based on curated vulnerabilities from 120 audits conducted between 2024 and mid-2025, while the training knowledge cutoffs for these agents are generally set to mid-2025, so the audit reports could have been part of their training data.

As such, the benchmark ran the risk that the AI agents already had the answers to the problems stored in their memory.

“While this does not necessarily enable the model to identify the issue immediately, it reduces the quality of the test. The dataset’s limited size further narrows the evaluation surface, making these contamination concerns more significant,” OpenZeppelin said.
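The contamination concern above comes down to a date comparison: a benchmark item whose audit report was public before a model's training cutoff may already have been seen by that model. The following is a minimal sketch of that check, not OpenZeppelin's or EVMbench's actual methodology. The `BenchmarkItem` schema, the item names, the publication dates, and the cutoff date are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class BenchmarkItem:
    """One benchmark task derived from a published audit report (hypothetical schema)."""
    item_id: str
    report_published: date


def split_by_cutoff(items, training_cutoff):
    """Partition items into (possibly_seen, unseen) relative to a training cutoff.

    An item counts as possibly seen when its report was public on or
    before the cutoff date. This is a necessary condition for
    contamination, not proof of it.
    """
    possibly_seen = [i for i in items if i.report_published <= training_cutoff]
    unseen = [i for i in items if i.report_published > training_cutoff]
    return possibly_seen, unseen


# Hypothetical items and cutoff, used only to illustrate the comparison.
items = [
    BenchmarkItem("audit-a", date(2024, 3, 1)),
    BenchmarkItem("audit-b", date(2025, 1, 15)),
    BenchmarkItem("audit-c", date(2025, 9, 30)),
]
cutoff = date(2025, 6, 30)

possibly_seen, unseen = split_by_cutoff(items, cutoff)
print([i.item_id for i in possibly_seen])  # ['audit-a', 'audit-b']
print([i.item_id for i in unseen])         # ['audit-c']
```

In this toy setup only `audit-c` would test the model on material it could not have seen in training. With a benchmark drawn entirely from audits that predate the cutoff, the `unseen` list would be empty.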

Finally, OpenZeppelin said that EVMbench’s dataset contained significant factual errors, arguing that several “high-severity vulnerabilities” were invalid.

OpenZeppelin said it had assessed at least four vulnerabilities that EVMbench classified as high risk but whose exploits don’t actually work. Despite this, EVMbench had been crediting AI agents with correct answers for reporting these vulnerabilities, which OpenZeppelin says are invalid.

“These aren’t subjective severity disagreements; they are findings where the described exploit doesn’t work.”
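To see why invalid ground-truth entries matter, consider a toy scoring example. This is a rough sketch under stated assumptions: the article does not describe how EVMbench computes scores, so the simple "fraction of labeled findings reported" metric, the finding IDs, and the agent's answers below are all hypothetical.

```python
def detection_score(found, ground_truth):
    """Return the fraction of ground-truth findings that the agent reported."""
    if not ground_truth:
        raise ValueError("ground_truth must not be empty")
    return len(found & ground_truth) / len(ground_truth)


# Hypothetical benchmark labels: five findings, two of which do not
# actually work as described.
ground_truth = {"H-01", "H-02", "H-03", "H-04", "H-05"}
invalid = {"H-04", "H-05"}

# Hypothetical agent output: one real finding plus both invalid ones.
agent_found = {"H-01", "H-04", "H-05"}

# Score against the labels as published.
raw = detection_score(agent_found, ground_truth)

# Score after removing invalid findings from both the labels and the answers.
corrected = detection_score(agent_found - invalid, ground_truth - invalid)

print(f"raw score:       {raw:.2f}")        # raw score:       0.60
print(f"corrected score: {corrected:.2f}")  # corrected score: 0.33
```

In this made-up case the agent's score drops from 0.60 to 0.33 once the invalid findings are excluded, because most of its credit came from reporting exploits that do not work.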

Ultimately, OpenZeppelin reiterated that AI will have a significant impact on bolstering blockchain security, but stressed the importance of applying the tech and testing it properly to maximize its potential.

“The question isn’t whether AI will transform smart contract security — it will. The question is whether the data and benchmarks we use to build and evaluate these tools are held to the same standard as the contracts they’re meant to protect.”

Context

Current positioning around Security & Hacks remains sensitive to primary-source updates, policy interpretation, and execution risk across major venues.

What To Watch

Focus on incident-response updates, wallet flow tracking, and whether recovery or mitigation actions are independently verified.

Follow-up coverage should prioritize confirmed technical details, affected systems, and user-protection timelines rather than speculative loss estimates.

© 2026 Powered by BPAY NEWS.