Close Menu
Bpay News
  • Home
  • Topics
    • Bitcoin
    • Ethereum
    • Altcoin
    • DeFi & Stablecoins
    • Regulation & Policy
    • Security & Hacks
  • Tokens
  • On-chain Briefs
  • Spotlights
  • Tools
    • Terminal
    • FlowDesk
    • Insight
  • Search
What's Hot

ARB Token Spotlight: Funding Pressure and Positioning Check

2 days ago
BPay News is the editorial desk for this coverage. Editorial Desk·About·Editorial Policy·Corrections Policy
Institutional Investors Boost Crypto Exposure Aimed for 2026 Survey Finds

OKX says it wont go public until it can deliver returns

2 weeks ago
BPay News is the editorial desk for this coverage. Editorial Desk·About·Editorial Policy·Corrections Policy
Gauntlet Secures $380M Exit in OKX Crypto Campaign

Canada Eyes Ban on Crypto Political Donations

2 weeks ago
BPay News is the editorial desk for this coverage. Editorial Desk·About·Editorial Policy·Corrections Policy
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram Pinterest Telegram RSS
Bpay News
  • Home
  • Topics
    • Bitcoin
    • Ethereum
    • Altcoin
    • DeFi & Stablecoins
    • Regulation & Policy
    • Security & Hacks
  • Tokens
  • On-chain Briefs
  • Spotlights
  • Tools
    • Terminal
    • FlowDesk
    • Insight
  • Search
Bpay News
Home»Security & Hacks»OpenZeppelin: EVMbench Dataset Breaches Trust in Crypto Security
OpenZeppelin: EVMbench Dataset Breaches Trust
Security & Hacks

OpenZeppelin: EVMbench Dataset Breaches Trust in Crypto Security

BPay NewsBy BPay News1 month agoUpdated:March 3, 20263 Mins Read
BPay News is the editorial desk for this coverage. Editorial Desk·About·Editorial Policy·Corrections Policy
Share
Facebook Twitter LinkedIn Pinterest Email

Blockchain security firm OpenZeppelin says it has found methodological flaws and data contamination in its audit of OpenAI’s new artificial intelligence benchmark for blockchain security, EVMbench.

EVMbench was launched in partnership with crypto investment firm Paradigm in mid-February. It was built to evaluate how well different artificial intelligence models can identify, patch, and exploit smart contract vulnerabilities.

In an X post on Monday, OpenZeppelin said it welcomed the initiative but recently decided to put EVMbench “through the same scrutiny” it applies to all the protocols it helps secure, including the likes of decentralized finance heavyweights Aave, Lido and Uniswap.

In its audit, OpenZeppelin found two key issues: training data contamination and classification issues related to several high-severity vulnerabilities.

“We reviewed the dataset and identified methodological flaws and invalid vulnerability classifications, including at least four issues labeled high severity that are not exploitable in practice,” OpenZeppelin said.​

The release of the EVMbench saw an evaluation of how well AI agents could theoretically exploit smart contract vulnerabilities. Anthropic’s Claude Open 4.6 topped the list, followed by OpenAI’s OC-GPT-5.2 and Google’s Gemini 3 Pro.

EVMbench testing may need revising

Looking at the first issue in data contamination, OpenZeppelin said the most important capability in “AI security is finding novel vulnerabilities in code the model has never seen before.”

However, during the EVMbench’s testing of AI agents, OpenZeppelin said that all the AI agents that scored the highest had “likely been exposed to the benchmark’s vulnerability reports during pretraining.”

During EVMbench testing, internet access was cut off for the AI agents, meaning they couldn’t simply search for solutions to problems. However, the benchmark was based on curated vulnerabilities from 120 audits conducted between 2024 and mid-2025, with the knowledge training cutoffs for these agents generally set to mid-2025.

As such, it ran the risk that the AI agents already had the answers to all of the problems stored in their memory.

“While this does not necessarily enable the model to identify the issue immediately, it reduces the quality of the test. The dataset’s limited size further narrows the evaluation surface, making these contamination concerns more significant,” OpenZeppelin said.

​Related: Energym AI dystopia goes viral as crypto projects tout user-owned AI agents

​Finally, OpenZeppelin said that there had been some significant factual errors in the EVMbench’s dataset, arguing that several “high-severity vulnerabilities” were invalid.

OpenZeppelin said it had assessed at least four vulnerabilities that EVMbench classified as high risk, but that don’t actually work. However, EVMbench had been scoring AI agents correctly for finding these supposedly false vulnerabilities.

“These aren’t subjective severity disagreements; they are findings where the described exploit doesn’t work.”

Ultimately, OpenZeppelin reiterated that AI will have a significant impact on bolstering blockchain security, but stressed the importance of applying the tech and testing it properly to maximize its potential.

“The question isn’t whether AI will transform smart contract security — it will. The question is whether the data and benchmarks we use to build and evaluate these tools are held to the same standard as the contracts they’re meant to protect.”

Context

Current positioning around Security & Hacks remains sensitive to primary-source updates, policy interpretation, and execution risk across major venues.

What To Watch

Focus on incident-response updates, wallet flow tracking, and whether recovery or mitigation actions are independently verified.

Follow-up coverage should prioritize confirmed technical details, affected systems, and user-protection timelines rather than speculative loss estimates.

Related Tokens

  • Uniswap (UNI)
  • Aave (AAVE)
  • NOT (NOT)
Share. Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email
Previous ArticleUS Senate Proposes Ban on Federal Reserves CBDC Issuance in Crypto Regulation
Next Article Nasdaq Backs Wall Streets Prediction Market Push in Crypto Market

Related Posts

BPayNews Crypto News
Security & Hacks 3 weeks ago3 Mins Read

Stablecoin Crash Hits 70%, Attacker Siphons $25M ETH

3 weeks ago
BPay News is the editorial desk for this coverage. Editorial Desk·About·Editorial Policy·Corrections Policy
BPayNews Crypto News
Security & Hacks 3 weeks ago2 Mins Read

OpenClaw Phishing Airdrop Scam Exploits $5K Token Offers

3 weeks ago
BPay News is the editorial desk for this coverage. Editorial Desk·About·Editorial Policy·Corrections Policy
BPayNews Crypto News
Security & Hacks 3 weeks ago4 Mins Read

Capital flight? The blackout factor Within minutes of missiles striking Iranian soil

3 weeks ago
BPay News is the editorial desk for this coverage. Editorial Desk·About·Editorial Policy·Corrections Policy
Add A Comment
Leave A Reply Cancel Reply

Subscribe

There was an error trying to submit your form. Please try again.

This field is required.

There was an error trying to submit your form. Please try again.

Recent Post

  • ARB Token Spotlight: Funding Pressure and Positioning Check2 days ago
  • OKX says it wont go public until it can deliver returns2 weeks ago
  • Canada Eyes Ban on Crypto Political Donations2 weeks ago
  • Stragegys (MSTR) STRC shares rebound to par value faster than historical average2 weeks ago
  • Wall Street wants the tech but not the transparency. DRWs Don Wilson2 weeks ago
  • XRP Sharpe Ratio Rise Aligns With Sustained Whale Inflows2 weeks ago
  • Bitcoin price news: BTC slips below $69,000 as oil rebounds on fading2 weeks ago
  • Bitcoin (BTC) holds ground as precious metals slide on ETF outflows2 weeks ago
  • Lummis Says CLARITY Act Offers Strong DeFi Protections2 weeks ago
  • The NYSE wants to bring blockchain to Wall Street without breaking2 weeks ago
  • Are stablecoins the infrastructure reshaping global finance2 weeks ago
  • Citi says stablecoin rewards restrictions could slow Circles USDC, not stop it2 weeks ago
  • Bitcoin Drops Below $68K but Long-Term Holder Buying Accelerates2 weeks ago
  • U.S. midterms pack major digital assets wallop as Stand With Crypto preps2 weeks ago
  • Brazil passes law turning seized crypto into public-security war chest2 weeks ago
  • Trust Will Become Cryptos Real Currency In The AI Economy2 weeks ago
  • Coinbase, Fannie Mae bring crypto-backed mortgages to home buyers2 weeks ago
  • Treasury Plans to Add Donald Trumps Signature to US Currency2 weeks ago
  • Everyone’s calling bitcoin resilient, may be it’s just complacent2 weeks ago
  • Crypto slides as oil spike, macro jitters trigger derivatives unwind2 weeks ago
Crypto
  • Google News
  • Bitcoin News
  • Ethereum News
  • Altcoin News
  • DeFi & Stablecoins
  • Regulation & Policy
  • Exchange News

Archives

  • April 2026
  • March 2026
  • February 2026
  • January 2026
  • December 2025
  • November 2025
  • October 2025
  • September 2025
  • August 2025

Legal

  • Cookies Policy
  • Terms of Use
  • Privacy Policy
  • Editorial Policy

Bpay Product

  • Bpay News
  • Bpay Rsi
  • Bpay Price
  • Bpay Liq
  • Bpay CN
  • Sitemap
© 2026 Powered by BPAY NEWS.
  • Home
  • Terminal
  • FlowDesk
  • About BPay News
  • Privacy Policy
  • Terms of Use
  • Corrections Policy

Type above and press Enter to search. Press Esc to cancel.