OpenZeppelin: EVMbench Dataset Breaches Trust in Crypto Security

By Bpay News | Updated: March 3, 2026 | 3 min read

Blockchain security firm OpenZeppelin says it has found methodological flaws and data contamination in its audit of OpenAI’s new artificial intelligence benchmark for blockchain security, EVMbench.

EVMbench was launched in partnership with crypto investment firm Paradigm in mid-February. It was built to evaluate how well different artificial intelligence models can identify, patch, and exploit smart contract vulnerabilities.

In an X post on Monday, OpenZeppelin said it welcomed the initiative but recently decided to put EVMbench “through the same scrutiny” it applies to all the protocols it helps secure, including the likes of decentralized finance heavyweights Aave, Lido and Uniswap.

In its audit, OpenZeppelin found two key problems: training data contamination and invalid classifications of several vulnerabilities labeled high severity.

“We reviewed the dataset and identified methodological flaws and invalid vulnerability classifications, including at least four issues labeled high severity that are not exploitable in practice,” OpenZeppelin said.

EVMbench’s release included an evaluation of how well AI agents could exploit smart contract vulnerabilities. Anthropic’s Claude Opus 4.6 topped the list, followed by OpenAI’s OC-GPT-5.2 and Google’s Gemini 3 Pro.

EVMbench testing may need revising

On the first issue, data contamination, OpenZeppelin said the most important capability in “AI security is finding novel vulnerabilities in code the model has never seen before.”

However, OpenZeppelin said that all of the highest-scoring AI agents in EVMbench’s testing had “likely been exposed to the benchmark’s vulnerability reports during pretraining.”

During EVMbench testing, internet access was cut off for the AI agents, meaning they couldn’t simply search for solutions. However, the benchmark was built from curated vulnerabilities drawn from 120 audits conducted between 2024 and mid-2025, while the training-data cutoffs for these agents generally fall in mid-2025.

As such, the benchmark ran the risk that the AI agents already had the answers to all of the problems stored in their memory.
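The overlap between audit dates and training cutoffs can be illustrated with a minimal Python sketch. It assumes each benchmark task records the date its audit report became public, and it flags tasks a model could plausibly have seen during training. The `BenchmarkItem` type, the function names, and all dates below are hypothetical and are not taken from EVMbench or OpenZeppelin's review.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class BenchmarkItem:
    """One benchmark task. Fields are illustrative, not EVMbench's schema."""
    audit_id: str
    report_published: date


def possibly_contaminated(items, training_cutoff):
    """Return items whose report was public on or before the training cutoff."""
    return [item for item in items if item.report_published <= training_cutoff]


def contamination_rate(items, training_cutoff):
    """Fraction of items that a model with this cutoff may have seen."""
    if not items:
        return 0.0
    return len(possibly_contaminated(items, training_cutoff)) / len(items)


# Made-up example data: two reports predate the cutoff, one does not.
items = [
    BenchmarkItem("audit-a", date(2024, 3, 1)),
    BenchmarkItem("audit-b", date(2025, 2, 15)),
    BenchmarkItem("audit-c", date(2025, 9, 30)),
]
cutoff = date(2025, 6, 30)  # assumed mid-2025 cutoff, for illustration only

flagged = possibly_contaminated(items, cutoff)
rate = contamination_rate(items, cutoff)
print([item.audit_id for item in flagged])
print(f"{rate:.0%} of items predate the cutoff")
```

A date comparison like this only shows that exposure was possible. It cannot show that a model actually memorized a given report.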

“While this does not necessarily enable the model to identify the issue immediately, it reduces the quality of the test. The dataset’s limited size further narrows the evaluation surface, making these contamination concerns more significant,” OpenZeppelin said.

On the second issue, OpenZeppelin said that EVMbench’s dataset contained significant factual errors, arguing that several “high-severity vulnerabilities” were invalid.

OpenZeppelin said it had assessed at least four vulnerabilities that EVMbench classified as high severity but whose described exploits do not actually work. Even so, EVMbench had been crediting AI agents for finding these invalid vulnerabilities.

“These aren’t subjective severity disagreements; they are findings where the described exploit doesn’t work.”

Ultimately, OpenZeppelin reiterated that AI will have a significant impact on bolstering blockchain security, but stressed the importance of applying the tech and testing it properly to maximize its potential.

“The question isn’t whether AI will transform smart contract security — it will. The question is whether the data and benchmarks we use to build and evaluate these tools are held to the same standard as the contracts they’re meant to protect.”
