AI Can Now Find 70% of Crypto Exploits | Alpin Yukseloglu, EVMBench Creator

AI's ability to exploit smart contracts has jumped from 20% to over 70% in just six months. Alpin Yukseloglu, co-author of EVMBench, reveals that frontier AI models are now catching up to the best human security auditors. As these capabilities surge toward superhuman levels—possibly by year-end—the industry faces a critical question: will defenders or attackers harness this power first? The conversation opens with an unexpected philosophical turn: how do you maintain agency when staring into the singularity, and why has crypto remained so contrarian and invisible to the AI labs building these tools?

BanklessInvesting7 Pessoas mencionadas 5 Termos do glossário

Duração do vídeo: 1:04:29·Publicado 5 de mar. de 2026·Idioma do vídeo: English

6–7 min de leitura·12,205 palavras faladas → resumido para 1,287 palavras (9x)·

Assistir no YouTube ↗

1 —

Pontos-chave

1

Top AI models have gone from finding less than 20% of critical smart contract bugs to over 70% in six months, approaching the performance of elite human auditors.

2

Crypto's verifiability gives AI a clear training signal, meaning models will get extremely good at crypto tasks very quickly—faster than most other domains.

3

The short-term risk is real: if only black hats deploy these tools, the industry is vulnerable. But the long-term outcome is in the hands of those who act with agency now.

4

EVMBench solves the false-positive problem by using a production-grade EVM environment, ensuring that if an AI claims to find a bug, it can prove it with a working exploit.

5

Crypto has been largely ignored by AI labs due to social stigma and reputational risk, but that creates an opening for the industry to shape its own AI future.

Em resumo

AI security capabilities are accelerating toward superhuman levels, and crypto's fate hinges on whether the industry acts with enough agency to ensure defense stays ahead of offense. Long-term, perfect verifiability makes crypto the ideal substrate for AI; short-term, it's a race.

2 —

The Singularity and the Agency Response

Staring into AI's exponential future requires agency, not acceptance or denial.

Alpin opens with a frank admission: thinking about superintelligent AI in the limit can be «psychosis inducing.» The key to staying sane, he argues, is agency—the belief that you can bend the arc of the future. Both doomers and accelerationists are wrong because they imply passivity: acceptance and denial are two sides of the same coin. The current frontier is «experimentally bound,» meaning no one can theorize their way to certainty. You have to get into the trenches, run experiments, and adapt.

Peter Thiel's framework is instructive: there are times when speed matters more than cohesion. In an environment where the frontier is unknowable, moving fast and shipping within 24 hours beats sitting for two weeks to plan the perfect strategy. This applies not just to AI labs but to anyone feeling frozen by the singularity. The solution isn't faith—it's action. Alpin believes the industry's fate is not predetermined; it depends on whether people exercise agency now, while the models are still catchable.

3 —

AI Exploit Detection: From 20% to 70% in Six Months

Frontier models now catch over 70% of critical smart contract bugs.

AI Exploit Detection (6 Months Ago)

Less than 20%

Models could find only 12–13% of fund-draining critical bugs when EVMBench work began.

AI Exploit Detection (Current)

Over 70%

GPT-5.3 CodeX can now find more than 70% of critical bugs in open audit contests like Code Arena.

Total DeFi Assets at Risk

Nearly $100 billion

The approximate value of assets locked in DeFi contracts that could be at risk if AI capabilities are misused.

Expected Timeline to Superhuman AI Auditor

6 to 8 months (by end of year)

Alpin is confident that AI will surpass the best human auditors within this timeframe.

4 —

How EVMBench Solves the False Positive Problem

🎯

Detect

The model identifies potential vulnerabilities in smart contracts by analyzing code patterns and logic flaws.

🛠️

Patch

The AI proposes fixes for discovered bugs, offering code corrections that can be tested and validated.

💣

Exploit

The model runs a proof-of-concept exploit against a production-grade EVM environment, draining funds to prove the bug is real—eliminating false positives.

5 —

Why Crypto is Invisible to AI Labs

Reputational volatility and stigma have kept crypto off the AI frontier—until now.

Alpin describes crypto as «the biggest industry that has remained the most contrarian» among his peer group. AI labs have avoided crypto due to social stigma, reputational risk, and a perception that the industry is dominated by scams. The gap between the best people in crypto and the median participant is wider than in any other field, and if you only see the median, the whole industry looks like a casino. This has created an opening: there are no 30 to 50 crypto benchmarks competing inside OpenAI or Anthropic. EVMBench entered without significant competition.

Alpin also points to liability concerns and the cultural mismatch between crypto and AI research communities. Yet this invisibility has been an alpha generator for investors. As the industry pushes its weight into model labs, crypto is finally entering the Overton window—and the labs are deferring to the industry to define what matters. That deference is agency-inducing: crypto can shape its own AI future.

6 —

The Verifiability Advantage

Crypto's on-chain verification makes it the fastest-improving AI domain.

💡

The Verifiability Advantage

Alpin divides the future into two categories: verifiable and unverifiable. Verifiable domains—where models get clear training signals—improve rapidly. Crypto is the most verifiable software substrate in existence. Every transaction, every state change, every bug exploit can be proven on-chain. This is why models are getting «strikingly good» at crypto despite limited training data. As Alpin puts it: «If you took the whole universe of code and looked at which pocket was the most verifiable, you'd end up with crypto.»

7 —

What's at Risk in the Short Term

Long-tail contracts and small-cap protocols are first in the firing line.

MOST AT RISK

Long-Tail, Low-TVL Protocols

Small-cap protocols on well-understood stacks like the EVM have historically been sheltered by obscurity. If exploiting a contract nets only a few thousand dollars, human attackers ignore it. But when inference costs drop to $10–50, bots will sweep the long tail. These contracts will be exploited en masse unless they harden quickly.

RELATIVELY SAFER

Battle-Tested OG DeFi

Major protocols with high TVL and years of Lindy—Uniswap, Aave, Compound—are safer in the near term because they've been stress-tested by talented adversaries. But they're not immune. The prize is larger, so when AI crosses the superhuman threshold, even these contracts will need defensive measures.

8 —

Speed Over Cohesion: The New Paradigm

In an unknowable frontier, shipping in 24 hours beats planning for two weeks.

“There are times when speed is more important than cohesion. We're in an era of speed over cohesion.”
— Alpin Yukseloglu (paraphrasing a colleague)

9 —

The Path Forward for Crypto

Crypto must act with agency to ensure defense stays ahead of offense.

1

Push Crypto Into Model Labs Get crypto-related benchmarks and training environments adopted by frontier labs like OpenAI and Anthropic.

2

Harden Major Contracts Defensively Assume AI will reach superhuman auditing by year-end. Top protocols must adopt AI-powered defensive scanning now.

3

Expand EVMBench Across Ecosystems Extend the benchmark to Solana, protocol-layer security, and other stacks to cover the full crypto surface area.

4

Embrace Formal Verification Use AI-powered formal verification (e.g., Harmonic's foundation math model) to prove correctness and reduce exploitability.

10 —

Why Crypto is Positively Levered to AI

Verifiability, scarcity, and extra-sovereign rails align crypto with AI's trajectory.

Alpin argues that crypto is positively levered to almost every macro trend. As AI commoditizes intelligence and new goods, scarce assets like Bitcoin and Ethereum become more valuable. As geopolitical instability rises, extra-sovereign financial rails—outside any jurisdiction—gain appeal. Crypto is «the equivalent of end-to-end encryption for finance.» People in unstable regions, like Turkey where Alpin grew up, are already using crypto as a lifeboat.

From first principles, if agents want to transact at the speed of light, they need rails faster than the pre-automobile banking system. Crypto solves the double-spend problem and enables near-instant, expressive transactions. The models are learning these rails quickly because they're verifiable. As Alpin puts it, if you rederive payments from first principles, «you end up in a place very similar to where we currently landed with crypto.» Long-term, crypto is the inevitable substrate for AI-native finance.

11 —

Títulos mencionados

ETHEthereum

SOLSolana

BTCBitcoin

12 —

Pessoas

Alpin Yukseloglu

Investment and Research Partner at Paradigm, Co-author of EVMBench

guest

David (Bankless host)

Podcast Host

host

Ryan (Bankless host)

Podcast Host

host

Matt (Paradigm partner)

Partner at Paradigm

mentioned

Justin Drake

Ethereum Researcher

mentioned

Peter Teal

Investor

mentioned

Vlad Tanov

Co-founder of Harmonic, formerly Robinhood and Tutor

mentioned

Glossário

EVMBenchAn open benchmark and agent harness developed by Paradigm and OpenAI to measure AI's ability to detect, patch, and exploit smart contract vulnerabilities on the Ethereum Virtual Machine.

Agent HarnessScaffolding around a large language model that gives it tools and superpowers specialized to a task, such as deploying contracts to a test EVM or running exploits.

False PositiveIn security auditing, a reported bug that is not actually exploitable—a major problem that EVMBench solves by requiring AI to prove bugs with working exploits.

Formal VerificationA mathematical proof that software behaves according to its specification, reducing the surface area for bugs by verifying correctness at a logical level.

Fund-Draining Critical BugsVulnerabilities in smart contracts that allow an attacker to steal all funds from the contract, as opposed to minor issues like temporary freezes.

Aviso: Este é um resumo gerado por IA de um vídeo do YouTube para fins educacionais e de referência. Não constitui aconselhamento de investimento, financeiro ou jurídico. Verifique sempre as informações com as fontes originais antes de tomar decisões. O TubeReads não é afiliado ao criador do conteúdo.