E
EVMbench
Benchmark for evaluating AI agents on smart contract security tasks
Open SourceFree
About
EVMbench is a research benchmark that systematically evaluates AI agents' capabilities in blockchain security. It tests agents across three critical dimensions: detecting vulnerabilities in smart contracts, patching identified security flaws, and exploiting weaknesses. Designed for researchers and developers building autonomous agents for Web3 security, it measures code comprehension, generation, and execution abilities specific to EVM-based smart contracts.
Details
| Type | |
| Integrations | |
| Language |
Tags
evaluationcoding-agentautonomousopen-sourcetool-use
Quick Info
- Organization
- Research Team (Wang et al.)
- Pricing
- open-source
- Free Tier
- Yes
- Updated
- Mar 13, 2026
Also in Dev Tools
C
Crawl4AI
Open-source web crawler optimized for LLMs and AI agents — 62K+ stars
OSSFree
unclecode
63.1Ktoday72
F
Firecrawl
Web scraping API built for LLMs — turn any website into LLM-ready data — 89K+ stars
OSSfreemium
Mendable
102.1Ktoday138
H
Headroom Context Optimization
Reduce LLM API costs by 50-90% through advanced context compression
OSSFree
Shubham Saboo
104.2Ktoday74