S
SWE-Explore
Benchmark for evaluating coding agents' repository exploration and code understanding abilities
Open SourceFree
About
Research benchmark that measures fine-grained capabilities of coding agents in repository exploration. Unlike traditional benchmarks, it evaluates specific skills including repository understanding, context retrieval, code localization, and bug diagnosis rather than binary pass/fail metrics. Designed to help researchers and developers assess how well AI agents navigate and comprehend codebases.
Details
| Type | |
| Integrations | |
| Language |
Tags
evaluationcoding-agentopen-sourceframework
Quick Info
- Organization
- Research Team
- Pricing
- open-source
- Free Tier
- Yes
- Updated
- Jun 8, 2026
Also in Dev Tools
C
Crawl4AI
Open-source web crawler optimized for LLMs and AI agents — 62K+ stars
OSSFree
unclecode
68.1K4d ago76
F
Firecrawl
Web scraping API built for LLMs — turn any website into LLM-ready data — 89K+ stars
OSSfreemium
Mendable
130.1Ktoday148
H
Headroom Context Optimization
Reduce LLM API costs by 50-90% through advanced context compression
OSSFree
Shubham Saboo
113.8K5d ago79