CapCode
Cheat-proof coding evaluation framework with performance-capped randomized tests
Open Source · Free
About
Research framework for building coding benchmarks that prevent agents from achieving perfect scores through memorization or shortcuts. Uses randomized test cases with performance caps to ensure evaluation scores reflect genuine problem-solving ability rather than dataset exploitation. Designed for rigorous assessment of code generation models and autonomous coding agents.
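The idea of randomized tests with a score cap can be illustrated with a minimal sketch. This is not CapCode's actual API: the names `make_cases`, `evaluate`, and `cap` are hypothetical, the sorting task is a stand-in, and reading "performance cap" as an upper bound on the reported score is an assumption made for illustration only.

```python
import random
from typing import Callable, List, Tuple

Case = Tuple[List[int], List[int]]


def make_cases(seed: int, n: int = 20) -> List[Case]:
    """Generate n randomized (input, expected) pairs for a toy sorting task."""
    rng = random.Random(seed)  # a fresh seed per run yields unseen inputs
    cases = []
    for _ in range(n):
        xs = [rng.randint(-1000, 1000) for _ in range(rng.randint(1, 30))]
        cases.append((xs, sorted(xs)))
    return cases


def evaluate(solution: Callable[[List[int]], List[int]],
             seed: int, cap: float = 0.95) -> float:
    """Return the pass rate on randomized cases, bounded above by `cap`.

    Assumption: the cap is a ceiling on the reported score; the real
    framework may define its caps differently.
    """
    cases = make_cases(seed)
    passed = 0
    for inp, expected in cases:
        try:
            if solution(list(inp)) == expected:
                passed += 1
        except Exception:
            pass  # a crashing solution simply fails that case
    return min(passed / len(cases), cap)


# A "memorizing" solution that only knows the answers for seed 0.
_table = {tuple(inp): out for inp, out in make_cases(0)}


def memorized(xs: List[int]) -> List[int]:
    return _table.get(tuple(xs), [])


honest_score = evaluate(sorted, seed=1)        # genuine solution, unseen seed
leaked_score = evaluate(memorized, seed=0)     # memorized cases, same seed
unseen_score = evaluate(memorized, seed=1)     # memorized cases, new seed
```

In this sketch the memorizing solution only does well when the evaluation seed matches the one it memorized, while a genuine solution scores the same on any seed.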
Details
| Type | |
| Integrations | |
| Language | |
Tags
evaluation, coding-agent, open-source, framework, autonomous
Quick Info
- Organization: Research Team
- Pricing: open-source
- Free Tier: Yes
- Updated: Jun 8, 2026