M
MalSkillBench
Security benchmark for detecting malicious AI agent skills with runtime verification
Open SourceFree
About
Academic benchmark for evaluating security tools that detect malicious agent skills. Provides verified ground truth test cases covering both code-based and natural language instruction-based threats. Addresses supply chain security risks from third-party agent components by testing detection capabilities against known malicious patterns in hybrid skill formats.
Details
| Type | |
| Integrations | |
| Language |
Tags
evaluationopen-sourceautonomoustool-useframework
Quick Info
- Organization
- Research Team
- Pricing
- open-source
- Free Tier
- Yes
- Updated
- Jun 8, 2026
Also in Dev Tools
C
Crawl4AI
Open-source web crawler optimized for LLMs and AI agents — 62K+ stars
OSSFree
unclecode
68.1K4d ago76
F
Firecrawl
Web scraping API built for LLMs — turn any website into LLM-ready data — 89K+ stars
OSSfreemium
Mendable
130.1Ktoday148
H
Headroom Context Optimization
Reduce LLM API costs by 50-90% through advanced context compression
OSSFree
Shubham Saboo
113.8K5d ago79