F
FinToolBench
Specialized benchmark for evaluating LLM agents on real-world financial tool use and compliance
Open SourceFree
About
Academic benchmark designed to test LLM agents in finance-specific scenarios requiring tool use, compliance awareness, and handling of volatile data. Unlike general-purpose evaluations, FinToolBench addresses high-stakes decision-making with real-world financial constraints. Evaluates dynamic agentic interactions beyond static text analysis, making it essential for organizations deploying AI in regulated financial environments.
Details
| Type | |
| Integrations | |
| Language |
Tags
evaluationtool-useautonomousopen-source
Quick Info
- Organization
- Research Collaboration
- Pricing
- open-source
- Free Tier
- Yes
- Updated
- Mar 10, 2026
Also in Dev Tools
C
Crawl4AI
Open-source web crawler optimized for LLMs and AI agents — 62K+ stars
OSSFree
unclecode
63.1Ktoday72
F
Firecrawl
Web scraping API built for LLMs — turn any website into LLM-ready data — 89K+ stars
OSSfreemium
Mendable
102.1Ktoday138
H
Headroom Context Optimization
Reduce LLM API costs by 50-90% through advanced context compression
OSSFree
Shubham Saboo
104.2Ktoday74