B
Braintrust
AI evaluation and experiment tracking platform for production LLM apps
commercialfreemium
About
Braintrust is a developer-first evaluation and experiment tracking platform built for LLM applications. It lets teams define scoring functions, run evaluations against golden datasets, compare prompt and model variants side-by-side, and track quality metrics over time. The platform integrates directly into CI/CD pipelines so regressions are caught before they reach production.
Details
| Type | evaluation |
| Integrations | openai, anthropic, langchain, litellm |
| Language | python, typescript |
Tags
evaluationexperiment-trackingllm-opstestingci-cd
Quick Info
- Organization
- Braintrust
- Pricing
- $50/mo
- Free Tier
- Yes
- Updated
- May 26, 2026
1.2Ktoday87
Also in Dev Tools
C
Crawl4AI
Open-source web crawler optimized for LLMs and AI agents — 62K+ stars
OSSFree
unclecode
66.3Ktoday71
F
Firecrawl
Web scraping API built for LLMs — turn any website into LLM-ready data — 89K+ stars
OSSfreemium
Mendable
124.5Ktoday145
H
Headroom Context Optimization
Reduce LLM API costs by 50-90% through advanced context compression
OSSFree
Shubham Saboo
111.8K3d ago78