B
Braintrust
AI evaluation and experiment tracking platform for production LLM apps
commercialfreemium
About
Braintrust is a developer-first evaluation and experiment tracking platform built for LLM applications. It lets teams define scoring functions, run evaluations against golden datasets, compare prompt and model variants side-by-side, and track quality metrics over time. The platform integrates directly into CI/CD pipelines so regressions are caught before they reach production.
Details
| Type | evaluation |
| Integrations | openai, anthropic, langchain, litellm |
| Language | python, typescript |
Tags
evaluationexperiment-trackingllm-opstestingci-cd
Quick Info
- Organization
- Braintrust
- Pricing
- $50/mo
- Free Tier
- Yes
- Updated
- Apr 1, 2026
1.1Ktoday82
Also in Dev Tools
C
Crawl4AI
Open-source web crawler optimized for LLMs and AI agents — 62K+ stars
OSSFree
unclecode
63.1Ktoday72
F
Firecrawl
Web scraping API built for LLMs — turn any website into LLM-ready data — 89K+ stars
OSSfreemium
Mendable
102.1Ktoday138
H
Headroom Context Optimization
Reduce LLM API costs by 50-90% through advanced context compression
OSSFree
Shubham Saboo
104.2Ktoday74