T
Together AI
4
Fast and affordable inference platform for open-source AI models
commercialpaid
About
Together AI is a cloud platform specializing in fast, efficient inference and fine-tuning of open-source AI models. It offers an OpenAI-compatible API to run models like Llama 4, Mixtral, DeepSeek, and Qwen at high throughput with competitive pricing. Together AI also provides dedicated GPU clusters for custom training and research collaborations.
Details
| Type | inference-platform |
| GPU Types | NVIDIA H100, NVIDIA A100 |
| Regions | US |
| Starting Price | $0.20/1M tokens |
Tags
inference-apiopen-source-modelsfine-tuningopenai-compatiblefast-inferencegpu-clusters
Quick Info
- Organization
- Together AI
- Pricing
- $0.20/1M tokens (Llama 3.1 8B)
- Free Tier
- Yes
- Popularity
- 0/100
- MAU
- 100K+
- Updated
- Feb 19, 2026
Also in Infrastructure
L
Lambda Labs
GPU cloud and workstations purpose-built for AI training
Commercialpaid
Lambda$1.85/hr (H100/H200) / $2.99/hr (B200)
M
Modal
Serverless cloud for AI and data-intensive applications
Commercialpaid
Modal Labs$30/month free credits / $0.000356/sec (A100 40GB)
R
RunPod
Affordable GPU cloud for AI inference and training
Commercialpaid
RunPod$0.44/hr (RTX 4090) / $1.99/hr (H100 PCIe) / $3.59/hr (H200)