R
Replicate
2
Run and deploy open-source AI models with a cloud API
commercialpaid
About
Replicate is a cloud platform that makes it easy to run open-source machine learning models via a simple API. It hosts thousands of models including Stable Diffusion, Llama, and Whisper, allowing developers to run inference without managing GPU infrastructure. Users can also deploy custom models using Cog, Replicate's open-source model packaging tool.
Details
| Type | model-hosting |
| Models Available | Llama 4,Stable Diffusion XL,Flux,Whisper,LLaVA,5000+ community models |
| Key Features | Serverless Inference API, Custom Model Deployment, Streaming, Webhooks, Fine-tuning, Cog Packaging |
| SDKs | Python, Node.js, Swift, Elixir, REST API |
Tags
model-hostinginference-apiopen-source-modelsserverlessgpudeployment
Quick Info
- Organization
- Replicate
- Pricing
- Pay-per-second GPU pricing (from $0.000225/sec)
- Free Tier
- Yes
- Popularity
- 6/100
- Stars
- 9.2K
- MAU
- 500K+
- Updated
- Feb 19, 2026
Also in Platforms
H
Hugging Face
The open-source AI community and model hub
OSSfreemium
Hugging Face157.0K$9/month (Pro) / Pay-per-use Inference Endpoints
A
Anthropic
AI safety company building reliable and steerable AI systems
Commercialfreemium
Anthropic$20/month (Claude Pro) / Pay-per-token API
O
OpenAI
Leading AI research lab and provider of GPT models
Commercialfreemium
OpenAI$20/month (ChatGPT Plus) / Pay-per-token API