DeepYard
Replicate

Run and deploy open-source AI models with a cloud API

commercial, paid

About

Replicate is a cloud platform that makes it easy to run open-source machine learning models via a simple API. It hosts thousands of models including Stable Diffusion, Llama, and Whisper, allowing developers to run inference without managing GPU infrastructure. Users can also deploy custom models using Cog, Replicate's open-source model packaging tool.
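To make the "simple API" concrete, here is a minimal sketch of how a prediction request might be assembled using only the Python standard library. It is an illustration under stated assumptions, not Replicate's documented client: the endpoint URL, the bearer-token header, and the `version`, `input`, and `webhook` fields are assumed here and should be checked against the official HTTP API reference. The helper name `build_prediction_request` and the placeholder values are hypothetical. The request is built but deliberately not sent, so the snippet runs without a token or network access.

```python
import json
import urllib.request

# Assumed endpoint; verify against Replicate's HTTP API reference.
API_URL = "https://api.replicate.com/v1/predictions"


def build_prediction_request(token, version, inputs, webhook=None):
    """Build (but do not send) a POST request that would start a prediction."""
    body = {"version": version, "input": inputs}
    if webhook is not None:
        # Assumed optional field: a URL to call when the prediction finishes.
        body["webhook"] = webhook
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Hypothetical placeholders, not real credentials or a real model version.
req = build_prediction_request(
    token="<your-api-token>",
    version="<model-version-id>",
    inputs={"prompt": "an astronaut riding a horse"},
)
# Sending it would be urllib.request.urlopen(req), which needs a valid token.
```

In practice, one of the listed SDKs would likely wrap this step; the raw request is shown only to make the shape of a serverless inference call visible.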

Details

Type: model-hosting
Models Available: Llama 4, Stable Diffusion XL, Flux, Whisper, LLaVA, 5000+ community models
Key Features: Serverless Inference API, Custom Model Deployment, Streaming, Webhooks, Fine-tuning, Cog Packaging
SDKs: Python, Node.js, Swift, Elixir, REST API

Tags

model-hosting, inference-api, open-source-models, serverless, gpu, deployment