DeepYard

Qwen3 235B A22B



Open Source · Paid

About

Qwen3-235B-A22B is a 235B-parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and code tasks, and a "non-thinking" mode for general conversational efficiency. The model demonstrates strong reasoning ability, multilingual support (100+ languages and dialects), advanced instruction following, and agent tool-calling capabilities. It natively handles a 32K token context window, extendable to roughly 131K tokens with YaRN-based scaling.
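The thinking/non-thinking switch is typically exposed as a request flag on OpenAI-compatible endpoints, though the exact parameter name varies by host. A minimal sketch of building such a request payload, assuming a hypothetical `enable_thinking` flag:

```python
def build_request(prompt: str, thinking: bool) -> dict:
    """Build a chat-completion payload for Qwen3-235B-A22B.

    The `enable_thinking` flag name is an assumption; hosts expose the
    mode switch under different names, and Qwen3 also honors /think and
    /no_think soft switches placed inside the prompt itself.
    """
    return {
        "model": "Qwen3-235B-A22B",
        "messages": [{"role": "user", "content": prompt}],
        # Thinking mode: slower, better for math, code, and reasoning.
        # Non-thinking mode: faster, for general conversation.
        "enable_thinking": thinking,
    }

req = build_request("Prove that sqrt(2) is irrational.", thinking=True)
```

The same payload shape works for non-thinking chat by passing `thinking=False`.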

Details

Modality: text
Context Window: 131.1K tokens
Release Date: Apr 28, 2025
API Available: Yes
Hosting: self-hosted, API
Output Speed: 120 tokens/sec
Time to First Token: 600 ms
Quality Index: 82/100
Coding Index: 75/100
Reasoning Index: 84/100
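The throughput and time-to-first-token figures above combine into a rough end-to-end latency estimate (a back-of-envelope sketch; real latency varies with load, prompt length, and whether thinking mode is on):

```python
def estimate_latency_s(output_tokens: int,
                       ttft_ms: float = 600.0,
                       tokens_per_sec: float = 120.0) -> float:
    """Rough generation latency: time to first token plus steady-state decode.

    Defaults use the figures listed above (600 ms TTFT, 120 tokens/sec).
    """
    return ttft_ms / 1000.0 + output_tokens / tokens_per_sec

# A 600-token answer: 0.6 s startup + 5.0 s decode = about 5.6 s.
total = estimate_latency_s(600)
```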

Benchmarks

Chatbot Arena ELO: 1400/1500
MMLU-Pro: 84.5/100
SWE-bench Verified: 52/100
MATH-500: 90.5/100
GPQA Diamond: 72/100
HumanEval: 92.5/100
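The benchmark scores above use mixed scales (ELO shown out of 1500, the rest out of 100); normalizing each to a 0-1 fraction makes them comparable at a glance. This is a simple illustrative sketch, not an official scoring method:

```python
# (value, scale) pairs taken directly from the benchmark list above.
scores = {
    "Chatbot Arena ELO": (1400, 1500),
    "MMLU-Pro": (84.5, 100),
    "SWE-bench Verified": (52, 100),
    "MATH-500": (90.5, 100),
    "GPQA Diamond": (72, 100),
    "HumanEval": (92.5, 100),
}

# Divide each value by its scale to get a comparable 0-1 fraction.
normalized = {name: value / scale for name, (value, scale) in scores.items()}

for name, frac in sorted(normalized.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {frac:.3f}")
```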

Tags

open-weight, api, chat