A
Agents-A1
Open-source multimodal agent model with image-text reasoning on Qwen 3.5 MoE architecture
Open SourceFree
About
Agents-A1 is a multimodal agent model from InternScience built on the Qwen 3.5 Mixture-of-Experts (MoE) architecture. It processes both images and text to generate text responses, specifically optimized for agent tasks like tool use and multi-step reasoning. Includes evaluation benchmarks for measuring agent performance across various tasks, making it useful for researchers and developers building vision-enabled AI agents.
Details
| Type | coding-agent |
| Deployment | self-hosted |
| Supported Models |
Tags
autonomousopen-sourcemulti-agenttool-useevaluationpython
Quick Info
- Organization
- InternScience
- Pricing
- open-source
- Free Tier
- Yes
- Updated
- Jul 2, 2026
Also in Agents
A
AI Data Analysis Agent
Autonomous agent that analyzes datasets and generates visual insights
OSSFree
Shubham Saboo
116.3K2w ago80
A
AI Deep Research Agent
Autonomous agent that conducts comprehensive multi-source research investigations
OSSFree
Shubham Saboo
116.3K2w ago80
A
AI Journalist Agent
Autonomous agent that researches topics and writes structured news articles
OSSFree
Shubham Saboo
116.3K2w ago80