Back to Blog List

Qwen-AgentWorld: Language World Models for General Agents

6/25/2026
Author: Q Yang
Category: AI
Qwen-AgentWorld: Language World Models for General Agents

TL;DR

Qwen just released Qwen-AgentWorld, a language model trained to simulate agentic environments. Instead of training agents in real environments (slow, expensive, uncontrollable), train a model that can predict what the environment would return for any given action. The model is the environment.

It covers 7 domains in a single model: MCP, Search, Terminal, Android, Web Browser, OS, and SWE.

Model Sizes

  • Qwen-AgentWorld-35B-A3B — MoE, 35B total / 3B active, 256K context — open source
  • Qwen-AgentWorld-397B-A17B — not open source

Three-Stage Training Pipeline

The paper "Language World Models for General Agents" describes a three-stage process:

  1. CPT (Continued Pre-Training) — 10M+ real-world interaction trajectories across 7 domains. The model learns state transition dynamics.
  2. SFT (Supervised Fine-Tuning) — Activates next-state-prediction reasoning via long chain-of-thought.
  3. RL (Reinforcement Learning) — Hybrid rubric-and-rule reward to sharpen simulation fidelity.

The key difference from prior work: environment modeling is the core training objective from stage one, not a post-hoc add-on. Hence "native world model."

Benchmark Results

Qwen-AgentWorld benchmark results

They created AgentWorldBench, spanning 7 domains with data from 5 frontier models.

ModelOverall
Qwen-AgentWorld-397B-A17B58.71
GPT-5.458.25
Claude Opus 4.657.80
Claude Opus 4.856.59
Qwen-AgentWorld-35B-A3B56.39
Gemini 3.1 Pro54.57

The 35B-A3B version (3B active parameters) matches Claude Opus 4.8 within 1 point. Without world model training, the same architecture scores only 47.73 — LWM training provides a +8.66 boost.

Key Discovery 1: Simulated Training Beats Real Training

Using Qwen-AgentWorld as a simulated environment for RL actually outperforms training in real environments.

MetricBaseline+ Sim RL (397B env)Delta
Claw-Eval65.469.7+4.3
QwenClawBench47.955.0+7.1

It also supports controllable simulation: inject perturbations, construct fictional worlds, train under harder conditions. Agents trained in fictional search worlds still generalize to real search (F1 +16.29).

Key Discovery 2: Predicting Environments Makes Agents Stronger

LWM warm-up on single-turn, non-agentic trajectories improves downstream agent performance — even on out-of-domain tasks with zero agent-specific fine-tuning.

BenchmarkWithout LWMWith LWMDelta
Terminal-Bench 2.033.2539.55+6.30
SWE-Bench Verified64.4767.86+3.39
WideSearch F1 (OOD)33.3846.17+12.79
Claw-Eval (OOD)53.6064.88+11.28
BFCL v4 (OOD)62.2971.25+8.96

Why This Matters

AgentWorldBench overview

This is the AlphaGo approach applied to agents: self-play in a simulated world, without needing real environments. If it scales, agent training costs drop dramatically — one model replaces thousands of browser instances, API calls, and Docker containers.

The open question: how well does the simulation generalize? The gap between simulated and real environments still exists. But as a proof of concept, this is one of the most interesting agent research directions of 2026.

Resources

  1. Paper: https://arxiv.org/abs/2606.24597
  2. GitHub: https://github.com/QwenLM/Qwen-AgentWorld
  3. Model: https://huggingface.co/Qwen/Qwen-AgentWorld-35B-A3B
  4. Benchmark: https://huggingface.co/datasets/Qwen/AgentWorldBench
  5. X announcement: https://x.com/Alibaba_Qwen/status/2069720365442719867
Share this article

Leave your comment

  • No comments yet.
Ad
Ad not loaded or not displayed

Recommended AI Tools

Carefully selected AI tools to improve your work, study, and live efficiency.

Virtual Try On

AI-powered virtual try-on for clothes, hairstyles, and accessories.

SPONSORED
Circle Crop Image

Circle Crop Image is a free online tool for creating round images.

SPONSORED
OpenArt

OpenArt is a versatile AI image and video generator.

SPONSORED
 Lipsync Studio

Transform your videos with advanced lip sync technology.

61.2K
SPONSORED
SAM TTS

Experience the nostalgic Microsoft SAM voice from Windows XP in your browser.

23.2K
SPONSORED
Image to Image AI

AI-powered image transformation for professional creative workflows.

SPONSORED
Grayscale Image

Grayscale Image is a free online tool for converting color photos to black and white with professional controls.

SPONSORED

Related Articles

Kimi Linear emerges: revolutionizing the attention architecture of Transformer, boosting long text processing efficiency by 6 times.
News
10/31/2025
Kimi Linear emerges: revolutionizing the attention architecture of Transformer, boosting long text processing efficiency by 6 times.
Author: Kimi Lv

A major breakthrough has been achieved in the core architecture of large-scale models! The release of Kimi Linear marks the first time that linear attention technology has comprehensively surpassed and significantly outperformed the traditional Transformer full-attention model in both performance and efficiency. This "win-win" achievement is expected to significantly reduce the computational barriers and costs for long text processing, complex reasoning, and AI agent applications, potentially changing the competitive landscape of underlying technologies for large-scale models.

In-depth analysis of OpenAI Polaris Alpha technology: A key sequel to the GPT-5.1 leak incident
News
11/12/2025
In-depth analysis of OpenAI Polaris Alpha technology: A key sequel to the GPT-5.1 leak incident
Author: Lydia

Over the past week, the AI ​​community's attention has been drawn to a mysterious model that quietly emerged on the OpenRouter platform—Polaris Alpha. As a direct continuation of yesterday's discussion of the GPT-5.1 leak, this suddenly appearing model brings more technical details and strategic signals worthy of in-depth exploration.

Grokipedia - xAI Launches New AI Knowledge Platform to Challenge Traditional Encyclopedias with AI Revolution
AI
10/28/2025
Grokipedia - xAI Launches New AI Knowledge Platform to Challenge Traditional Encyclopedias with AI Revolution
Author: Lucas

A new paradigm in knowledge acquisition has arrived, this time powered by AI.

2025, looking at the evolution of artificial intelligence
AI
4/24/2025
2025, looking at the evolution of artificial intelligence
Author: Q Yang

Standing at this moment in 2025, when we look back at the development journey of artificial intelligence, we witness how this revolutionary technology has reshaped every aspect of human society. From initial theoretical concepts to today's practical applications, each step forward in AI technology has changed the way we live. Let's revisit this fascinating journey together.

Most Popular AI Tools

Pollo AI

Pollo AI is a versatile AI image and video generator.

LogoAi
30% offCode:aiwithme

Create a stunning logo effortlessly with LogoAi.

FLUX API - PiAPI
5% offCode:AIWITHME

FLUX API by PiAPI offers advanced image generation capabilities.

Klap
30% offCode:AIWITHME

Klap transforms long videos into engaging shorts effortlessly.

458.4K
Midjourney API by PiAPI
5% offCode:AIWITHME

Transform text into stunning images with Midjourney API.