Anybody Can AI

Google Launches Chrome DevTools MCP in Public Preview

PLUS: Meta FAIR Unveils Code World Model (CWM)

Introducing Chrome DevTools MCP: Giving AI “Eyes” to Inspect, Trace & Fix Web Pages in Real Time

Google has rolled out the public preview of Chrome DevTools MCP (Model Context Protocol) — a new bridge that allows AI coding agents to directly interact with a live Chrome browser using full DevTools capabilities. Until now, AI assistants could generate code, but had no way to observe how that code behaves at runtime. With MCP, these agents can now inspect network activity, debug console logs, trace performance, simulate user actions, and validate fixes — all in a sandboxed environment.

Key Points:

  1. Live inspection & debugging - AI agents can connect via MCP to see and manipulate the browser’s DOM, CSS, console logs, network requests, and JavaScript execution — effectively giving them “eyes” inside Chrome.

  2. Performance tracing & audit tools - Agents can issue performance trace commands (e.g., performance_start_trace) to record Core Web Vitals metrics (LCP, INP, CLS), analyze bottlenecks, and recommend optimizations.

  3. Simulated user flows & automated validation - MCP supports navigation, clicks, form filling, waiting, and more — enabling AI agents to reproduce bugs, test flows, and verify that their fixes actually work under real browser conditions.
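To make this concrete, MCP clients are typically pointed at a server through a JSON configuration entry. The snippet below is a sketch, assuming the published npm package name (chrome-devtools-mcp) and the common mcpServers config shape used by MCP-capable agents; consult the official README for the exact fields supported by your client.

```json
{
  "mcpServers": {
    "chrome-devtools": {
      "command": "npx",
      "args": ["-y", "chrome-devtools-mcp@latest"]
    }
  }
}
```

Once registered, the agent can invoke the server's tools, such as the performance_start_trace command mentioned above, against a live Chrome instance.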

Conclusion
The introduction of Chrome DevTools MCP marks a meaningful shift in how AI-assisted web development can operate: no longer is an AI simply a code generator — it becomes an active participant in debugging, testing, and performance optimization. By closing the loop between suggestion and verification using actual browser data, MCP promises more reliable, context-aware fixes and faster development cycles.

Moving Beyond Token Prediction: Meta’s CWM Learns Code by Simulating Execution

Meta FAIR has released CWM (Code World Model), a 32-billion-parameter open-weights language model specifically engineered for code generation research with world modeling. Unlike traditional code models that treat code as static text, CWM is trained on observation–action traces and agentic interactions, allowing it to reason about how code changes system state.

Key Points:

  1. World-modeling via execution traces and agentic trajectories - CWM’s “mid-training” is done on execution traces (Python interpreter states) and multi-step interactions (in containerized environments), so the model can simulate how code affects state, not just predict text.

  2. Long context & architectural innovations - The model supports contexts up to ~131,000 tokens, using an alternating attention scheme (8k-token local sliding windows interleaved with 131k-token global layers) to scale to large codebases and multi-file reasoning.

  3. Competitive benchmark performance & open research use - CWM achieves strong results: ~65.8% pass@1 on SWE-bench Verified (with test-time scaling), ~68.6% on LiveCodeBench, 96.6% on Math-500, among others.
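The alternating attention scheme in point 2 can be sketched with toy-scale masks. This is an illustrative example, not Meta's implementation: the window sizes are scaled down so it runs instantly, and the interleaving ratio (three local layers per global layer) is an assumption for demonstration only.

```python
import numpy as np

def sliding_window_mask(seq_len: int, window: int) -> np.ndarray:
    """Causal mask where token i attends only to tokens [i-window+1, i]."""
    mask = np.zeros((seq_len, seq_len), dtype=bool)
    for i in range(seq_len):
        start = max(0, i - window + 1)
        mask[i, start:i + 1] = True
    return mask

def layer_windows(num_layers: int, local: int, global_: int,
                  locals_per_global: int = 3) -> list:
    """Assign a window size per layer, alternating local and global layers.
    The 3:1 ratio here is a hypothetical choice for illustration."""
    return [global_ if (l + 1) % (locals_per_global + 1) == 0 else local
            for l in range(num_layers)]

seq_len = 16
local_mask = sliding_window_mask(seq_len, window=4)    # stand-in for the 8k local window
global_mask = sliding_window_mask(seq_len, window=16)  # stand-in for the 131k global window

# A local layer sees only a short recent span; a global layer sees
# the full causal context at this toy scale.
print(local_mask[10].sum())   # 4 tokens visible to token 10
print(global_mask[10].sum())  # 11 tokens visible to token 10
print(layer_windows(8, 4, 16))
```

The design intuition is that cheap local layers handle most token mixing while periodic global layers propagate information across the full context, keeping long-context cost manageable.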

Conclusion

The release of CWM signifies a shift from “predictive code completion” toward a deeper ambition: understanding what code does. By embedding the dynamics of execution into training, CWM promises more robust reasoning, debugging, and repair capabilities in future code agents. While still research-oriented and not intended for production use, it gives the community a powerful open platform to explore agentic coding, long-horizon reasoning, and integrated world models in software engineering.

Cursor Integrates GPT-5-Codex — A Fine-Tuned Agentic Coding Model

Cursor has announced support for GPT-5-Codex, the new variant of GPT-5 tailored for software engineering agents. With this integration, developers using Cursor can now access a model fine-tuned for multi-step coding tasks, code review, refactoring, and autonomous bug fixing — within their preferred editor environment.

Key Points:

  1. Specialized model for agentic coding - GPT-5-Codex is a version of GPT-5 further optimized for software engineering tasks — building full projects, carrying out refactors, issuing code reviews, debugging, and working independently on complex coding flows.

  2. Dynamic reasoning & task adaptation - The model adjusts its internal “thinking time” depending on task complexity: simple requests are handled quickly, while more complex ones can evolve over hours of automated work and iteration.

  3. Cursor support and early feedback - Cursor has officially enabled GPT-5-Codex. Some users report integration quirks, such as model-list refresh issues or pauses during long-running tasks.

Conclusion

The inclusion of GPT-5-Codex within Cursor marks a significant step toward bringing truly agentic coding capabilities into everyday developer workflows. By combining model specialization, dynamic reasoning, and tight tool integration, this move could shift the balance from “AI-assisted coding” to “AI-driven engineering partner.” That said, early user feedback suggests room for smoothing out UX and execution consistency as the integration matures.

Thank you for reading.