Agent Browser Protocol Achieves 90.5% on Mind2Web Benchmark With Chromium Fork

Developer Solves AI Agent Browser Automation With State Synchronization

On March 11, 2026, developer @theredsix released Agent Browser Protocol (ABP), an open-source Chromium fork designed specifically for AI agents that achieved 90.5% accuracy on the Online Mind2Web benchmark using Claude Opus 4.6. The project addresses a fundamental problem in browser automation: AI agents reasoning from stale state rather than current page conditions.

Traditional Browser Automation Fails Due to Timing Issues

The developer identified that most browser-agent failures stem not from models misunderstanding web pages, but from reasoning based on outdated information. Between capturing a screenshot and executing an action, web pages can change—modals appear, autocomplete dropdowns open, or dynamic filters cause reflow. Traditional Chrome DevTools Protocol (CDP) approaches don't synchronize state capture with action execution, leading to unreliable automation.

Agent Browser Protocol Freezes State After Each Action

ABP implements a novel approach by forking Chromium at the browser level:

After each action (click, type, scroll), ABP freezes JavaScript execution and rendering
Captures the exact resulting browser state at that frozen moment
Compiles structured event summaries including navigation, file pickers, permission prompts, alerts, and downloads
Sends synchronized screenshot plus event data back to the agent
Agent reasons from fresh, accurate state for the next action

This turns browser interaction into a "multimodal chat loop" that matches how LLMs naturally interact with tools. The developer notes: "I think modern LLMs already understand websites, they just need a better tool to interact with them."

Open Source Release Targets Agent Framework Integration

The project is available on GitHub under a BSD license and integrates with Claude Code, Codex, and other agent frameworks via a simple command: claude mcp add browser -- npx -y agent-browser-protocol --mcp. The 90.5% Mind2Web result represents a significant improvement over traditional browser automation approaches, demonstrating that architectural changes to how browsers expose state can dramatically improve AI agent reliability without requiring better models.

The release generated 105 points and 33 comments on Hacker News, with developers praising the approach of solving browser automation at the infrastructure level rather than through prompt engineering or model improvements.

Key Takeaways

Agent Browser Protocol achieved 90.5% accuracy on Online Mind2Web benchmark with Claude Opus 4.6
The Chromium fork freezes JavaScript execution after each action to eliminate stale state issues
Open-source release available on GitHub under BSD license with easy integration via npx
Architecture turns browser automation into synchronized multimodal chat loop for AI agents
Project demonstrates that better tooling, not just better models, can improve AI agent performance

Developer Solves AI Agent Browser Automation With State Synchronization

Traditional Browser Automation Fails Due to Timing Issues

Agent Browser Protocol Freezes State After Each Action

ABP implements a novel approach by forking Chromium at the browser level:

After each action (click, type, scroll), ABP freezes JavaScript execution and rendering

Captures the exact resulting browser state at that frozen moment

Compiles structured event summaries including navigation, file pickers, permission prompts, alerts, and downloads

Sends synchronized screenshot plus event data back to the agent

Agent reasons from fresh, accurate state for the next action

Open Source Release Targets Agent Framework Integration

Key Takeaways

Agent Browser Protocol achieved 90.5% accuracy on Online Mind2Web benchmark with Claude Opus 4.6

The Chromium fork freezes JavaScript execution after each action to eliminate stale state issues

Open-source release available on GitHub under BSD license with easy integration via npx

Architecture turns browser automation into synchronized multimodal chat loop for AI agents

Project demonstrates that better tooling, not just better models, can improve AI agent performance