
Agent Browser Protocol (ABP) is an open-source custom Chromium build that integrates Model Context Protocol (MCP) servers and REST APIs directly into the browser engine, enabling deterministic web automation for AI agents. ABP completely pauses JavaScript execution and virtual time between agent actions, transforming web browsing into a discrete step machine where agents only act on stable, frozen world states. This eliminates race conditions between continuous web rendering and step-by-step AI reasoning. On the Online Mind2Web benchmark, ABP achieved a 90.53% reproducible success rate, drastically outperforming traditional DOM-scraping approaches that break on dynamic content, shadow DOMs, and anti-bot measures.
Why it matters
Traditional web automation tools like Selenium and Playwright operate at the DOM level — they query HTML elements, click buttons, and read text from the rendered page. This approach is fundamentally fragile because modern web applications use dynamic rendering, shadow DOMs, single-page application routing, and aggressive anti-bot detection. Every time a website changes its markup, the automation breaks. AI agents face an additional problem: they reason in discrete steps, but the browser renders continuously. Between an agent deciding to click a button and the click executing, the page state may have changed. ABP solves both problems simultaneously by freezing the entire browser state between agent actions and exposing browser capabilities through standardized MCP tools rather than brittle DOM selectors. This enables reliable autonomous web workflows that were previously impossible.

How it works
ABP modifies the Chromium browser engine at three levels. First, it intercepts the JavaScript event loop and virtual clock, completely pausing execution between agent steps — no animations run, no timers fire, no network requests complete until the agent signals readiness. Second, it exposes browser capabilities as MCP servers, meaning any AI model that speaks MCP can navigate pages, fill forms, click elements, and extract content using standardized tool calls rather than custom scripting. Third, it provides REST APIs for tasks that need direct programmatic control, such as screenshot capture, cookie management, and network interception. The combination of frozen state and standardized interfaces transforms the inherently asynchronous, non-deterministic web into a predictable state machine that AI agents can reliably navigate step by step.
Example
Consider an AI agent that needs to compare prices across five e-commerce sites, each with different layouts, dynamic loading patterns, and anti-bot measures. With traditional Playwright automation, you would need site-specific selectors, wait-for-element logic, and CAPTCHA handling for each site — and the scripts would break whenever a site redesigns. With ABP, the agent navigates to each site using MCP tools, the browser freezes after each page load, the agent reads the stable DOM to find prices, and moves to the next site. No custom selectors, no timing issues, no race conditions. If a site uses a shadow DOM or loads prices via JavaScript, ABP handles it at the engine level because the entire page is frozen and fully rendered before the agent examines it.