LLM Navigated Browser Automation

Diving deeper into

David Mlcoch, co-founder & CEO of Asteroid, on browser automation and the last mile problem of AI

Interview
the LLM will navigate the website itself
Analyzed 5 sources

This is the core shift from scripted automation to adaptive automation. Older tools needed a developer to prewrite every click path, so a changed button, popup, or form branch could break the workflow. Here the model is acting more like a human operator, reading the page in real time, deciding what element matters, then turning successful runs into reusable scripts that make later runs faster and cheaper.

  • The practical value shows up in messy back office work, not demos. In insurance quoting or healthcare scheduling, the agent has to move through long forms, conditional questions, and old portals with no API, where a human used to watch the screen and improvise step by step.
  • This sits between developer infrastructure and finished workflow software. Browserbase sells hosted browser infrastructure for developers, while Asteroid is packaging browser agents so non technical operations teams can supervise and run repeatable workflows at enterprise scale.
  • A close parallel is AI native testing. Momentic also replaces brittle selector based flows with systems that preserve user intent when the UI changes. The difference is that testing checks whether software works, while Asteroid is trying to complete business work inside the software.

The next step is a stack where voice agents collect information, browser agents enter it into old systems, and generated scripts compress common tasks from minutes to seconds. That turns browser use from a one off assistant feature into workflow infrastructure for industries still running on web portals built long before APIs were standard.