Nebius acquires Tavily for web grounding
Tavily
This deal turns Tavily from a standalone API into a built in layer of Nebius’s agent stack, which matters because web grounding is one of the last missing pieces between running a model and deploying a useful agent. Token Factory already handles model hosting, fine tuning, deployment, governance, and high volume inference. Tavily adds the live web access that lets those agents pull current facts, verify claims, and return citations without customers stitching together another vendor.
-
Tavily’s product is built for this exact handoff. A single query can fan out across up to 20 websites, crawl pages in real time, rank relevance, and return condensed text chunks with citations that an LLM can immediately use. That removes a lot of glue code customers otherwise write after a basic search API returns only links.
-
The integration also changes distribution. Tavily had been selling usage based API credits to developers and enterprise teams one account at a time. Inside Nebius, it can be activated through the cloud platform and Token Factory, which lets Nebius bundle search with inference and sell one combined workflow instead of two separate tools.
-
This follows a broader pattern in AI infrastructure. Exa and Parallel both moved beyond raw search into research, extraction, and agent workflows, while OpenAI, Google, and Microsoft have been pulling search into their own platforms. Owning Tavily gives Nebius a first party answer to that bundling pressure instead of leaving a critical feature to partners.
The next step is a more opinionated Nebius agent platform, where customers start with a model endpoint and then turn on search, research, and retrieval as native features. That should push Nebius up the stack from renting GPU backed inference to powering full production agents, where the software layer is stickier and captures more spend per customer.