Brave's Growing Copyright Exposure
Brave
This risk grows with revenue, not just with traffic. Brave is no longer only crawling the web to rank pages for its own search engine, it is packaging snippets and LLM ready context as a paid input for other AI products, which makes each indexed publisher page part of a commercial data supply chain. That widens the set of publishers with both an economic reason and a legal theory to challenge how Brave collects, stores, summarizes, and resells their content.
-
The exposure is tied to scale. Brave says its index covers 40 billion pages, updates more than 100 million pages per day, serves 1.6 billion monthly queries, and now supplies most of the top 10 LLMs with real time web data. More pages indexed and more downstream AI customers means more possible rightsholders and more surface area for claims.
-
The News Corp case matters because it tested this exact boundary. The suit targeted Brave tools that let third party chatbots use content from News Corp sites, and the case was voluntarily dismissed without prejudice, so the underlying claims were not resolved on the merits and can return later in a stronger fact pattern.
-
Comparable AI search companies are already moving toward publisher deals and revenue sharing. Perplexity launched a publisher program and later expanded it with partners like Gannett, which shows where this market is heading when answer engines depend on publisher content but want to reduce litigation risk and keep distribution rights stable.
The likely next step is a split web. One layer stays open for indexing, and another gets governed by licenses, robots controls, and commercial publisher programs. As Brave pushes deeper into AI infrastructure, it will need the same thing cloud software companies need with data vendors, clear rights to the inputs that power the product.