Unprofitable Long Tail in Aggregation
Diving deeper into
Tony Xiao, founder and CEO of Venice, on the opportunities in financial data aggregation
It's really expensive to build that and it's not very profitable because you have a long tail of institutions
Analyzed 3 sources
Reviewing context
The core economic weakness in bank aggregation is that the last 10,000 connections cost far more to maintain than they earn. Big aggregators keep their own pipelines for the highest traffic banks, then buy or outsource the rest, which is why coverage overlaps heavily and why fintechs still miss many edge case institutions even after paying for multiple vendors.
-
The product work is not just one connector. Each institution can require its own login flow, scraper, retry logic, and data cleanup. For a fintech that needs full financial visibility, unsupported institutions force manual CSV imports or custom integrations, which adds engineering cost fast.
-
Plaid won early on with easier pricing and better developer tooling, not because it had uniquely broad coverage. In practice, Plaid, Yodlee, Finicity, and MX often share the same long tail connectors, so adding a second aggregator improves reliability at the margin, not coverage in proportion.
-
This is why a multiplexer layer exists at all. Companies like Venice sit before the linking flow, route each institution to the provider that actually supports it, and let fintechs avoid rebuilding auth, credential storage, and normalization every time they need one more edge case connection.
As open banking APIs replace screen scraping at major institutions, raw access will keep becoming more standardized and less differentiated. The value will move toward routing, fallback coverage, and enrichment, meaning the winners will be the companies that turn messy, fragmented account data into something dependable and usable across many institution types.