TL;DR: Fivetran ($5.6B) has built a ~$190M a year business as the Zapier of ETL, but like Zapier, they face an existential threat as SaaS companies like Salesforce and Stripe begin to offer native integrations—in Fivetran’s case, to the cloud data warehouse. Check out our coverage of Fivetran (dataset) and read our interview with Conor McCarter, co-founder of Prequel, for more.
Key points from our research:
- With the rise of the cloud data warehouse and proliferation of SaaS, the need arose for connectors that could bring data into the cloud data warehouse from all the SaaS apps that a team uses. Many SaaS applications built their own BI tools for data analysis, but these were generally inflexible and couldn’t analyze data joined across disparate datasets, limiting teams’ ability to make sense of the data. (link)
- Fivetran ($5.6B) found initial product-market fit as the “Zapier” of ETL, saving data engineers time piping data from SaaS apps into a data warehouse by building and maintaining ~200 connectors for the most popular applications. Fivetran’s model is built on charging companies for the time-consuming work of maintaining their fat head of connectors built on top of SaaS vendors’ ever-changing APIs. (link)
- Fivetran made $95M in 2021, growing revenue by $30M after acquiring enterprise competitor HVR to move upmarket—they’re now projected to hit $190M by the end of 2022. Fivetran charges customers on a per-usage basis (how many rows of data they’re syncing) indexing them on the rise of companies syncing data from their apps into their cloud data warehouse. (link)
- Zapier grew in a capital efficient way by using their 6M+ uniques per month to incentivize companies to build their own zaps as a distribution channel—Fivetran’s model, where they build each connector themselves, comes with expensive ongoing engineering maintenance costs. Fivetran’s model indexes on quality of connectors and reducing work for companies’ data engineers due to the high criticality of the analysis jobs running through it, while Zapier can take a few more liberties on quality as a primarily prosumer, non-mission-critical tool. (link)
- Similar to how Plaid used screen scraping to build its universal API into banking data, Fivetran relies on API scraping to build its universal API into SaaS vendor data, which means Fivetran’s connectors break (requiring 3-7 days to fix) when those APIs change. Fivetran’s reliance on vendor APIs also opens up the risk, as with Plaid, that those vendors will want to reclaim the integration function for themselves and own the data that flows through their pipes. (link)
- As with Zapier, the data connector is getting commoditized, with Fivetran now competing with an increasing number of ETL companies like Matillion ($1.5B), Rivery (raised $48M), Meltano (raised $12.4M), Hevo Data (raised $43M), Nexla ($15.5M) and Airbyte ($1.5B) that traverse the spectrum of connector quantity/quality and managed/custom. Some of these companies expose API templates to companies so their engineers can manually tweak their off-the-shelf connectors, while others aim to use community-contributed connectors to cover more of the space. (link)
- Open source Airbyte ($1.5B) made around $1M last year and has become a prime competitor attacking the long-tail of SaaS-data warehouse integrations that Fivetran won’t build by shifting the responsibility of building connectors to their customer base (9,000+ strong as of 2021). The downside of Airbyte’s model is that while it already offers 200+ connectors, it relies on the companies that use their data warehouse connectors to maintain and update those connectors—with inconsistent quality and results—over time. (link)
- Today, there’s a growing trend of SaaS companies like Stripe, Salesforce and Customer.io offering native data warehouse integrations that increase retention/activation and revenue (Stripe charges $0.03 per transaction). SaaS vendors can design and support their customers’ export needs end-to-end rather than send them to a 3rd party tool like Fivetran. (link)
- These native integrations are rising first across high-volume vendors like Stripe, which cost Fivetran customers the most to sync because of Fivetran’s per-row pricing model. Customers don’t churn from all their Fivetran connectors at once, but instead begin to shift usage of high-value/high-frequency connectors first, ala ecommerce business that have to run frequent syncs on all their transactional data moving to Stripe’s native data warehouse integration. (link)
- Where Fivetran sold connectors and Airbyte sold their platform/community for building connectors, new tools like Prequel ($125K raised) sell the ability for vendors to build their own data warehouse connectors. Instead of ceding the revenue customers will pay a Fivetran/Airbyte for a warehouse integration—companies can sell their own native connector, charging anywhere from 10-30% of ACV for it. (link)
- With the continuing growth of both data warehouses and the long tail of SaaS, there's an opportunity for Fivetran and similar companies to build a platform that provides teams with visibility across all their data pipelines for observability and debugging. Today, engineers have to spend time monitoring a variety of different transformations, sources, and destinations to have a comprehensive view across all their data pipelines. (link)
For more, check out our full interview with Conor McCarter, co-founder at Prequel, and this other research from our platform:
- Fivetran company profile (dataset)
- Databricks company profile (dataset)
- Julia Schottenstein, Product Manager at dbt Labs, on the business model of open source
- Sean Lynch, co-founder of Census, on reverse ETL's role in the modern data stack
Read more from
Instabase: the $46M/year Palantir of banking
Unlocked Report
Continue Reading
Instabase revenue, growth, and valuation
Unlocked Report
Continue Reading
+1 others
Read more from
Read more from
Writer revenue, growth, and valuation
Unlocked Report
Continue Reading