Unified Data-Centric ML Platforms
Diving deeper into
Oscar Beijbom, co-founder and CTO of Nyckel, on the opportunites in the AI/ML tooling market
It's all the same thing and it's very hard to build an effective product around just one of them.
Analyzed 3 sources
Reviewing context
The winning ML product is shifting from a toolbox for specialists to a single workflow around a customer’s own data. In practice, labeling, model search, deployment, and monitoring feed each other. The same examples used to teach a model also show failure cases, guide what data to collect next, and determine whether a model is safe to ship. That makes any product built around only one slice of the stack feel incomplete.
-
Nyckel’s view is that the real interface should be inputs and outputs, not model knobs. Customers upload examples, review predictions on their own data, add labels in the UI, and get a deployed model quickly. That only works if training, evaluation, and deployment are tightly connected behind the scenes.
-
Scale’s move from labeling into Nucleus, Spellbook, InstantML, and Launch shows why point solutions get pulled together. Once better base models reduce the amount of labeled data needed, standalone annotation becomes less valuable, while the company that owns the full loop can capture more of the workflow and budget.
-
DataRobot is another example of the same consolidation logic. Its product spans model development, deployment, and monitoring for enterprise teams, and it reached an estimated $225M in revenue in 2023. The market has rewarded vendors that bundle the workflow rather than sell a single technical feature.
From here, ML tooling keeps moving toward a Twilio like abstraction for AI, where product teams bring examples and get production systems back. The durable platforms will be the ones that make data review, model selection, deployment, and feedback feel like one continuous product, not a chain of separate tools.