10 Comments

Really interesting article and very complete. Thanks for sharing!

Let me add a remark (and a question) about companies that sit between the two categories: those that offer a consumable AI service via API, meant to be integrated into software or automations, while themselves relying on lower-level third-party services. They nonetheless add processing on top and provide an "intelligence" of their own that justifies using them.

Concretely, this covers, for example, document-parsing services (contracts, invoices, resumes, etc.) that rely on Google or AWS for OCR but have trained their own NLP engines; the same goes for services built on speech recognition. Where would you categorize them?

At Eden AI (www.edenai.co) we are working to aggregate and harmonize foundation models, making them easier for users to adopt and very easy to swap depending on the performance achieved on their specific data or on price changes (a very important point). The line can be quite thin, however, between models provided 100% by one vendor and those partially built on someone else's.


Great read, thank you :) I was wondering how you assess defensibility for Level 1 companies. Will this become a closed circle of a few powerful companies, or do you see challengers gaining traction?

Jan 10, 2023 · Liked by Viet Le

Please share the prompt used to generate the title image


So what are your conclusions, then, about which companies are most likely to challenge the foundation-model generative AI darlings? Consolidation seems to be happening faster here; Stability.AI is already siding with AWS.


Awesome article, thanks for sharing!


Really cool article, Viet! :)

It contains lots of insights into the technical topics that founders of AI companies have to look into (e.g. the short-term cost of ML) while still having to build a product people want.

Just as you mentioned, my "fear" is open source falling behind proprietary solutions in AI, for the reasons you named. But (from my understanding) this _strict_ platform dependency applies mainly to generative AI use cases, doesn't it? In discriminative use cases, LLMs like GPT-X can be used to create the proprietary data itself (at least partially). For instance, we currently like using GPT-3 to create labeled datasets and then training super simple models on top, e.g. an open-source encoder (distilbert or something from HF) for the embeddings plus a simple old-school logistic regression :-) Maybe there is something like partial platform dependency (e.g. for building, but not for inference)? But those are just my thoughts.
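The workflow this comment describes (LLM-labeled data, frozen embeddings, a simple classifier) can be sketched roughly as follows. This is only an illustrative sketch: the texts and labels are made up stand-ins for GPT-3 output, and `HashingVectorizer` stands in for a real encoder like distilbert so the example runs offline.

```python
# Sketch: LLM-assisted labeling -> cheap downstream classifier.
# HashingVectorizer is a stand-in for a frozen encoder (e.g. distilbert
# embeddings via HuggingFace); the labels below pretend to come from GPT-3.
from sklearn.feature_extraction.text import HashingVectorizer
from sklearn.linear_model import LogisticRegression

texts = [
    "great product, loved it",
    "terrible, broke after a day",
    "works as advertised",
    "waste of money",
]
llm_labels = [1, 0, 1, 0]  # hypothetical labels produced by an LLM prompt

embed = HashingVectorizer(n_features=256).transform  # stand-in encoder
clf = LogisticRegression().fit(embed(texts), llm_labels)

# The "platform" (LLM) was only needed to build the training set;
# inference runs entirely on the small local model.
prediction = clf.predict(embed(["absolutely fantastic, loved it"]))[0]
```

In a real setup you would swap the vectorizer for actual encoder embeddings and validate a sample of the LLM-generated labels by hand, since label noise flows straight into the classifier.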

Let's see. Really exciting times :)
