Discussion about this post

Stewart

Thanks for the write-up. I'm keenly interested in watching this market develop over the next few years. If we really do get AGI, or something very close to it, I can imagine that the compute associated with inference will be so unbelievably large that this industry has a decade of incredible growth ahead of it.

Jaroslav Sýkora

You mention a caveat: "If small, customized models become the default for AI applications, this is a market unlocker for these companies, and the value prop becomes much more attractive."

I strongly believe this is the case. We already know that inference-as-a-service (IaaS) does not scale the same way SaaS does, so economically there might not be a good reason to build big datacenters just for inference. Customers have legitimate security concerns about handing their precious data over to a cloud; they would rather have inference on-site, customized and integrated into their software. Model distillation will help here.
