The AI Infrastructure Stack
Layer 02

Foundation Models

The labs that train the models the rest of the stack runs on.

What this layer does

This is the layer of capital-burning research labs. A frontier training run costs hundreds of millions to billions of dollars in compute alone, before headcount or data. The output is a model that other layers either resell (cloud APIs), wrap (applications), or compete with (open-weight challengers).

The economics here are extreme: revenue scales fast, but inference compute cost scales with every token served, so gross margins hinge on how cheaply the lab can serve inference. That serving cost depends on the model architecture, the hardware it's tuned for, and any custom silicon (Anthropic on AWS Trainium, Google on TPU).
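The serving-cost sensitivity can be sketched with a toy calculation. All prices and costs below are illustrative assumptions, not reported figures from any lab:

```python
# Hypothetical illustration of how serving cost drives gross margin.
# The dollar figures are made-up assumptions for the sketch.

def gross_margin(price_per_mtok: float, cost_per_mtok: float) -> float:
    """Gross margin as a fraction of revenue, per million tokens served."""
    return (price_per_mtok - cost_per_mtok) / price_per_mtok

# Assume a lab charges $10 per million output tokens and pays $4 to serve them:
margin_baseline = gross_margin(10.0, 4.0)   # 0.60, i.e. 60% gross margin

# Halving serving cost (cheaper silicon, a leaner architecture) at the same price:
margin_cheaper = gross_margin(10.0, 2.0)    # 0.80, i.e. 80% gross margin
```

The point of the sketch: at a fixed price, every dollar shaved off inference cost falls straight through to gross margin, which is why labs tune models to specific hardware.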

There are no pure-play public model labs. Exposure comes through the partners and infrastructure each lab depends on.

Sub-categories

Analysis coming soon — will cover: training cost trajectories, the open vs. closed debate, post-training as the new moat, why every frontier lab is bolted to a hyperscaler, and how to get public-market exposure to lab winners.