Market Research · April 2025 · 8 min read

The AI Stack: A Full-Market Map for 2025

Seven layers. Hundreds of companies. One map. Here's how I think about the entire AI ecosystem — from chips to chatbots — and where the action really is.

Urjit Patel
Urjit Patel
Head of Commercial Data Science · S&P Global

People ask me all the time to explain "the AI space" at a high level. Not the math — the market. Who's building what, how the pieces connect, and where things get interesting.

After years of building AI systems in finance, here's the mental model I keep coming back to. Think of AI like a stack of building blocks. Seven layers, each one depending on everything below it. Click any layer below to explore it.

The Stack

// click any layer to expand · ordered top (user-facing) → bottom (infrastructure)

L6 Agents & Applications where users actually live

The products people interact with. Copilots, autonomous agents, domain-specific AI tools. This is where distribution matters more than model quality — the best product wins, not the best model.

Microsoft CopilotCursor PerplexityHarvey GleanDevinCharacter.ai
Product thinkingDomain expertise Agent designPrompt engineering
L5 Observability, Safety & LLMOps keeping AI trustworthy in prod

The operational layer that makes AI actually reliable. Evaluation frameworks, cost tracking, guardrails, prompt versioning, red-teaming. Mostly invisible until something goes wrong. Increasingly non-negotiable in enterprise.

LangSmithWeights & Biases Arize AIBraintrust HeliconeGuardrails AIPromptfoo
Eval frameworksRed-teaming Cost optimizationSafety & alignment
L4 Orchestration & Frameworks the glue between everything

Frameworks that wire models to data, tools, and memory. This is where most AI engineers spend their day. It's also the most contested, fastest-commoditizing layer in the whole stack — what's a moat today is a library next year.

LangChainLlamaIndex LangGraphCrewAI AutoGenDSPyHaystack
RAG architectureTool use / function calling Multi-agent coordinationChain design
L3 Data & Memory where AI knowledge actually lives

Vector databases, embeddings, knowledge graphs. This is where AI systems store and retrieve context. Critical for RAG — the quality of your retrieval directly determines the quality of your output.

PineconeWeaviate Qdrantpgvector ChromaMilvusNeo4j
Embedding strategiesChunking & indexing Hybrid searchKnowledge graphs
L2 Model APIs & Serving pay-per-token access to intelligence

Hosted inference — call a model via API without touching any hardware. Cloud platforms like Azure and Bedrock add enterprise compliance on top. Inference cost is the main competitive variable here. It's racing toward zero.

OpenAI APIAnthropic API Azure OpenAIAWS Bedrock Google Vertex AITogether AIGroq
API integrationModel evaluation Fine-tuningCost & latency tradeoffs
L1 Foundation Models the intelligence substrate

The big pre-trained models everything else is built on. Quality differences still exist, but the gap is closing fast — open source is roughly 6–12 months behind frontier. This is the hardest layer to enter and the hardest to sustain a lead in.

OpenAI (GPT-4o)Anthropic (Claude) Google (Gemini)Meta (LLaMA 3) MistralDeepSeekCohere
Pre-trainingRLHF / DPO Architecture researchData curation at scale
L0 Hardware & Infrastructure the physical substrate everything runs on

GPUs, TPUs, custom silicon, and the cloud datacenters that house them. NVIDIA doesn't just win because of H100 hardware — they win because of a decade of CUDA tooling, libraries, and developer lock-in that nobody can easily replicate.

NVIDIA (H100, B200)Google TPU AWS TrainiumAMD MI300X Intel GaudiCerebras
CUDA programmingDistributed training HW-aware optimizationSystems engineering

Where is the value actually captured?

Not all layers are equal. The honest picture:

LayerValue captureMoatTrend
L0 · HardwareExtremely highStrong CUDA lock-inCustom silicon rising
L1 · Foundation ModelsHigh — for nowStrong quality + trustOpen source closing fast
L2 · APIs & ServingModerate, thin marginsMedium enterprise SLAsRace to zero on cost
L3 · Data & MemoryModerateMedium data gravityOpen source pressure
L4 · OrchestrationLow–ModerateWeak mindshare onlyCommoditizing quickly
L5 · ObservabilityModerate, growingMedium workflow depthRegulation is a tailwind
L6 · Agents & AppsHigh if domain-specificStrong distribution + dataMassive greenfield
// the_honest_take

The AI market isn't one market — it's seven different competitive games being played simultaneously. Value concentrates at the extremes: hardware (irreplaceable compute) and applications (irreplaceable distribution). The middle layers are infrastructure. Necessary, but increasingly interchangeable.

If you're building a company, you want to be at L0 or L6. If you're building a career, you want to understand all seven — because the practitioners who think in terms of the full stack are the ones who actually solve hard production problems.

👍
0 comments
LinkedIn Email
// discussion 0
Sign in to leave a comment
© 2025 Urjit Patel · urjitlabs.com
Share on LinkedIn ← back to writing