AI Chips
NVIDIA logo

$NVDA

🇺🇸

Dominates AI training and inference with its GPU and data center accelerator platforms including H100, B200, and GB200. Commands ~89% of the AI accelerator market with $130B+ annual revenue run rate and a $3T+ market cap. Aims to power the entire AI compute stack from cloud training to edge inference, autonomous vehicles, and robotics.

Company Profile

GPUs & Accelerators

The undisputed king of AI compute — designs the GPUs that train virtually every major AI model.

Key Products & Platforms

H100

Data Center GPU

Current workhorse for AI training

B200 / GB200

Data Center GPU

Next-gen Blackwell architecture

A100

Data Center GPU

Previous-gen, still widely deployed

DGX Systems

AI Supercomputer

Turnkey AI training servers

CUDA / cuDNN

Software Platform

Industry-standard AI dev ecosystem

DRIVE Platform

Automotive AI

Self-driving compute platform

Key Customers

MicrosoftGoogleAmazonMetaTeslaOracleCoreWeave

Competitive Position

Market Share

~80% of AI accelerator market

Competitive Moat

CUDA software lock-in, 15+ years of ecosystem development

Key Risk

Customer concentration in hyperscalers; custom chip competition from Google TPU, Amazon Trainium

Why This Company Matters

If you want to understand AI infrastructure, start here. NVIDIA's GPUs are the foundation of virtually every AI system — from ChatGPT to autonomous vehicles. Their dominance is unmatched in tech.

Key Milestones

Apr 1993
Milestone

Founded April 5 by Jensen Huang (ex-LSI Logic, AMD), Chris Malachowsky (Sun) and Curtis Priem (IBM, Sun) at a Denny's in San Jose with $40K; named from Latin 'invidia', initial focus on PC graphics accelerators.

May 1995
Launch

Released NV1, NVIDIA’s first chip on Sega Saturn-style quadrilateral rendering; commercial flop that nearly bankrupted the company before SGS-Thomson and Sega rescue funding.

Apr 1997
Launch

Released RIVA 128, the company’s first commercially successful 3D graphics chip; sold 1M units in four months and saved NVIDIA from the brink of insolvency.

Jan 1999
Milestone

IPO on NASDAQ January 22 at $12/share, raising $42M; small offering reflected modest revenue (~$160M) but funded the Riva and TNT2 graphics generations.

Aug 1999
Launch

Announced GeForce 256 August 31 and shipped October 11; coined the term 'GPU' (Graphics Processing Unit) by integrating hardware transform-and-lighting on a single chip with 23M transistors.

Dec 2000
Milestone

Acquired struggling rival 3dfx Interactive for $112M after the Voodoo card maker filed bankruptcy; NVIDIA absorbed engineering talent and IP that powered the GeForce3.

Feb 2001
Launch

GeForce3 launched with first programmable vertex and pixel shaders, the foundational architecture for every modern GPU; Microsoft selected the chip for the original Xbox console.

Nov 2006
Launch

Unveiled GeForce 8800 GTX (G80) with unified shaders and first CUDA-capable architecture; laid the groundwork for general-purpose GPU computing and modern AI training.

Feb 2007
Launch

Released CUDA 1.0 SDK February 15 for Windows and Linux; opened the GPU as a programmable parallel processor and seeded the scientific-computing and deep-learning communities.

Jun 2008
Launch

Launched Tesla brand for HPC GPUs (Tesla C870/C1060) priced at $1,500-$1,700; first credible move beyond gaming into the data-center compute market.

Sep 2012
Milestone

AlexNet won ImageNet using two NVIDIA GTX 580 GPUs, dropping image-classification error rates from 26% to 15%; the watershed moment validated GPUs as the substrate of modern deep learning.

Nov 2014
Launch

Tesla K80 launched, GK210 Kepler-based dual-GPU accelerator targeting deep learning workloads; the chip became the workhorse of early academic and hyperscaler ML training.

Apr 2016
Launch

Tesla P100 unveiled at GTC 2016, first Pascal data-center GPU with HBM2 memory and NVLink interconnect; targeted deep-learning training and HPC at $5,000+ per GPU.

Apr 2016
Product

DGX-1 announced at GTC 2016 as first purpose-built AI supercomputer-in-a-box; eight P100 GPUs at $129K, with first system hand-delivered by Jensen Huang to OpenAI.

May 2017
Launch

Tesla V100 with Volta architecture announced at GTC, introducing Tensor Cores for AI acceleration at 125 TFLOPS FP16; powered the GPT-2/GPT-3 training era at OpenAI and Microsoft.

Mar 2019
Milestone

Announced $6.9B acquisition of Mellanox to bolster networking for data-center AI fabrics; preempted Intel and Microsoft bids and locked in InfiniBand for next-gen GPU clusters.

Apr 2020
Milestone

Closed $7B Mellanox acquisition after Chinese antitrust clearance, gaining InfiniBand and Ethernet IP that became the spine of every NVIDIA reference AI cluster from DGX-2 onward.

May 2020
Launch

A100 launched at GTC 2020 on TSMC 7nm with 40GB HBM2e and 19.5 TFLOPS FP32; introduced multi-instance GPU (MIG) and became the dominant LLM training chip through 2022.

Sep 2020
Milestone

Announced $40B agreement to acquire Arm Holdings from SoftBank; the deal would have given NVIDIA control of the world's most-used CPU instruction set but drew immediate global regulatory scrutiny.

Feb 2022
Regulatory

Abandoned $40B Arm acquisition due to FTC and global regulatory blockades; SoftBank kept the $1.25B prepayment and prepared Arm for its 2023 IPO.

Mar 2022
Launch

Hopper H100 announced at GTC March 22 — 80B transistors on TSMC 4N, 80GB HBM3 and Transformer Engine for FP8; became the de-facto LLM training chip and pushed lead times to 11+ months by mid-2023.

Sep 2022
Product

H100 entered full production with hyperscaler partner systems shipping in early Q4; Microsoft, AWS, Google and Oracle placed initial orders for 100K+ GPUs each.

Oct 2022
Regulatory

US BIS export controls of October 7 banned A100/H100 sales to China; NVIDIA pivoted to A800/H800 China-specific SKUs with reduced interconnect bandwidth, preserving most China revenue.

Nov 2022
Milestone

OpenAI launched ChatGPT November 30, igniting a global AI compute demand surge that Jensen Huang called AI's 'iPhone moment'; data-center orders surged from <$4B/quarter to $14B+ within four quarters.

Jan 2023
Commercial

DGX H100 systems began shipping to enterprise customers, anchoring the first wave of post-ChatGPT AI buildouts; Microsoft Azure and Oracle Cloud took the first racks for OpenAI workloads.

May 2023
Milestone

Crossed $1T market cap on May 30 on a record AI revenue forecast (Q1 FY24 guide of $11B vs $7.2B consensus); joined Apple, Microsoft, Alphabet and Aramco in the trillion-dollar club.

Oct 2023
Regulatory

October 17 BIS update banned A800/H800 China-specific chips and added performance-density rule covering future SKUs; NVIDIA began designing H20/L20 for the China market under the new caps.

Nov 2023
Product

H200 announced at SC23, first GPU with HBM3e memory at 141GB and 4.8 TB/s bandwidth; targeted Q2 2024 ship date and locked in SK Hynix as lead HBM3e supplier.

Feb 2024
Milestone

Surpassed $2T market cap intraday February 23, becoming the third US firm to close above the threshold; took just nine months from $1T to $2T, the fastest such doubling in market history.

Mar 2024
Launch

Blackwell B200 and GB200 superchip unveiled at GTC March 18, 208B transistors with up to 20 PFLOPS FP4 per GPU; built on TSMC 4NP and used CoWoS-L packaging with 192GB HBM3e.

Mar 2024
Product

GB200 NVL72 announced at GTC, liquid-cooled rack of 36 Grace Blackwell superchips for trillion-parameter LLMs; 1.4 exaFLOPS FP4 compute per rack, priced at ~$3M each.

Jun 2024
Milestone

Surpassed $3T market cap June 5 and briefly became the world's most valuable company on June 18 at ~$3.34T, displacing Microsoft and Apple atop the global leaderboard.

Dec 2024
Commercial

GB200 NVL72 systems began shipping to AWS, Microsoft, Google and Oracle after multi-quarter ramp delays; CoWoS-L packaging yields finally crossed acceptable threshold in Q4.

Mar 2025
Product

Blackwell Ultra B300 unveiled at GTC March 18, 1.5x B200 performance with 288GB HBM3e and 15 PFLOPS dense FP4; targeted as bridge product before Rubin platform in 2026.

Apr 2025
Regulatory

Took $5.5B Q1 charge on H20 China inventory after April BIS rule blocking last China-tailored Hopper SKU; Jensen Huang publicly criticized the rule as an own-goal helping Huawei.

Jul 2025
Milestone

Became first $4T public company on July 9, capping a 10x rally since ChatGPT launch; market cap doubled in 13 months from the $2T close in February 2024.

Jul 2025
Regulatory

US Trump administration permitted resumed sales of H20 to China, reversing April ban after diplomatic deal; NVIDIA recovered partial inventory write-down via fresh shipments.

Oct 2025
Milestone

Crossed $5T market cap intraday October 29, the first company ever to reach that milestone; just 3.5 months after the $4T close, fastest such trillion-dollar leg in history.

Nov 2025
Commercial

Amazon EC2 P6-B300 instances with Blackwell Ultra became generally available, anchoring AWS AI fleet upgrades; instance prices started around $98/hour for 8x B300 nodes.

Nov 2025
Milestone

Began Rubin GPU production at TSMC on N3 node ahead of GTC 2026 mass announcement; SK Hynix HBM4 stacks shipped from September were paired in CoWoS-L packaging.

Jan 2026
Launch

Vera Rubin platform announced at CES, Vera CPU + dual Rubin GPU in one package with NVLink 6 and ConnectX-9; positioned as full vertically-integrated AI factory replacement for Blackwell.

Apr 2026
Commercial

Vera Rubin entered full production at GTC April 2026; locked in 336B transistors, 50 PFLOPS per GPU and 288GB HBM4; AWS, Google, Microsoft and Oracle named as first cloud takers.

Ask Sterling

Register for a premium account to gain access to Sterling AI.

Get Started

Things you can ask Sterling:

Summarize Tesla's latest earnings reportWhy did NVIDIA's margins expand?Compare Apple vs Microsoft's cash flowWhat's driving the EV sector growth?
AI Chips - NVIDIA | Sterling