Back

AI GPU Pricing: What H100, A100, B200 and DGX Systems Cost

June 17, 2026

7 min read

Amnic

AI and LLM costs

No headings found on page

A single NVIDIA H100 costs between $27,000 and $40,000 to buy, a B200 runs $30,000 to $50,000, and a fully built GB200 NVL72 rack can reach $3 million or more. AI GPU pricing spans three orders of magnitude depending on the chip, the form factor, and whether you buy a card or a complete system. For any team budgeting AI infrastructure, that capital number is only the start, which is why hardware spend belongs inside a FinOps practice from day one.

This guide breaks AI GPU pricing into four parts:

Per-chip prices for the GPUs everyone trains on
Full system and DGX prices once networking and CPUs are added
Price per performance, so the sticker number is not read in isolation
Buy versus rent, the decision that actually moves the budget

Per-Chip AI GPU Prices

Purchase prices for the data-center cards, per GPU:

GPU	Memory	Price per GPU (new)
NVIDIA H100	80GB	$27,000 to $40,000
NVIDIA A100	40GB	$10,000 to $12,000
NVIDIA A100	80GB	$15,000 to $17,000
NVIDIA H200	141GB	$25,000 to $35,000
NVIDIA B200 (Blackwell)	192GB	$30,000 to $50,000
NVIDIA GB200 superchip	2 GPUs + Grace CPU	up to $70,000
AMD MI300X	192GB	$10,000 to $15,000

NVIDIA card prices, including an 8-GPU H200 configuration quoted near $315,000, come from a current price guide. The GB200 superchip figure is from reported Blackwell pricing, and the H200 and MI300X per-GPU prices and memory specs from a spec comparison.

A few buying notes:

Memory tier drives the price: The jump from A100 40GB to 80GB adds roughly $5,000 for the same chip.
Secondhand cards are increasingly available: Used H100 and A100 supply has grown as teams upgrade to newer Blackwell parts, pulling resale prices below new.
The H100 vs A100 decision is the most common one: If a model fits in A100 memory, the H100 premium is wasted, a trade-off covered in H100 vs A100.

Full System and DGX Prices

A bare GPU is not a usable machine. Once CPUs, networking, storage, and chassis are added, the real number climbs:

System	Configuration	Approx price
DGX H100	8x H100	~$290,000
DGX B200	8x B200	~$515,000
GB200 NVL72 rack	72x B200 + 36x Grace	~$3,000,000

System prices come from a current server pricing guide.

What inflates a system above the raw card cost:

High-speed interconnect (NVLink, InfiniBand) for multi-GPU training
Grace CPUs and host memory on superchip configurations
Storage and networking sized to keep the GPUs fed
Power and cooling for racks that can draw over 100kW

The point that matters for budgeting: choosing the card is about more than its specs, it is about cost per finished run, which is the core of picking the right GPU for training.

Price Per Performance

The sticker price means little without throughput. Normalized to compute:

GPU	Cost per FP16 TFLOP (list)	Strength
H100	$16 to $20	Broad training and inference
B200	$8 to $12	Best cost per result, compute-bound training
MI300X	Competitive, higher memory	Memory-bound inference

Price-per-performance figures come from a price-performance analysis.

How to use these numbers:

A higher sticker price can be cheaper per result. A B200 costs more than an H100 but can finish the same work for less.
Match the chip to the bottleneck. Memory-bound inference favors high-HBM parts like the MI300X; compute-bound training favors Blackwell.
Generational jumps reset the math. Newer silicon usually wins on cost per TFLOP even at a higher headline price.

Buy vs Rent

The biggest budget decision is whether to own the hardware at all:

Buy when utilization is high and steady. Owning an H100 pays off only if it runs most of the time. At low utilization, a $30,000 card sits idle while depreciating.
Rent when demand is spiky or experimental. Cloud H100 capacity rents from roughly $2 to $8 per GPU per hour, with spot rates near $1.43 (rental rates), so short jobs almost never justify a purchase.
Blend for most teams. Own a baseline fleet for steady inference, rent for training bursts and new experiments.

Whichever path you take, the spend has to be tracked, because owned GPUs hide waste just as easily as rented ones:

Owned hardware buries cost in idle racks and underused capacity, often inside Kubernetes clusters where Kubernetes cost management surfaces the waste.
Rented GPUs drift on the invoice as usage scales.
Either way, every GPU hour needs an owner. Amnic lets you allocate cost back to teams and models so no spend goes untagged, the foundation of FinOps for AI.

Does AI GPU Pricing Vary by Region?

Yes. The same chip carries a different price depending on where you rent or buy it, driven by energy costs, supply, and trade rules.

Cloud rental rates by region, per GPU per hour:

Region	H100 on-demand	A100 on-demand
North America	$1.99 to $4.12	$1.75 to $3.58
Europe	$2.00 to $6.39	$1.58 to $3.27

North America stays the cheapest and most stable, while European rates run higher at the top of the range on account of energy costs and thinner spot supply (regional pricing data). Within a single cloud, US regions are usually the lowest, with other regions adding a surcharge.

Trade rules also shape availability and price:

Allied nations such as the US, UK, Canada, Japan and Germany face no procurement restrictions and get the best pricing and supply.
H100, H200 and Blackwell parts are barred from a set of restricted countries, which fragments supply and lifts prices in markets that rely on limited channels.
Stripped-down variants such as the H20 serve restricted markets in place of the full chips, so the headline price does not map to the same performance everywhere.

The practical takeaway:

Price the workload in the region you will actually run it, not the cheapest list rate you can find.
Factor energy cost into owned hardware, since a region with high power prices raises the real cost of every GPU-hour.

Real Cost Examples

The sticker prices only matter once you run the math on a real workload. Four worked examples, using the rates above:

Scenario	Calculation	Result
Spot vs on-demand, 200-hour run on 8x H100	$2.53 vs $1.43 per GPU-hour, times 8 GPUs times 200 hours	$4,050 vs $2,290, a 43% saving
Buy vs rent break-even, DGX H100 ($290,000) vs neo-cloud	$290,000 / $20.24 per hour for 8 GPUs	~14,300 GPU-hours, ~1.6 years at 24/7
Buy vs rent break-even, DGX H100 ($290,000) vs hyperscaler	$290,000 / $55.04 per hour for 8 GPUs	~5,300 hours, ~7 months at 24/7
Cost of idle, DGX H100 over 3 years	$1.40 per GPU-hour at full use, divided by 30% utilization	~$4.60 per productive GPU-hour
H100 vs B200, same job	$20,000 at ~$18 per TFLOP vs ~$10 per TFLOP	~$11,000 on B200, ~45% less

These figures exclude power, cooling, and depreciation on owned hardware.

How to Keep AI GPU Spend Under Control

A short checklist regardless of chip or buying model:

Normalize to cost per useful output, a finished training run or per million tokens, not the purchase price or hourly rate
Tag every GPU to a team, model, or workload from the start
Track utilization, since idle high-end cards are the single largest source of waste
Compare buy and rent on total cost of ownership, including power, cooling, and depreciation
Alert on drift so a cost change surfaces while a job is still running

Conclusion

AI GPU pricing runs from a $10,000 A100 to a near $3 million Blackwell rack, and the right number depends on the chip, the system around it, the work it does, and whether you own it. The prices will keep moving as supply loosens and new silicon ships. What stays constant is the need to see GPU spend clearly, allocate it accurately, and catch cost anomalies before they compound.

FAQs

How much does an NVIDIA H100 cost?

A new H100 costs $27,000 to $40,000 per GPU. Used and secondhand cards trade below that as supply eases and teams upgrade to newer Blackwell hardware.

How much does a DGX server cost?

A DGX H100 server runs about $290,000, a DGX B200 about $515,000, and a full GB200 NVL72 rack reaches roughly $3,000,000.

What is the cheapest AI GPU to buy?

Among data-center cards, the A100 40GB is the lowest at about $10,000 to $12,000. The AMD MI300X is also competitively priced at roughly $10,000 to $15,000 with more memory than the H200.

Is the B200 worth more than the H100?

Often yes. A B200 costs more per chip but delivers roughly $8 to $12 per FP16 TFLOP against the H100's $16 to $20, so it can finish the same work for less.

Should I buy or rent AI GPUs?

Buy when utilization is high and steady, since an owned H100 only pays off if it runs most of the time. Rent for spiky or experimental work, where cloud H100s start near $2 per GPU per hour.

How does the AMD MI300X compare on price?

The MI300X is priced around $10,000 to $15,000 per GPU and offers more memory than the H200, 192GB versus 141GB, making it a strong value for memory-bound inference.

Does AI GPU pricing vary by region?

Yes. North American cloud rates are the lowest, often $1.99 to $4.12 per H100 GPU-hour, while European rates reach $6.39 on higher energy costs. Trade controls also bar H100, H200 and Blackwell chips from some countries, raising prices where supply is limited.

Better visibility and management into AI Tokens?

Start with a 30 day trial

Connect leading LLMs

24 hour time to value

Stay ahead of AI Spend

Request a Demo