AI GPU Pricing: What H100, A100, B200 and DGX Systems Cost
7 min read
AI and LLM costs

Table of Contents
A single NVIDIA H100 costs between $27,000 and $40,000 to buy, a B200 runs $30,000 to $50,000, and a fully built GB200 NVL72 rack can reach $3 million or more. AI GPU pricing spans three orders of magnitude depending on the chip, the form factor, and whether you buy a card or a complete system. For any team budgeting AI infrastructure, that capital number is only the start, which is why hardware spend belongs inside a FinOps practice from day one.
This guide breaks AI GPU pricing into four parts:
Per-chip prices for the GPUs everyone trains on
Full system and DGX prices once networking and CPUs are added
Price per performance, so the sticker number is not read in isolation
Buy versus rent, the decision that actually moves the budget
Per-Chip AI GPU Prices
Purchase prices for the data-center cards, per GPU:
GPU | Memory | Price per GPU (new) |
|---|---|---|
NVIDIA H100 | 80GB | $27,000 to $40,000 |
NVIDIA A100 | 40GB | $10,000 to $12,000 |
NVIDIA A100 | 80GB | $15,000 to $17,000 |
NVIDIA H200 | 141GB | $25,000 to $35,000 |
NVIDIA B200 (Blackwell) | 192GB | $30,000 to $50,000 |
NVIDIA GB200 superchip | 2 GPUs + Grace CPU | up to $70,000 |
AMD MI300X | 192GB | $10,000 to $15,000 |
NVIDIA card prices, including an 8-GPU H200 configuration quoted near $315,000, come from a current price guide. The GB200 superchip figure is from reported Blackwell pricing, and the H200 and MI300X per-GPU prices and memory specs from a spec comparison.
A few buying notes:
Memory tier drives the price: The jump from A100 40GB to 80GB adds roughly $5,000 for the same chip.
Secondhand cards are increasingly available: Used H100 and A100 supply has grown as teams upgrade to newer Blackwell parts, pulling resale prices below new.
The H100 vs A100 decision is the most common one: If a model fits in A100 memory, the H100 premium is wasted, a trade-off covered in H100 vs A100.
Full System and DGX Prices
A bare GPU is not a usable machine. Once CPUs, networking, storage, and chassis are added, the real number climbs:
System | Configuration | Approx price |
|---|---|---|
DGX H100 | 8x H100 | ~$290,000 |
DGX B200 | 8x B200 | ~$515,000 |
GB200 NVL72 rack | 72x B200 + 36x Grace | ~$3,000,000 |
System prices come from a current server pricing guide.
What inflates a system above the raw card cost:
High-speed interconnect (NVLink, InfiniBand) for multi-GPU training
Grace CPUs and host memory on superchip configurations
Storage and networking sized to keep the GPUs fed
Power and cooling for racks that can draw over 100kW
The point that matters for budgeting: choosing the card is about more than its specs, it is about cost per finished run, which is the core of picking the right GPU for training.
Price Per Performance
The sticker price means little without throughput. Normalized to compute:
GPU | Cost per FP16 TFLOP (list) | Strength |
|---|---|---|
H100 | $16 to $20 | Broad training and inference |
B200 | $8 to $12 | Best cost per result, compute-bound training |
MI300X | Competitive, higher memory | Memory-bound inference |
Price-per-performance figures come from a price-performance analysis.
How to use these numbers:
A higher sticker price can be cheaper per result. A B200 costs more than an H100 but can finish the same work for less.
Match the chip to the bottleneck. Memory-bound inference favors high-HBM parts like the MI300X; compute-bound training favors Blackwell.
Generational jumps reset the math. Newer silicon usually wins on cost per TFLOP even at a higher headline price.
Buy vs Rent
The biggest budget decision is whether to own the hardware at all:
Buy when utilization is high and steady. Owning an H100 pays off only if it runs most of the time. At low utilization, a $30,000 card sits idle while depreciating.
Rent when demand is spiky or experimental. Cloud H100 capacity rents from roughly $2 to $8 per GPU per hour, with spot rates near $1.43 (rental rates), so short jobs almost never justify a purchase.
Blend for most teams. Own a baseline fleet for steady inference, rent for training bursts and new experiments.
Whichever path you take, the spend has to be tracked, because owned GPUs hide waste just as easily as rented ones:
Owned hardware buries cost in idle racks and underused capacity, often inside Kubernetes clusters where Kubernetes cost management surfaces the waste.
Rented GPUs drift on the invoice as usage scales.
Either way, every GPU hour needs an owner. Amnic lets you allocate cost back to teams and models so no spend goes untagged, the foundation of FinOps for AI.
Does AI GPU Pricing Vary by Region?
Yes. The same chip carries a different price depending on where you rent or buy it, driven by energy costs, supply, and trade rules.
Cloud rental rates by region, per GPU per hour:
Region | H100 on-demand | A100 on-demand |
|---|---|---|
North America | $1.99 to $4.12 | $1.75 to $3.58 |
Europe | $2.00 to $6.39 | $1.58 to $3.27 |
North America stays the cheapest and most stable, while European rates run higher at the top of the range on account of energy costs and thinner spot supply (regional pricing data). Within a single cloud, US regions are usually the lowest, with other regions adding a surcharge.
Trade rules also shape availability and price:
Allied nations such as the US, UK, Canada, Japan and Germany face no procurement restrictions and get the best pricing and supply.
H100, H200 and Blackwell parts are barred from a set of restricted countries, which fragments supply and lifts prices in markets that rely on limited channels.
Stripped-down variants such as the H20 serve restricted markets in place of the full chips, so the headline price does not map to the same performance everywhere.
The practical takeaway:
Price the workload in the region you will actually run it, not the cheapest list rate you can find.
Factor energy cost into owned hardware, since a region with high power prices raises the real cost of every GPU-hour.
Real Cost Examples
The sticker prices only matter once you run the math on a real workload. Four worked examples, using the rates above:
Scenario | Calculation | Result |
|---|---|---|
Spot vs on-demand, 200-hour run on 8x H100 | $2.53 vs $1.43 per GPU-hour, times 8 GPUs times 200 hours | $4,050 vs $2,290, a 43% saving |
Buy vs rent break-even, DGX H100 ($290,000) vs neo-cloud | $290,000 / $20.24 per hour for 8 GPUs | ~14,300 GPU-hours, ~1.6 years at 24/7 |
Buy vs rent break-even, DGX H100 ($290,000) vs hyperscaler | $290,000 / $55.04 per hour for 8 GPUs | ~5,300 hours, ~7 months at 24/7 |
Cost of idle, DGX H100 over 3 years | $1.40 per GPU-hour at full use, divided by 30% utilization | ~$4.60 per productive GPU-hour |
H100 vs B200, same job | $20,000 at ~$18 per TFLOP vs ~$10 per TFLOP | ~$11,000 on B200, ~45% less |
These figures exclude power, cooling, and depreciation on owned hardware.
How to Keep AI GPU Spend Under Control
A short checklist regardless of chip or buying model:
Normalize to cost per useful output, a finished training run or per million tokens, not the purchase price or hourly rate
Tag every GPU to a team, model, or workload from the start
Track utilization, since idle high-end cards are the single largest source of waste
Compare buy and rent on total cost of ownership, including power, cooling, and depreciation
Alert on drift so a cost change surfaces while a job is still running
Conclusion
AI GPU pricing runs from a $10,000 A100 to a near $3 million Blackwell rack, and the right number depends on the chip, the system around it, the work it does, and whether you own it. The prices will keep moving as supply loosens and new silicon ships. What stays constant is the need to see GPU spend clearly, allocate it accurately, and catch cost anomalies before they compound.
FAQs
How much does an NVIDIA H100 cost?
A new H100 costs $27,000 to $40,000 per GPU. Used and secondhand cards trade below that as supply eases and teams upgrade to newer Blackwell hardware.
How much does a DGX server cost?
A DGX H100 server runs about $290,000, a DGX B200 about $515,000, and a full GB200 NVL72 rack reaches roughly $3,000,000.
What is the cheapest AI GPU to buy?
Among data-center cards, the A100 40GB is the lowest at about $10,000 to $12,000. The AMD MI300X is also competitively priced at roughly $10,000 to $15,000 with more memory than the H200.
Is the B200 worth more than the H100?
Often yes. A B200 costs more per chip but delivers roughly $8 to $12 per FP16 TFLOP against the H100's $16 to $20, so it can finish the same work for less.
Should I buy or rent AI GPUs?
Buy when utilization is high and steady, since an owned H100 only pays off if it runs most of the time. Rent for spiky or experimental work, where cloud H100s start near $2 per GPU per hour.
How does the AMD MI300X compare on price?
The MI300X is priced around $10,000 to $15,000 per GPU and offers more memory than the H200, 192GB versus 141GB, making it a strong value for memory-bound inference.
Does AI GPU pricing vary by region?
Yes. North American cloud rates are the lowest, often $1.99 to $4.12 per H100 GPU-hour, while European rates reach $6.39 on higher energy costs. Trade controls also bar H100, H200 and Blackwell chips from some countries, raising prices where supply is limited.
FinOps OS powered by context-aware AI agents.
Start with a 30-day no-cost trial.
Read-only.
No credit card.
No commitment.
Want to assess how your FinOps journey can scale?
Benchmark maturity, close governance gaps, and drive ROI in under 20 minutes

Recommended Articles

Anthropic vs OpenAI: A Cost and Capability Comparison for Engineering Teams
Read More

Anthropic API Pricing Explained: How to Estimate and Control LLM Costs
Read More

Mistral API Pricing Explained: How to Estimate and Control Your Token Costs
Read More

Gemini API Pricing Explained: How to Estimate and Control Your Token Costs
Read More

OpenAI API Pricing Explained: How to Estimate and Control Your Token Costs
Read More

LLM Cost Comparison: OpenAI vs Anthropic vs Gemini vs Mistral API Costs Compared
Read More






