GPU Cloud Pricing — Compare 40+ Providers in Real-Time

Find the cheapest GPU for ML training, inference, and rendering. Updated every 15 minutes.

40+ Providers
15+ GPU Models
Real-time Pricing

Popular GPUs

Real-time pricing from top cloud providers

GPU Cost Calculators

Estimate costs before you commit

How to Choose the Right GPU for Your Workload

Selecting the optimal GPU for your machine learning training, inference, or rendering workload depends on several key factors: VRAM capacity, compute performance (measured in TFLOPS), memory bandwidth, and, critically, cost per hour.
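As a rough first pass, the trade-off above can be sketched as a cost-per-TFLOP comparison. All specs and hourly rates below are illustrative placeholders, not live pricing data; always check a current price feed before deciding:

```python
# Hypothetical spec/price table for illustration only.
# Format: name -> (vram_gb, fp16_tflops, usd_per_hour)
gpus = {
    "H100 SXM5": (80, 989, 3.50),
    "A100 80GB": (80, 312, 1.80),
    "L40S":      (48, 362, 1.10),
    "RTX 4090":  (24, 165, 0.44),
}

def dollars_per_tflop_hour(tflops, usd_per_hour):
    """Cost efficiency metric: lower is better, but only compare GPUs
    that have enough VRAM to hold your model in the first place."""
    return usd_per_hour / tflops

for name, (vram_gb, tflops, price) in gpus.items():
    metric = dollars_per_tflop_hour(tflops, price)
    print(f"{name:>10} | {vram_gb} GB | ${metric:.5f} per TFLOP-hour")
```

Raw $/TFLOP-hour favors consumer cards, which is why the RTX 4090 wins for small research runs; VRAM and NVLink bandwidth are what push large-scale training back toward datacenter GPUs.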

For large language model (LLM) training, the NVIDIA H100 SXM5 and B200 SXM offer the highest FP8/FP16 throughput and inter-GPU bandwidth via NVLink. For fine-tuning and smaller models, the A100 80GB remains a proven, cost-effective choice.

For inference workloads, consider the L40S or L4 for their balance of performance and cost. The RTX 4090 is popular for research and light inference workloads at the lowest price points.

Spot instances can reduce costs by 40-70% compared to on-demand pricing. Check our spot price tracker for real-time availability. For predictable workloads, reserved instances from major cloud providers (AWS, GCP, Azure) offer 30-60% discounts with 1-3 year commitments.
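The discount arithmetic above can be sketched as a quick estimator. The rates and the preemption-overhead figure are hypothetical; spot interruptions cost real wall-clock time through checkpoint restarts, and that overhead varies by workload:

```python
def training_cost(hours, on_demand_rate, spot_discount=0.0, interrupt_overhead=0.0):
    """Estimate total cost of a training run in dollars.

    hours: expected uninterrupted runtime.
    on_demand_rate: on-demand price in $/hour.
    spot_discount: fractional discount vs on-demand (e.g. 0.6 for 60% off).
    interrupt_overhead: extra fractional runtime lost to spot preemptions
        and checkpoint restarts (illustrative; measure for your workload).
    """
    effective_hours = hours * (1 + interrupt_overhead)
    return effective_hours * on_demand_rate * (1 - spot_discount)

# Illustrative: a 500-hour run at a hypothetical $2.00/hr on-demand rate.
on_demand = training_cost(500, 2.00)                      # $1000.00
spot = training_cost(500, 2.00, spot_discount=0.6,
                     interrupt_overhead=0.1)              # $440.00
```

Even with a 10% restart penalty, a 60% spot discount more than halves the bill here, which is why spot capacity is the default choice for fault-tolerant, checkpointed training jobs.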

Use our comparison tool to see detailed specs side-by-side, or try the training cost calculator to estimate your total costs before committing to a provider.