A100 PCIe 40GB vs RTX 4090
Compare NVIDIA A100 PCIe 40GB and NVIDIA RTX 4090 specs, performance, and cloud pricing
A100 PCIe 40GB
40GB
From $0.850/hr
RTX 4090
24GB
From $0.370/hr
Architecture
Ampere
vs Ada Lovelace
FP16 Gap
7.5x
A100 PCIe 40GB leads
| Specification | A100 PCIe 40GB | RTX 4090 |
|---|---|---|
| VRAM | 40 GB | 24 GB |
| VRAM Type | HBM2e | GDDR6X |
| FP16 TFLOPS | 624 TFLOPS | 83 TFLOPS |
| FP8 TFLOPS | N/A | 166 TFLOPS |
| Memory Bandwidth | 1.6 TB/s | 1.0 TB/s |
| TDP | 250W | 450W |
| Interconnect | PCIe Gen4 | None |
| Architecture | Ampere | Ada Lovelace |
Price Comparison
| Metric | A100 PCIe 40GB | RTX 4090 |
|---|---|---|
| Cheapest On-Demand | $0.850/hr | $0.370/hr |
| Cheapest Spot | $0.480/hr | $0.280/hr |
| Providers Available | 4 | 3 |
Verdict
Best for Training
NVIDIA A100 PCIe 40GB
624 TFLOPS FP16 with 40GB VRAM
Best Value
NVIDIA A100 PCIe 40GB
734 TFLOPS per $/hr
Best for Inference
NVIDIA A100 PCIe 40GB
624 TFLOPS FP8/FP16
Use-Case Recommendations
Large-Scale Training
Training LLMs and large multi-modal models
Winner
A100 PCIe 40GB
624 TFLOPS FP16 with 40GB HBM2e provides the best training throughput.
Inference at Scale
Deploying models in production for real-time inference
Winner
A100 PCIe 40GB
624 TFLOPS FP8/FP16 gives superior inference throughput.
Budget-Conscious Workloads
Getting the best performance per dollar
Winner
A100 PCIe 40GB
Starting at $0.850/hr delivers the best TFLOPS per dollar.