L40S vs L4

Compare NVIDIA L40S and NVIDIA L4 specs, performance, and cloud pricing

L40S

48GB

From $0.820/hr

L4

24GB

From $0.350/hr

Architecture

Ada Lovelace

vs Ada Lovelace

FP16 Gap

1.5x

L40S leads

SpecificationL40SL4
VRAM48 GB24 GB
VRAM TypeGDDR6XGDDR6
FP16 TFLOPS366.5 TFLOPS242 TFLOPS
FP8 TFLOPS733 TFLOPS485 TFLOPS
Memory Bandwidth864 GB/s300 GB/s
TDP350W72W
InterconnectPCIe Gen4PCIe Gen4
ArchitectureAda LovelaceAda Lovelace

Price Comparison

MetricL40SL4
Cheapest On-Demand$0.820/hr$0.350/hr
Cheapest Spot$0.440/hr$0.210/hr
Providers Available53

Verdict

Best for Training

NVIDIA L40S

366.5 TFLOPS FP16 with 48GB VRAM

Best Value

NVIDIA L4

691 TFLOPS per $/hr

Best for Inference

NVIDIA L40S

733 TFLOPS FP8/FP16

Use-Case Recommendations

Large-Scale Training

Training LLMs and large multi-modal models

Winner

L40S

366.5 TFLOPS FP16 with 48GB GDDR6X provides the best training throughput.

Inference at Scale

Deploying models in production for real-time inference

Winner

L40S

733 TFLOPS FP8/FP16 gives superior inference throughput.

Budget-Conscious Workloads

Getting the best performance per dollar

Winner

L4

Starting at $0.350/hr delivers the best TFLOPS per dollar.

Learn More