LLM VRAM Calculator

Estimate GPU memory requirements for large language models
Model Configuration
Quantization
Bits per parameter
Bytes per parameter
Quality notes
Results
Weights VRAM
KV Cache (per layer)
Total (w/ overhead)
Memory Breakdown
GPU Compatibility