Model Configuration
Quantization
Bits per parameter—
Bytes per parameter—
Quality notes—
Results
Weights VRAM
—
KV Cache (per layer)
—
Total (w/ overhead)
—
Memory Breakdown
GPU Compatibility