← Back to GPUs

NVIDIA · RTX 40
NVIDIA GeForce RTX 4080 SUPER
$950$999 MSRP
The RTX 4080 SUPER delivers excellent 4K gaming performance with 16GB of GDDR6X memory. It is a strong card for gaming-first builds that want some AI capability. The 16GB VRAM handles 14B models at Q4 and runs Stable Diffusion XL comfortably. However, for dedicated AI builds, the 4090's 24GB offers significantly more headroom for the price difference.
Best ForPremium 4K gaming with moderate AI capability
VerdictGreat gaming card, but the 4090 is worth the upgrade for AI-focused builders.
AI
7/10
Gaming
9/10
Specifications
VRAM16GB GDDR6X
Memory Bandwidth736 GB/s
CUDA Cores10,240
Boost Clock2550 MHz
TDP320W
Power Connector1x 16-pin
Length304mm
Form FactorTriple Slot
Release Year2024
AI Capabilities
Capable16GB VRAM
Runs most popular models with quantization. The minimum for serious AI work.
Can run (Q4 quantized)
Llama 3.1 8BQwen 2.5 14BMistral 7BFLUX.1 DevStable Diffusion XLStable Diffusion 3.5 LargeHunyuanVideoCogVideoX-5BMochi 1LTX VideoStable Video DiffusionWan Video 14BCodestral 22BAlphaFold 2ESMFold (ESM-2 15B)ESM-2 3BscGPTRFdiffusionFine-tune Llama 8BTrain SDXL LoRATrain FLUX LoRA
Tight fit (may need CPU offload)
Qwen 2.5 32B (20GB Q4)Qwen 2.5 Coder 32B (20GB Q4)LLaVA 1.6 34B (20GB Q4)
Recommended system RAM for AI: 32GB+ (2x GPU VRAM for model overflow)
Performance Estimates
Estimated tokens/sec for LLM inference based on 736 GB/s memory bandwidth — not hardware benchmarks. Methodology · What is Q4/Q8?
Llama 3.1 8B8B
FP16~25-31 tok/sUsableQwen 2.5 32B32B
Offload~1-3 tok/sVery slowQwen 2.5 14B14B
Q8~31-38 tok/sFastMistral 7B7B
FP16~29-36 tok/sUsableCodestral 22B22B
Q4~35-43 tok/sFastQwen 2.5 Coder 32B32B
Offload~1-3 tok/sVery slowPros
- +Excellent 4K gaming
- +Good DLSS 3 support
- +Slightly better than 4080
Cons
- -16GB may limit some AI workloads
- -Expensive for the VRAM
gamingai
Will It Run?
Llama 3.1 8B8B
FP16Qwen 2.5 32B32B
OffloadQwen 2.5 14B14B
Q8Mistral 7B7B
FP16FLUX.1 Dev12B
Q8Stable Diffusion XL6.6B
FP16Stable Diffusion 3.5 Large8B
Q8HunyuanVideo13B
Q4