Estimate GPU memory requirements for deploying transformer models. Enter the parameter count, precision, and batch configuration to get hardware-matched recommendations.
Model Parameters Config
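The calculator's own formula is not shown on this page, so the following is only a minimal sketch of how parameter count, precision, and batch configuration commonly feed into a memory estimate. The function names, the bytes-per-parameter table, the use of 1 GB = 1e9 bytes, and the KV-cache formula (standard multi-head attention, no grouped-query attention) are all assumptions made for illustration.

```python
# Assumed storage cost per parameter for common precisions (bytes).
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(num_params: float, precision: str) -> float:
    """Memory needed for the model weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


def kv_cache_gb(num_layers: int, hidden_size: int, seq_len: int,
                batch_size: int, precision: str = "fp16") -> float:
    """Approximate KV-cache size in GB.

    Assumes one key and one value tensor per layer, each of shape
    (batch_size, seq_len, hidden_size), i.e. plain multi-head attention.
    """
    elements = 2 * num_layers * hidden_size * seq_len * batch_size
    return elements * BYTES_PER_PARAM[precision] / 1e9


def inference_memory_gb(num_params: float, precision: str, num_layers: int,
                        hidden_size: int, seq_len: int, batch_size: int) -> float:
    """Weights plus KV cache; activations and framework overhead are ignored."""
    return (weight_memory_gb(num_params, precision)
            + kv_cache_gb(num_layers, hidden_size, seq_len, batch_size, precision))


if __name__ == "__main__":
    # Hypothetical 7B-parameter model with 32 layers and hidden size 4096.
    total = inference_memory_gb(7e9, "fp16", 32, 4096, seq_len=2048, batch_size=1)
    print(f"Estimated inference memory: {total:.2f} GB")
```

Real deployments also need room for activations, CUDA context, and allocator fragmentation, so a figure from a sketch like this is better treated as a lower bound than as an exact requirement.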
GPU Reference Specs
| GPU | VRAM | Tier |
|---|---|---|
| RTX 3060 | 12 GB | Consumer |
| RTX 4080 | 16 GB | Consumer |
| RTX 4090 | 24 GB | Prosumer |
| A10G | 24 GB | Cloud |
| A100 40G | 40 GB | Data Centre |
| A100 80G | 80 GB | Data Centre |
| H100 SXM | 80 GB | HPC |
| H100 NVL | 94 GB | HPC |
| 2× A100 80G | 160 GB | Multi-GPU |
| 4× A100 80G | 320 GB | Multi-GPU |
| 8× H100 | 640 GB | Cluster |
GPU Compatibility
| GPU | VRAM | Fits? | Headroom |
|---|---|---|---|
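
The compatibility table has no rows in this copy of the page. As a rough illustration of how its "Fits?" and "Headroom" columns could be derived, the sketch below compares a required memory figure against the VRAM values from the GPU Reference Specs table. The function name `check_compatibility` and the simple rule "fits if VRAM is at least the requirement" are assumptions, not the calculator's confirmed logic.

```python
# VRAM capacities (GB) copied from the GPU Reference Specs table.
GPU_VRAM_GB = {
    "RTX 3060": 12,
    "RTX 4080": 16,
    "RTX 4090": 24,
    "A10G": 24,
    "A100 40G": 40,
    "A100 80G": 80,
    "H100 SXM": 80,
    "H100 NVL": 94,
    "2x A100 80G": 160,
    "4x A100 80G": 320,
    "8x H100": 640,
}


def check_compatibility(required_gb: float) -> list[tuple[str, int, bool, float]]:
    """Return (gpu, vram_gb, fits, headroom_gb) for every reference GPU.

    Assumed rule: a GPU fits when its VRAM is at least the requirement.
    Headroom is VRAM minus the requirement and is negative when it does not fit.
    """
    rows = []
    for gpu, vram in GPU_VRAM_GB.items():
        headroom = vram - required_gb
        rows.append((gpu, vram, headroom >= 0, headroom))
    return rows


if __name__ == "__main__":
    # Hypothetical requirement of 20 GB.
    print("| GPU | VRAM | Fits? | Headroom |")
    print("|---|---|---|---|")
    for gpu, vram, fits, headroom in check_compatibility(20.0):
        print(f"| {gpu} | {vram} GB | {'Yes' if fits else 'No'} | {headroom:+.1f} GB |")
```

The multi-GPU entries are treated here as a single pool of memory, which presumes the model is sharded across devices; a stricter check might reserve a safety margin rather than accepting zero headroom.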