GPU GUIDE · NVIDIA

Best AI models for the
NVIDIA RTX 2080 Ti

The NVIDIA RTX 2080 Ti has 11.0GB of VRAM. Below are the top 30 open-source AI models that fit, ranked by composite benchmark score. Each row shows the best quantization that fits your hardware.

VRAM

11.0GB

Brand

nvidia

Models that fit

30

Generation

RTX 20 Series

Open full results filter →

Top 30 models for the NVIDIA RTX 2080 Ti

gemma 4 12B it

Best fit: Q6_K · 10.9GBCan I run it? →

Qwen 3.5 9B

Best fit: Q8_0 · 10.6GBCan I run it? →

Qwen 3.5 4B

Best fit: fp16 · 9.0GBCan I run it? →

Devstral Small 2

Best fit: Q8_0 · 8.4GBCan I run it? →

Ministral 3 14B

Best fit: Q5_K_M · 10.9GBCan I run it? →

Ministral 3 8B

Best fit: Q8_0 · 9.5GBCan I run it? →

Gemma 4 E2B

Best fit: f32 · 9.0GBCan I run it? →

Qwen3 4B Thinking 2507

Best fit: fp16 · 9.0GBCan I run it? →

gemma 4 E4B it

Best fit: Q8_0 · 9.5GBCan I run it? →

Ministral 3 3B

Best fit: fp16 · 7.0GBCan I run it? →

Ministral 3 14B 2512

Best fit: Q5_K_M · 10.9GBCan I run it? →

Qwen3 VL 8B Thinking

Best fit: Q8_0 · 10.3GBCan I run it? →

DeepSeek R1 0528 Qwen3 8B

Best fit: Q8_0 · 9.7GBCan I run it? →

Qwen3 14B

Best fit: Q4_1 · 10.3GBCan I run it? →

DeepSeek R1 Distill Qwen 14B

Best fit: Q4_K_M · 10.0GBCan I run it? →

gemma 4 E2B it

Best fit: Q8_0 · 6.4GBCan I run it? →

Ministral 3 8B 2512

Best fit: Q8_0 · 10.5GBCan I run it? →

Nemotron Nano 12B 2 VL (free)

Best fit: Q5_K_M · 10.4GBCan I run it? →

Nemotron Nano 9B V2 (free)

Best fit: Q8_0 · 10.5GBCan I run it? →

NVIDIA Nemotron 3 Nano 4B BF16

Best fit: Q8_0 · 5.3GBCan I run it? →

Qwen3 VL 8B Instruct

Best fit: Q8_0 · 10.3GBCan I run it? →

Qwen3 8B

Best fit: Q8_0 · 9.7GBCan I run it? →

Qwen3 4B Instruct 2507

Best fit: fp16 · 9.0GBCan I run it? →

Qwen 3.5 2B

Best fit: fp16 · 5.0GBCan I run it? →

Granite 4.1 8B

Best fit: Q8_0 · 10.3GBCan I run it? →

Ministral 3 3B 2512

Best fit: fp16 · 8.6GBCan I run it? →

DeepSeek R1 Distill Llama 8B

Best fit: Q8_0 · 9.5GBCan I run it? →

Qwen 3.5 0.8B

Best fit: fp16 · 2.6GBCan I run it? →

Gemma 3 12B

Best fit: Q5_K_M · 9.7GBCan I run it? →

Phi-4

Best fit: Q4_1 · 10.2GBCan I run it? →

FAQ — running AI on the NVIDIA RTX 2080 Ti

How many AI models can the NVIDIA RTX 2080 Ti run?

With 11.0GB of VRAM, the NVIDIA RTX 2080 Ti can run 30+ open-source models from our database, including gemma 4 12B it, Qwen 3.5 9B, Qwen 3.5 4B.

What's the largest LLM I can run on a NVIDIA RTX 2080 Ti?

The biggest model that fits is approximately 14.8B. Larger models would need to be quantized further or won't fit at all.

Is 11.0GB of VRAM enough for local AI?

Tight but workable. 11.0GB runs 3B-7B models well at Q4. For coding or reasoning models you may want more memory.