GPU GUIDE · NVIDIA

Best AI models for the
NVIDIA GTX 1660 Ti

The NVIDIA GTX 1660 Ti has 6.0GB of VRAM. Below are the top 30 open-source AI models that fit, ranked by composite benchmark score. Each row shows the best quantization that fits your hardware.

VRAM
6.0GB
Brand
nvidia
Models that fit
30
Generation
GTX 16 Series

Top 30 models for the NVIDIA GTX 1660 Ti

01

Qwen 3.5 4B

4.0B
45.1
Best fit: Q8_0 · 5.3GBCan I run it? →
02

Devstral Small 2

7.0B
31.7
Best fit: Q5_K_M · 6.0GBCan I run it? →
03

gemma 4 E4B it

8.0B
31.3
Best fit: Q4_1 · 6.0GBCan I run it? →
04

Qwen3 4B Thinking 2507

4.0B
30.3
Best fit: Q8_0 · 5.3GBCan I run it? →
05

Qwen 3.5 2B

2.0B
27.2
Best fit: fp16 · 5.0GBCan I run it? →
06

Qwen3 VL 8B Thinking

8.8B
27.8
Best fit: Q4_0 · 6.0GBCan I run it? →
07

DeepSeek R1 0528 Qwen3 8B

8.2B
27.4
Best fit: Q4_K_M · 6.0GBCan I run it? →
08

gemma 4 E2B it

5.1B
25.4
Best fit: Q6_K · 5.2GBCan I run it? →
09

Gemma 4 E2B

2.0B
25.0
Best fit: fp16 · 5.0GBCan I run it? →
10

NVIDIA Nemotron 3 Nano 4B BF16

4.0B
24.5
Best fit: Q8_0 · 5.3GBCan I run it? →
11

Ministral 3 8B

8.0B
25.0
Best fit: Q4_K_M · 5.8GBCan I run it? →
12

Qwen3 4B

4.0B
23.7
Best fit: Q8_0 · 5.3GBCan I run it? →
13

Qwen3 VL 8B Instruct

8.8B
23.8
Best fit: Q4_0 · 6.0GBCan I run it? →
14

Qwen3 VL 4B Thinking

4.4B
22.9
Best fit: Q8_0 · 5.7GBCan I run it? →
15

Qwen3 4B Instruct 2507

4.0B
21.5
Best fit: Q8_0 · 5.3GBCan I run it? →
16

Qwen3 8B

8.2B
22.0
Best fit: Q4_K_M · 6.0GBCan I run it? →
17

Granite 4.1 8B

8.8B
20.6
Best fit: Q4_0 · 6.0GBCan I run it? →
18

DeepSeek R1 Distill Llama 8B

8.0B
20.2
Best fit: Q4_K_M · 5.8GBCan I run it? →
19

Ministral 3 3B 2512

3.8B
18.7
Best fit: Q8_0 · 5.0GBCan I run it? →
20

Ministral 3 3B

3.0B
18.3
Best fit: Q8_0 · 4.2GBCan I run it? →
21

Qwen 3.5 0.8B

0.8B
17.5
Best fit: fp16 · 2.6GBCan I run it? →
22

Qwen 2.5 Coder 7B

7.6B
16.6
Best fit: Q4_K_M · 5.6GBCan I run it? →
23

Qwen3 VL 4B Instruct

4.4B
15.9
Best fit: Q8_0 · 5.7GBCan I run it? →
24

DeepSeek R1 Distill Qwen 1.5B

1.8B
15.1
Best fit: fp16 · 4.6GBCan I run it? →
25

Llama 3 8B Instruct

8.0B
14.8
Best fit: Q4_1 · 6.0GBCan I run it? →
26

granite 4.1 3b

3.4B
14.2
Best fit: Q8_0 · 4.6GBCan I run it? →
27

LFM2.5-1.2B-Thinking (free)

1.2B
13.5
Best fit: fp16 · 3.4GBCan I run it? →
28

LFM2.5-1.2B-Instruct (free)

1.2B
13.4
Best fit: fp16 · 3.4GBCan I run it? →
29

Qwen3 1.7B

2.0B
13.3
Best fit: fp16 · 5.0GBCan I run it? →
30

Mistral 7B Instruct v0.1

7.2B
12.4
Best fit: Q5_K_S · 6.0GBCan I run it? →
Advertisement

FAQ — running AI on the NVIDIA GTX 1660 Ti

How many AI models can the NVIDIA GTX 1660 Ti run?

With 6.0GB of VRAM, the NVIDIA GTX 1660 Ti can run 30+ open-source models from our database, including Qwen 3.5 4B, Devstral Small 2, gemma 4 E4B it.

What's the largest LLM I can run on a NVIDIA GTX 1660 Ti?

The biggest model that fits is approximately 8.8B. Larger models would need to be quantized further or won't fit at all.

Is 6.0GB of VRAM enough for local AI?

Tight but workable. 6.0GB runs 3B-7B models well at Q4. For coding or reasoning models you may want more memory.