Question 1

Can the NVIDIA RTX 4080 Super run Qwen 3.6 27B?

Accepted Answer

No. Qwen 3.6 27B (27.0B parameters) needs at least 17.4GB of VRAM even at its smallest quantization, which exceeds the 16.0GB available on the NVIDIA RTX 4080 Super.
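The comparison behind this answer can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, the 17.4GB and 16.0GB figures are taken from the answer as given (how the 17.4GB requirement was derived is not stated), and the weights-only estimate assumes a uniform number of bits per weight, which real quantization formats only approximate.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Rough memory for the weights alone, in GB (10^9 bytes).

    Assumes every weight uses the same number of bits; ignores KV cache,
    activations, and runtime overhead.
    """
    return params_billion * bits_per_weight / 8


def fits(required_gb: float, vram_gb: float) -> bool:
    """True if the stated requirement fits in the card's VRAM."""
    return required_gb <= vram_gb


# Figures quoted in the answer: 17.4GB required, 16.0GB available.
print(fits(17.4, 16.0))             # False

# Weights alone for 27.0B parameters at an assumed 4 bits per weight.
print(weight_memory_gb(27.0, 4.0))  # 13.5
```

A weights-only estimate is a lower bound, so a quoted requirement is normally higher than this number once cache and runtime overhead are included.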

Question 2

What's the best quantization to use?

Accepted Answer

None of Qwen 3.6 27B's available quantizations fit in 16.0GB. You'll need a larger GPU, a smaller model, or a cloud instance.

Question 3

What if I need more headroom for context length?

Accepted Answer

KV cache memory grows with context length. The numbers above assume a baseline context of 2K-4K tokens. For long-context use (32K+ tokens), budget another 2-6GB, depending on the model architecture.
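To see why the cache grows with context, here is a minimal sketch of the usual KV cache estimate for a standard attention stack: one key and one value vector per layer, per KV head, per token. The architecture values below (48 layers, 8 KV heads, head dimension 128, 16-bit cache) are placeholders chosen for illustration, not Qwen 3.6 27B's actual configuration, and models that use other attention variants will not follow this formula exactly.

```python
def kv_cache_gb(tokens: int, n_layers: int, n_kv_heads: int,
                head_dim: int, bytes_per_value: int = 2) -> float:
    """Approximate KV cache size in GB (10^9 bytes) for one sequence.

    The leading 2 accounts for storing both a key and a value per token.
    """
    total_bytes = 2 * n_layers * n_kv_heads * head_dim * bytes_per_value * tokens
    return total_bytes / 1e9


# Hypothetical architecture, for illustration only.
ARCH = dict(n_layers=48, n_kv_heads=8, head_dim=128, bytes_per_value=2)

short = kv_cache_gb(4096, **ARCH)    # baseline 4K-token context
long_ = kv_cache_gb(32768, **ARCH)   # 32K-token context

print(f"4K context:  {short:.2f} GB")
print(f"32K context: {long_:.2f} GB")
print(f"extra:       {long_ - short:.2f} GB")
```

Because the cache scales linearly with token count, going from 4K to 32K tokens multiplies it by eight under these assumptions; the absolute size depends entirely on the real layer and head counts.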

Can I Run Qwen 3.6 27B on an NVIDIA RTX 4080 Super?

