NVIDIA Quadro RTX 6000 — 24GB

Specifications

BrandNVIDIA
ModelQuadro RTX 6000
VRAM24GB
ArchitectureTuring
CUDA / Stream Processors4,608
Memory Bandwidth672 GB/s
TDP260W
FP32 TFLOPS16.3

Buy Now

Prices last updated:

GPUDojo is reader-supported. When you buy through links on our site, we may earn an affiliate commission.

Price History

Price tracking started — chart will appear after the next snapshot.

For AI / LLM Use

Solid choice for 30B models and comfortable 14B inference.

What Models Can It Run?

  • 30B Q4_K_M, 14B full precision, 70B Q2 (tight)
  • 14B Q6_K, 30B Q3_K (tight)
  • 14B Q4_K_M, 7B full precision
  • 7B Q6_K, 14B Q3_K (tight)
  • 7B Q4_K_M only

Estimated Performance

Generation: ~50 tokens/sec

Prefill: ~291 tokens/sec

Recommended Quantisations

  • Q4_K_M recommended for 30B models
  • Q6_K or Q8 for 14B and below
  • Full precision for 7B

Pros & Cons

Pros

  • 24GB VRAM — handles large models
  • Turing architecture — good software support
  • Consumer card — easy to install, display output

Cons

Community Verdict

No community reviews yet for the Quadro RTX 6000. Know a good review? Let us know.