intel is seriously competitive for price-to-VRAM, but i don't know about compatibility
NVIDIA is usually the clear winner for performance, 5xxx series/blackwell has support for NVFP4 quantized models
but you could also do like, multiple 3090s or something
hope this helps

NVIDIA Technical Blog
Introducing NVFP4 for Efficient and Accurate Low-Precision Inference | NVIDIA Technical Blog
To get the most out of AI, optimizations are critical. When developers think about optimizing AI models for inference, model compression techniques...