nvidia/Qwen3.6-35B-A3B-NVFP4 · Hugging Face

Article automatically generated from technical news.

The NVIDIA Qwen3.6-35B-A3B-NVFP4 model is the quantized version of Alibaba's Qwen3.6-35B-A3B model, which is an auto-regressive language model that uses an optimized transformer architecture. For more information, please check here . The NVIDIA Qwen3.6-35B-A3B-NVFP4 model is quantized with Model Optimizer . Post Training Quantization This model was obtained by quantizing the weights of Qwen3.6-35B-A3B to NVFP4 data type, ready for inference with vLLM. Only the weights and

Fonte originale