NVIDIA-Nemotron-Labs-3-Elastic-12B-A2B

About

static quants of DavidAU/NVIDIA-Nemotron-Labs-3-Elastic-12B-A2B

Provided Quants

Link Type Size/GB Notes
GGUF Q4_K_M 9.64GB ...
Downloads last month
191
GGUF
Model size
12B params
Architecture
nemotron_h_moe
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for nightmedia/NVIDIA-Nemotron-Labs-3-Elastic-12B-A2B-GGUF