NVIDIA Releases Nemotron-Labs-TwoTower: an Open-Weight Diffusion Language Model Built on a Frozen Autoregressive Nemotron-3-Nano-30B-A3B Backbone

u/ai-lover 2026-07-01 · 08:18 UTC

NVIDIA has released Nemotron-Labs-TwoTower, an open-weight diffusion language model built on the hybrid Mamba-2, attention, and MoE backbone of Nemotron-3-Nano-30B-A3B. Its "two-tower" architecture keeps a frozen autoregressive context tower for processing clean tokens and handles denoising separately. The split is meant to resolve the weight conflict that arises when a single set of weights must both represent clean context and denoise noisy tokens.
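To make the two-tower split concrete, here is a toy sketch in plain Python. It is not NVIDIA's implementation: the class names, the `MASK` id, the one-hot embeddings, and the single-row update rule are all hypothetical stand-ins, and a real diffusion model would denoise over many steps rather than in one pass. It only illustrates that the context tower's weights stay fixed while the denoising side is the part that learns.

```python
MASK = -1  # hypothetical id marking a noised position


class ContextTower:
    """Stand-in for the frozen autoregressive tower (weights never updated)."""

    def __init__(self, vocab_size, dim):
        # Fixed one-hot-style embeddings play the role of pretrained weights.
        self.embed = [[1.0 if d == tok % dim else 0.0 for d in range(dim)]
                      for tok in range(vocab_size)]

    def encode(self, clean_tokens):
        # Causal running mean: state i depends only on tokens 0..i.
        total = [0.0] * len(self.embed[0])
        states = []
        for n, tok in enumerate(clean_tokens, start=1):
            total = [t + e for t, e in zip(total, self.embed[tok])]
            states.append([t / n for t in total])
        return states


class DenoiserTower:
    """Stand-in for the trainable denoising side."""

    def __init__(self, vocab_size, dim):
        self.out = [[0.0] * dim for _ in range(vocab_size)]

    def logits(self, context_state):
        return [sum(w * c for w, c in zip(row, context_state))
                for row in self.out]

    def train_step(self, context_state, target, lr=1.0):
        # Crude update: nudge only the target row toward the context state.
        self.out[target] = [w + lr * c
                            for w, c in zip(self.out[target], context_state)]

    def denoise(self, context_state, noisy_tokens):
        # Fill every masked position with the highest-scoring token.
        scores = self.logits(context_state)
        guess = max(range(len(scores)), key=scores.__getitem__)
        return [guess if t == MASK else t for t in noisy_tokens]


ctx = ContextTower(vocab_size=8, dim=4)
den = DenoiserTower(vocab_size=8, dim=4)

states = ctx.encode([1, 2, 3])               # clean context tokens
frozen_before = [row[:] for row in ctx.embed]

den.train_step(states[-1], target=5)          # only the denoiser changes
filled = den.denoise(states[-1], [MASK, 4, MASK])
print(filled)
```

In this sketch the context tower is only ever read from, so its embeddings are identical before and after the training step; all learning happens in `DenoiserTower.out`.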

