Skip to content
NVIDIA

NVIDIA: Nemotron 3 Nano Omni (free)

flagship
NVIDIA · released 2025-06-01 · text+image+audio+video->text
currently routing · 4.2k rpm
256K tokens
Context
— / 1M
Input
— / 1M
Output
— t/s
Speed
proprietary
License
/ ABOUT

Nemotron 3 Nano Omni 30B A3B Reasoning is NVIDIA's compact reasoning-focused model using a Mixture-of-Experts architecture for efficient inference.

Providers for NVIDIA: Nemotron 3 Nano Omni (free)

1 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider
Context
Quant
Uptime · 30d
256K
bf16
0.00%