Question 1

What is NVIDIA Nemotron 3 Super?

Accepted Answer

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...

Question 2

How much does NVIDIA Nemotron 3 Super cost?

Accepted Answer

NVIDIA Nemotron 3 Super is priced at $0.21 per 1 million input tokens and $0.455 per 1 million output tokens when accessed via NeuronGate. You pay per token — no subscriptions required. Top up with crypto (USDT, USDC, ETH, BTC).

Question 3

Does NVIDIA Nemotron 3 Super support streaming?

Accepted Answer

Yes, NVIDIA Nemotron 3 Super fully supports streaming responses via NeuronGate's API. Use the standard `"stream": true` parameter in your request.

Question 4

What is the context window of NVIDIA Nemotron 3 Super?

Accepted Answer

NVIDIA Nemotron 3 Super has a context window of 1.0M tokens (~750K words).

Question 5

Does NVIDIA Nemotron 3 Super support function calling (tools)?

Accepted Answer

Yes, NVIDIA Nemotron 3 Super supports function calling / tool use via NeuronGate's standard API.

Question 6

How do I use NVIDIA Nemotron 3 Super with NeuronGate?

Accepted Answer

Create a NeuronGate account at neurongate.net, top up your balance with crypto, generate an API key, and use model ID `nvidia/nemotron-3-super-120b-a12b` in your requests. NeuronGate uses the OpenAI-compatible API format — just change your base URL to `https://neurongate.net/v1`.

Example	Cost
1K input tokens (short prompt)	$0.00021
1K in + 500 out (typical response)	$0.00044
10K in + 2K out (document analysis)	$0.00301
100K in + 10K out (large context)	$0.0256

NVIDIA Nemotron 3 Super

Code Examples

Pricing Details

Frequently Asked Questions

Capabilities

Context Window

Modalities

Similar Models