Question 1

What is Z.ai GLM 5 Turbo?

Accepted Answer

GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows...

Question 2

How much does Z.ai GLM 5 Turbo cost?

Accepted Answer

Z.ai GLM 5 Turbo is priced at $1.2 per 1 million input tokens and $4 per 1 million output tokens when accessed via NeuronGate. You pay per token — no subscriptions required. Top up with crypto (USDT, USDC, ETH, BTC).

Question 3

Does Z.ai GLM 5 Turbo support streaming?

Accepted Answer

Yes, Z.ai GLM 5 Turbo fully supports streaming responses via NeuronGate's API. Use the standard `"stream": true` parameter in your request.

Question 4

What is the context window of Z.ai GLM 5 Turbo?

Accepted Answer

Z.ai GLM 5 Turbo has a context window of 203K tokens (~152K words). Maximum output is 131K tokens.

Question 5

Does Z.ai GLM 5 Turbo support function calling (tools)?

Accepted Answer

Yes, Z.ai GLM 5 Turbo supports function calling / tool use via NeuronGate's standard API.

Question 6

How do I use Z.ai GLM 5 Turbo with NeuronGate?

Accepted Answer

Create a NeuronGate account at neurongate.net, top up your balance with crypto, generate an API key, and use model ID `z-ai/glm-5-turbo` in your requests. NeuronGate uses the OpenAI-compatible API format — just change your base URL to `https://neurongate.net/v1`.

Example	Cost
1K input tokens (short prompt)	$0.00120
1K in + 500 out (typical response)	$0.00320
10K in + 2K out (document analysis)	$0.0200
100K in + 10K out (large context)	$0.1600

Z.ai GLM 5 Turbo

Code Examples

Pricing Details

Frequently Asked Questions

Capabilities

Context Window

Modalities

Similar Models