Create an AI Gateway
Procedures
Result

What to Do Next
Manage Token Quota

Manage Expiration

Duplicate AI Gateway
Delete
Last updated
Go to zenConsole and log in with your account. On the left menu bar, go to Products > GPU & AI > AI Gateway, and click Create AI Gateway.

A token represents a unit of text processed by the model. Token quota defines the maximum number of tokens, including all token types, such as input tokens, output tokens, cached tokens, and any other tokens consumed during request processing.
The token quota is unlimited by default. Uncheck Unlimited quota if you want to restrict the maximum number of tokens this API key can consume.
For example, if you set the token quota to 1,000,000 tokens:
The API key can consume up to 1,000,000 tokens in total
Once the quota is reached, the API key will be automatically suspended
No further token consumption or API requests will be allowed unless the quota is updated

You can add your AI gateway instance into a specified resource group. In a resource group, specific members can be granted defined permissions for resources.
See Create a Resource Group for more details.
Once created, you will receive an API Access Key, which is used for all subsequent API calls. See API docs to view all related API interfaces.
Copy this access key and store it securely.

Click the AI Gateway name to view the details page of logs, data analysis and basic information.
You can also perform the follow operations in Actions if you need:
Uncheck Unlimited quota to enter a quota limit.

Select desired API key expiration duration.

Click Duplicate AI Gateway to create an AI Gateway with the same configuration.
Click Delete if you don't need the AI gateway any more.
Last updated