FAQ
FAQ
Updated on 23 Sep 2025

1. How can I create an API Key and use it with models?

You can create an API Key under My Account → My API Keys.
This key will be required to call models via the Inference API.


2. How is model usage pricing calculated?

Pricing is based on the number of input and output tokens.
You can check details under Product Information → Pricing or in Billing Management inside My Account.


3. What are the rate limits for model usage?

Each model has its own Rate Limit (e.g., requests per second or tokens per second).
You can view this information in Product Information → Rate Limit.


4. Does the Marketplace support autoscaling for model endpoints?

Yes. Endpoints can be configured with autoscaling based on traffic load,
optimizing costs while maintaining stability during traffic spikes.