We now support using credits for three services: GPU Container, AI Studio - Model Fine-tuning, and AI Inference.
Service | Billing Logic | Minimum billable usage | Time to charge |
---|---|---|---|
Model Fine-tuning | Based on the usage duration of a pipeline; remaining unbilled usage is rounded up to the nearest 15 minutes | 15 minutes | Usage ≥ 1/4 GPU-hour |
Billing Equation:
Usage time (hours) × GPU count × item price from pricing table
Example:
You use 1 NVIDIA H100 SXM5 GPU priced at $5.5/GPU-hour to train the model Qwen/Qwen3-32B for 8 minutes. The 8 minutes are rounded up to the 15-minute minimum, which is 0.25 hour:
$5.5 × 1 GPU × 0.25 hour = $1.375
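The billing equation and the 15-minute rounding rule can be sketched in Python as follows. This is a minimal illustration, not the platform's actual billing code: the function name `fine_tuning_cost` and its parameters are hypothetical, and it assumes the rounding is applied to the total pipeline duration.

```python
import math

BLOCK_MINUTES = 15  # minimum billable usage from the table above


def fine_tuning_cost(minutes_used, gpu_count, price_per_gpu_hour):
    """Estimate the fine-tuning charge (hypothetical helper).

    Assumption: usage is rounded up to whole 15-minute blocks
    before it is converted to hours.
    """
    blocks = math.ceil(minutes_used / BLOCK_MINUTES)
    billable_hours = blocks * BLOCK_MINUTES / 60
    return billable_hours * gpu_count * price_per_gpu_hour


# The example above: 8 minutes on 1 GPU at $5.5/GPU-hour
cost = fine_tuning_cost(8, 1, 5.5)
print(f"${cost:.3f}")  # $1.375
```

Under the same assumption, a 16-minute run is billed as two blocks (0.5 hour).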
Storage is billed based on usage volume and the usage duration of a pipeline, at a rate of 1 GB/hour, charged every 1 hour.
Hourly storage usage = Total usage amount within 1 hour = Sum of (Usage amount in each 5-minute block)
Time Slot | Recorded Block Size (GB) | Price per GB per minute ($) | Usage Amount ($) |
---|---|---|---|
00:00–00:05 | 5 | 0.000013 | 5GB × 0.000013 × 5 minutes = 0.000325 |
00:05–00:10 | 5 | 0.000013 | 5GB × 0.000013 × 5 minutes = 0.000325 |
00:10–00:15 | 5 | 0.000013 | 5GB × 0.000013 × 5 minutes = 0.000325 |
00:15–00:20 | 7 | 0.000013 | 7GB × 0.000013 × 5 minutes = 0.000455 |
00:20–00:25 | 7 | 0.000013 | 7GB × 0.000013 × 5 minutes = 0.000455 |
00:25–00:30 | 7 | 0.000013 | 7GB × 0.000013 × 5 minutes = 0.000455 |
00:30–00:35 | 7 | 0.000013 | 7GB × 0.000013 × 5 minutes = 0.000455 |
00:35–00:40 | 7 | 0.000013 | 7GB × 0.000013 × 5 minutes = 0.000455 |
00:40–00:45 | 7 | 0.000013 | 7GB × 0.000013 × 5 minutes = 0.000455 |
00:45–00:50 | 7 | 0.000013 | 7GB × 0.000013 × 5 minutes = 0.000455 |
00:50–00:55 | 7 | 0.000013 | 7GB × 0.000013 × 5 minutes = 0.000455 |
00:55–01:00 | 7 | 0.000013 | 7GB × 0.000013 × 5 minutes = 0.000455 |
TOTAL (1 hour)
Total amount of 12 records = 3 × 0.000325 + 9 × 0.000455 = $0.00507
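The hourly storage sum can be reproduced with a short Python sketch. The function name `hourly_storage_cost` and its parameters are hypothetical; the per-minute price and the 5-minute block length are taken from the table above.

```python
def hourly_storage_cost(block_sizes_gb, price_per_gb_minute=0.000013, block_minutes=5):
    """Sum the usage amount of each 5-minute block (hypothetical helper)."""
    return sum(size * price_per_gb_minute * block_minutes for size in block_sizes_gb)


# Recorded sizes from the table: three blocks at 5 GB, then nine blocks at 7 GB
blocks = [5] * 3 + [7] * 9
total = hourly_storage_cost(blocks)
print(f"${total:.5f}")  # $0.00507
```

The printed value is the unrounded sum of the 12 records; the amount shown on an invoice may be rounded or truncated for display.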
Service | Billing Logic | Minimum billable usage | Time to charge |
---|---|---|---|
Model as a Service | Based on usage volume (number of tokens/characters) | 1 token | Every 5 minutes |
Billing Equation:
Input tokens × input price / 1M
+ Output tokens × output price / 1M
Example:
You use the model Qwen3-32B, priced at $0.165/1M input tokens and $0.187/1M output tokens, with 13,394 input tokens and 127 output tokens
(13,394 × 0.165 + 127 × 0.187) / 1,000,000 ≈ $0.0022
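As a quick check, the token-based equation can be written as a small Python function. The name `inference_cost` and its parameters are illustrative only and are not part of any platform API.

```python
def inference_cost(input_tokens, output_tokens, input_price_per_1m, output_price_per_1m):
    """Token-based charge: prices are quoted per 1M tokens (hypothetical helper)."""
    return (input_tokens * input_price_per_1m
            + output_tokens * output_price_per_1m) / 1_000_000


# The example above: Qwen3-32B with 13,394 input tokens and 127 output tokens
cost = inference_cost(13_394, 127, 0.165, 0.187)
print(f"${cost:.4f}")  # $0.0022
```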
Service | Billing Logic | Minimum billable usage | Time to charge |
---|---|---|---|
GPU Container | GPU Instance: number of GPUs, usage duration (minutes), status Running. Persistent Storage: GB capacity, usage duration (minutes), status Running/Stopped | 1 minute | Every 15 minutes |
Billing Equation:
GPU Cost + Storage Cost
Example:
You use a container with one H100 GPU (15 CPUs, 250 GB RAM, 1 GPU) and 1000 GB temporary storage, priced at $2.31/hour, together with 1000 GB persistent storage priced at $0.00013/GB/hour, for 30 minutes
GPU cost: 0.5 × $2.31 = $1.155 ≈ $1.16
Storage cost: 1000 × $0.00013 × 0.5 = $0.065
Total: $1.155 + $0.065 = $1.22
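The "GPU Cost + Storage Cost" equation can be sketched in Python as below. The function `container_cost` and its parameter names are hypothetical, and the sketch assumes both components are prorated by the minute from their hourly prices.

```python
def container_cost(minutes, instance_price_per_hour, storage_gb, storage_price_per_gb_hour):
    """Return (gpu_cost, storage_cost, total) for one container (hypothetical helper).

    Assumption: usage is measured in minutes and prorated from hourly prices.
    """
    hours = minutes / 60
    gpu_cost = hours * instance_price_per_hour
    storage_cost = storage_gb * storage_price_per_gb_hour * hours
    return gpu_cost, storage_cost, gpu_cost + storage_cost


# Inputs from the example: 30 minutes, $2.31/hour instance,
# 1000 GB persistent storage at $0.00013/GB/hour
gpu, storage, total = container_cost(30, 2.31, 1000, 0.00013)
print(f"GPU: ${gpu:.3f}, storage: ${storage:.3f}, total: ${total:.2f}")
```

Keeping the two components separate makes it easier to see that persistent storage continues to accrue while the container is stopped, whereas the GPU instance is charged only while running.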