Billing Calculation
Updated on 21 Aug 2025

We now support using credits for three services: GPU Container, AI Studio - Model Fine-tuning, and AI Inference.

1. Model Fine-tuning

Service: Model Fine-tuning
  • Billing logic: Based on the usage duration of a pipeline. Remaining unbilled usage is rounded up to the nearest 15 minutes.
  • Minimum billable usage: 15 minutes
  • Time to charge: When usage ≥ 1/4 GPU-hour

Billing Equation:

  • Usage duration: Measured in hours, rounded up to the nearest 15-minute increment
  • GPU: Number of GPUs used
  • Cost: Usage time (hours) × GPU count × item price from pricing table

Example:

You use 1 NVIDIA H100 SXM5 GPU priced at $5.5/GPU-hour to train the model Qwen/Qwen3-32B for 8 minutes.

  • Usage duration: 8 minutes → rounded up to 15 minutes (0.25 hour)
  • GPU: 1
  • Cost: $5.5/GPU-hour × 1 GPU × 0.25 hour = $1.375
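
The rounding rule and cost formula above can be sketched in a few lines of Python. This is a minimal illustration rather than the platform's actual billing code; the function name `fine_tuning_cost` and its parameters are hypothetical.

```python
import math

BLOCK_MINUTES = 15  # billing increment stated in this section


def fine_tuning_cost(minutes_used: float, gpu_count: int,
                     price_per_gpu_hour: float) -> float:
    """Return the charge in dollars for one fine-tuning run (illustrative)."""
    # Round the usage up to the next 15-minute increment.
    blocks = math.ceil(minutes_used / BLOCK_MINUTES)
    billable_hours = blocks * BLOCK_MINUTES / 60
    # Cost = usage time (hours) x GPU count x price per GPU-hour.
    return billable_hours * gpu_count * price_per_gpu_hour


# The example above: 8 minutes on 1 GPU at $5.5/GPU-hour.
print(fine_tuning_cost(8, 1, 5.5))  # 1.375
```

Because usage is rounded up, any run between 1 and 15 minutes on the same GPU costs the same amount in this sketch.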

2. Model Hub

Billing is based on the usage volume and usage duration of a pipeline.

  • Each hour is split into 12 blocks — 1 block = 5 minutes.
  • Smaller chunks mean you don’t get charged for idle time.
  • If your model is active, we measure how much storage (in GB) it used during that block.
  • If your model is deleted, nothing is recorded, and you’re not charged for that block.
  • At the end of the hour, we add up all the recorded usage to calculate your bill.

Rate unit: 1 GB/hour (charged every 1 hour)


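Billing Equation:

  • Usage amount per 5-minute block: Recorded storage (GB) × price per GB per minute × 5 minutes
  • Hourly storage usage: Total usage amount within 1 hour = Sum of the usage amounts of all 5-minute blocks in that hour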


Example:

Time Slot | Recorded Block Size (GB) | Price per GB per minute ($) | Usage Amount ($)
00:00–00:05 | 5 | 0.000013 | 5 GB × 0.000013 × 5 minutes = 0.000325
00:05–00:10 | 5 | 0.000013 | 5 GB × 0.000013 × 5 minutes = 0.000325
00:10–00:15 | 5 | 0.000013 | 5 GB × 0.000013 × 5 minutes = 0.000325
00:15–00:20 | 7 | 0.000013 | 7 GB × 0.000013 × 5 minutes = 0.000455
00:20–00:25 | 7 | 0.000013 | 7 GB × 0.000013 × 5 minutes = 0.000455
00:25–00:30 | 7 | 0.000013 | 7 GB × 0.000013 × 5 minutes = 0.000455
00:30–00:35 | 7 | 0.000013 | 7 GB × 0.000013 × 5 minutes = 0.000455
00:35–00:40 | 7 | 0.000013 | 7 GB × 0.000013 × 5 minutes = 0.000455
00:40–00:45 | 7 | 0.000013 | 7 GB × 0.000013 × 5 minutes = 0.000455
00:45–00:50 | 7 | 0.000013 | 7 GB × 0.000013 × 5 minutes = 0.000455
00:50–00:55 | 7 | 0.000013 | 7 GB × 0.000013 × 5 minutes = 0.000455
00:55–01:00 | 7 | 0.000013 | 7 GB × 0.000013 × 5 minutes = 0.000455

TOTAL (1 hour)
Sum of the 12 records = 3 × 0.000325 + 9 × 0.000455 = $0.00507 (about $0.005)
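
The block-by-block accounting above can be illustrated with a short Python sketch. It assumes that a deleted model is represented as `None` for the affected block and that the price is expressed per GB per minute, as in the example table; the function name `hourly_storage_charge` is hypothetical and not part of any published API.

```python
from typing import Optional, Sequence

BLOCK_MINUTES = 5  # each hour is split into 12 blocks of 5 minutes


def hourly_storage_charge(block_sizes_gb: Sequence[Optional[float]],
                          price_per_gb_minute: float) -> float:
    """Sum the per-block usage amounts for one hour (illustrative)."""
    total = 0.0
    for size_gb in block_sizes_gb:
        if size_gb is None:
            # Assumption: a deleted model records nothing for this block.
            continue
        total += size_gb * price_per_gb_minute * BLOCK_MINUTES
    return total


# The example hour: three blocks at 5 GB followed by nine blocks at 7 GB.
blocks = [5] * 3 + [7] * 9
total = hourly_storage_charge(blocks, 0.000013)
print(f"{total:.6f}")
```

Passing `None` for some blocks shows how deleting a model partway through the hour lowers the hourly total.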

3. Model as a Service

Service: Model as a Service
  • Billing logic: Based on usage volume (number of tokens/characters)
  • Minimum billable usage: 1 token
  • Time to charge: Every 5 minutes

Billing Equation:

  • Usage tokens: Input tokens + Output tokens
  • Cost: Input tokens × input price / 1M + Output tokens × output price / 1M

Example:

You use the model Qwen3-32B, priced at $0.165/1M input tokens and $0.187/1M output tokens, and consume 13,394 input tokens and 127 output tokens.

  • Usage tokens: 13,394 input tokens and 127 output tokens
  • Cost = (13,394 × 0.165 + 127 × 0.187) / 1,000,000 = $0.0022
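
As a rough sketch of the token formula above, the following Python function prices input and output tokens separately. The name `token_cost` and its parameters are hypothetical, chosen only for illustration.

```python
def token_cost(input_tokens: int, output_tokens: int,
               input_price_per_million: float,
               output_price_per_million: float) -> float:
    """Return the charge in dollars for one batch of tokens (illustrative)."""
    # Prices are quoted per 1M tokens, so divide the weighted sum by 1,000,000.
    weighted = (input_tokens * input_price_per_million
                + output_tokens * output_price_per_million)
    return weighted / 1_000_000


# The example above: 13,394 input tokens and 127 output tokens on Qwen3-32B.
cost = token_cost(13_394, 127, 0.165, 0.187)
print(f"${cost:.4f}")  # $0.0022
```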

4. GPU Container

Service: GPU Container
  • Billing logic:
      - GPU Instance: number of GPUs, usage duration (minutes), status Running
      - Persistent Storage: GB capacity, usage duration (minutes), status Running/Stopped
  • Minimum billable usage: 1 minute
  • Time to charge: Every 15 minutes

Billing Equation:

  • Usage duration: In hours (not rounded up)
  • Storage Volume: In GB
  • GPU Cost: Usage hours × GPU hourly price
  • Storage Cost: Usage hours × GB × storage hourly price
  • Total Cost: GPU Cost + Storage Cost

Example:

You use a container with one H100 GPU (15 CPUs, 250 GB RAM, 1 GPU) and 1000 GB temporary storage, priced at $2.31/hour, together with 1000 GB of persistent storage priced at $0.00013/GB/hour, for 30 minutes.

  • Usage duration: 30 minutes = 0.5 hours
  • GPU Cost: 0.5 × $2.31 = $1.155
  • Storage Cost: 1000 × $0.00013 × 0.5 = $0.065
  • Total: $1.155 + $0.065 = $1.22
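
The GPU Container equation can be sketched in Python as follows. This is an illustration under stated assumptions, not the platform's implementation: the function name `gpu_container_cost` is hypothetical, and the GPU and storage durations are passed separately because persistent storage is billed in both the Running and Stopped states while the GPU instance is billed only while Running.

```python
from typing import Tuple


def gpu_container_cost(running_minutes: float, storage_minutes: float,
                       instance_price_per_hour: float, storage_gb: float,
                       storage_price_per_gb_hour: float
                       ) -> Tuple[float, float, float]:
    """Return (gpu_cost, storage_cost, total) in dollars (illustrative)."""
    # Durations are converted to hours and are not rounded up.
    gpu_hours = running_minutes / 60
    storage_hours = storage_minutes / 60
    gpu_cost = gpu_hours * instance_price_per_hour
    storage_cost = storage_hours * storage_gb * storage_price_per_gb_hour
    return gpu_cost, storage_cost, gpu_cost + storage_cost


# The example above: 30 minutes running, 1000 GB of persistent storage.
gpu, storage, total = gpu_container_cost(30, 30, 2.31, 1000, 0.00013)
print(f"GPU: ${gpu:.3f}  Storage: ${storage:.3f}  Total: ${total:.2f}")
```

Setting `running_minutes` lower than `storage_minutes` models a container that was stopped for part of the period but kept its persistent storage.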