Billing Calculation
Updated on 18 Jul 2025

We now support using credits for three services: GPU Container, AI Studio - Model Fine-tuning, and AI Inference.

Model Fine-tuning

Service: Model Fine-tuning
  • Billing logic: Based on the usage duration of a pipeline. Remaining unbilled usage is rounded up to the nearest 15 minutes.
  • Minimum billable usage: 15 minutes
  • Time to charge: Usage ≥ 1/4 GPU-hour

Billing Equation:

  • Usage duration: Calculated in GPU-hours (rounded up to the nearest 15-minute increment)
  • GPU: Number of GPUs used
  • Cost: Usage time (hours) × GPU count × item price from pricing table

Example:

You use 1 NVIDIA H100 SXM5 GPU priced at $5.5/GPU-hour to fine-tune the model Qwen/Qwen3-32B for 8 minutes.

  • Usage duration: 8 mins → rounded up to 15 mins (0.25 GPU-hour)
  • GPU: 1
  • Cost: $5.5 × 1 GPU × 0.25 GPU-hour = $1.375
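
The rounding rule above can be sketched in a few lines of Python. This is a minimal illustration, not an official SDK: the function name `fine_tuning_cost` and its parameters are hypothetical, and it assumes the duration is rounded up to the next 15-minute increment before being multiplied by the GPU count and the price.

```python
import math

BILLING_INCREMENT_MINUTES = 15  # usage is rounded up to 15-minute blocks


def fine_tuning_cost(usage_minutes, gpu_count, price_per_gpu_hour):
    """Estimate a fine-tuning charge (hypothetical helper, not a platform API)."""
    # Round the duration up to the next 15-minute increment.
    blocks = math.ceil(usage_minutes / BILLING_INCREMENT_MINUTES)
    billable_hours = blocks * BILLING_INCREMENT_MINUTES / 60
    # Cost = usage time (hours) x GPU count x item price.
    return billable_hours * gpu_count * price_per_gpu_hour


# The example above: 8 minutes on 1 GPU at $5.5/GPU-hour.
print(fine_tuning_cost(8, 1, 5.5))  # 1.375
```

With these assumptions, 16 minutes on the same GPU would be billed as 30 minutes (0.5 GPU-hour).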

Model as a Service

Service: Model as a Service
  • Billing logic: Based on usage volume (number of tokens/characters)
  • Minimum billable usage: 1 token
  • Time to charge: When usage reaches 1M tokens or API ends

Billing Equation:

  • Usage tokens: Input tokens + Output tokens
  • Cost: Input tokens × input price / 1M + Output tokens × output price / 1M

Example:

You use the model Qwen3-32B, priced at $0.165/1M input tokens and $0.187/1M output tokens, and consume 13,394 input tokens and 127 output tokens.

  • Usage tokens: 13,394 input tokens + 127 output tokens = 13,521 tokens
  • Cost: (13,394 × $0.165 + 127 × $0.187) / 1,000,000 ≈ $0.0022
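
The token-based equation can be checked with a short Python sketch. The helper name `token_cost` and its parameter names are hypothetical and chosen only for illustration; prices are assumed to be quoted per 1M tokens, as in the example.

```python
TOKENS_PER_PRICING_UNIT = 1_000_000  # prices are quoted per 1M tokens


def token_cost(input_tokens, output_tokens, input_price_per_1m, output_price_per_1m):
    """Estimate a token-based charge (hypothetical helper, not a platform API)."""
    input_cost = input_tokens * input_price_per_1m / TOKENS_PER_PRICING_UNIT
    output_cost = output_tokens * output_price_per_1m / TOKENS_PER_PRICING_UNIT
    return input_cost + output_cost


# The example above: 13,394 input tokens and 127 output tokens on Qwen3-32B.
cost = token_cost(13_394, 127, 0.165, 0.187)
print(f"${cost:.4f}")  # $0.0022
```

Input and output tokens are priced separately, so the two counts cannot simply be summed and multiplied by a single rate.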

GPU Container

Service: GPU Container
  • Billing logic (GPU Instance): Number of GPUs, usage duration (minutes), status: Running
  • Billing logic (Persistent Storage): GB capacity, usage duration (minutes), status: Running/Stopped
  • Minimum billable usage: 1 minute
  • Time to charge: Every 15 minutes

Billing Equation:

  • Usage duration: In hours (not rounded up)
  • Storage Volume: In GB
  • GPU Cost: Usage hours × GPU hourly price
  • Storage Cost: Usage hours × GB × storage hourly price
  • Total Cost: GPU Cost + Storage Cost

Example:

You run a container for 30 minutes. The container has one H100 GPU (15 CPUs, 250 GB RAM, 1 GPU) and 1000 GB of temporary storage, priced at $2.31/hour, plus 1000 GB of persistent storage priced at $0.00013/GB/hour.

  • Usage duration: 30 minutes = 0.5 hours
  • GPU Cost: 0.5 × $2.31 = $1.155
  • Storage Cost: 0.5 × 1000 × $0.00013 = $0.065
  • Total: $1.155 + $0.065 = $1.22
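
To re-derive the figures, the billing equation for GPU Container can be applied directly to the inputs of the example. The sketch below is illustrative only: `container_cost` and its parameters are hypothetical names, and it assumes the hourly GPU price already covers the whole container configuration, as in the example.

```python
def container_cost(usage_minutes, gpu_hourly_price, storage_gb, storage_price_per_gb_hour):
    """Estimate a GPU Container charge (hypothetical helper, not a platform API)."""
    usage_hours = usage_minutes / 60  # duration is not rounded up
    gpu_cost = usage_hours * gpu_hourly_price
    storage_cost = usage_hours * storage_gb * storage_price_per_gb_hour
    return gpu_cost, storage_cost, gpu_cost + storage_cost


# Inputs from the example: 30 minutes, $2.31/hour, 1000 GB at $0.00013/GB/hour.
gpu_cost, storage_cost, total = container_cost(30, 2.31, 1000, 0.00013)
print(f"GPU: ${gpu_cost:.4f}  Storage: ${storage_cost:.4f}  Total: ${total:.4f}")
```

Because persistent storage is billed in both the Running and Stopped states, a stopped container would still accrue the storage term while the GPU term drops to zero.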