What’s New on FPT AI Factory
Table of Contents
We continue to advance the FPT AI Factory platform to improve scalability, performance, and operational efficiency. This release, as of December 12, 2025, introduces new features and optimizations designed to enhance the smoothness and efficiency of your workflows.

Accelerate LLM workflows with new optimization techniques and gain real-time visibility through Grafana-integrated UI Logs.
New feature
1. Full support for the Qwen3VL model
Allow users to leverage state-of-the-art multimodal capabilities of the Qwen3VL model family for tasks such as visual understanding across AI Studio and related services.

2. Support Download Model Catalog by SDK
Enable Model Catalog download via SDK gives developers a faster, automated way to integrate and manage models and improve workflow efficiency.

Boost automation, ease of use, and performance, helping customers work faster and smarter with AI Notebook.
New feature
1. Automated Lab Version Upgrade
Remove manual steps for deleting old labs and remapping during version upgrades, saving time and reducing errors.
2. Event Notification Scheduling
Enable scheduled system and feature announcements directly in AI Notebook, ensuring users stay informed without disruption.

3. Notebook Gallery
Offer ready-to-use notebooks for common use cases across various topics, allowing quick reference and execution to accelerate development.


4. GPU Quota Control
Introduce per-tenant GPU usage limits for better resource allocation and cost management, ensuring fair and efficient utilization.
Achieve operational stability with new upgrades in LiteLLM engine, billing, kafka, and top-up services.
New feature
1. Infrastructure & API Stability
2. Production Go-Live & Core Services

Billing
Foster real-time tracking, transparent cost insights, and a centralized dashboard for all usage-related information.
New feature
Product Usage: users can better manage budgets, optimize resource consumption, and make data-driven decisions with confidence.
A centralized interface that displays:

Use case