About Us
Highlights FPT Cloud Server FPT AI Factory FPT Network FPT Cloud Backup & DR FPT Storage FPT Security FPT Container FPT Database FPT Cloud Monitoring FPT Data Suite FPT.AI

Show all

Object Storage

Secure, unlimited storage to ensures efficiency as well as high and continuous data access demand.

GPU Server

Virtual server integration for 3D Rendering, AI or ML

FPT Load Balancing

Enhance application capacity and availability.

FPT AI Factory

Access to an all-inclusive stack for AI development, driven by NVIDIA’s powerful technology!

Cloud WAF

FPT Web Application Firewall provides powerful protection for web applications

Cloud Server

Advanced virtual server with rapid scalability

Backup Service

Backup and restore data instantly, securely and maintain data integrity.

Cloud Server

Advanced virtual server with rapid scalability

FPT AI Factory

Access to an all-inclusive stack for AI development, driven by NVIDIA’s powerful technology!

FPT Load Balancing

Enhance application capacity and availability.

Backup Service

Backup and restore data instantly, securely and maintain data integrity.

Disaster Recovery Service

Recovery, ensuring quick operation for the business after all incidents and disasters.

Block Storage

Diverse throughput and capacity to meet various business workloads.

Object Storage

Secure, unlimited storage to ensures efficiency as well as high and continuous data access demand.

Cloud WAF

FPT Web Application Firewall provides powerful protection for web applications

FPT Cloud WAPPLES

Intelligent and Comprehensive Virtual Web Application Firewall - Security Collaboration between FPT Cloud and Penta Security.

Next-Gen Firewall

The Next generation firewall security service

Container Registry

Easily store, manage, deploy, and secure Container images

Kubernetes Engine

Safe, secure, stable, high-performance Kubernetes platform

FPT Database for MongoDB

Provided as a service to deploy, monitor, backup, restore, and scale MongoDB databases on cloud.

FPT Database for Redis

Provided as a service to deploy, monitor, backup, restore, and scale Redis databases on cloud.

PostgreSQL Database Engine

Provided as a service to deploy, monitor, backup, restore, and scale PostgreSQL databases on cloud.

Monitoring

System Monitoring Solution anywhere, anytime, anyplatform

FPT Data Suite

Helps reduce operational costs by up to 40% compared to traditional BI solutions, while improving efficiency through optimized resource usage and infrastructure scaling.
Pricing
Partner
- Tech news
- White Paper
Event

Service

Cloud Server

FPT AI Factory

FPT Load Balancing

Monitoring

FPT Data Suite

Cloud Insights

ENG

Tiếng Việt English 中文 (中国) 日本語

All documents

GPU Container

FPT Monitoring

Incident Management

Billing

AI Factory Billing

Billing

AI Marketplace

AI Inference

AI Studio

FPT AI Inference

AI Inference

AI Infrastructure

FPT Security

FPT Cloud Server

FPT DevSecOps Services

FPT Integration

FPT Database Engine

Managed – FPT Database Engine

FPT Cloud Backup & DR

FPT Storage

FPT Network

FPT Container

Templates

Updated on 16 Sep 2025

Print: Export: PDF

Templates are used to launch images as containers and define the required container disk size, volume, volume paths, and ports needed. You can also define environment variables and startup commands within the template.

Built-in Templates

These templates are maintained by FPT AI Factory. We now offer built-in templates:

vLLM v0.8.1
- Intended Use: This vLLM container image is built and maintained by AI Factory. This template enables high throughput model inference using GPU resources with a state-of-the-art engine.

Environment Variables: Some more useful environment variables are provided for container customization.

Variable	Type	Description
HUGGING_FACE_HUB_TOKEN	string	Your Hugging Face User Access Token

Startup commands:

Command	Arguments
python	--model deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B --dtype bfloat16 --gpu-memory-utilization 0.9 --max-model-len 8192 --api-key your_api_key
-m	/
vllm.entrypoints.openai.api_server	/

Port:

Type	Port
HTTP	8000

Jupyter Notebook

Intended Use: This template provides Jupyter Lab to adopt remote development for AI/Data Scientists without local hardware limitations.
Environment Variables: Some more useful environment variables are provided for container customization.

Variable	Type	Default	Description
USERNAME	string	admin	Username to access Jupyter Notebook
PASSWORD	string		Password to access Jupyter Notebook (Generated by system)

Port:

Type	Port
HTTP	8000

Ollama WebUI

Intended Use: This template supports running various large language models (LLM) programs, including Ollama and APIs compatible with OpenAI, making it easy for users to customize based on workflow.
Port:

Type	Port
HTTP	8080

Ollama

Intended Use: This template enables high-throughput model inference using GPU resources with state-of-the-art engine.
Environment Variables: Some more useful environment variables are provided for container customization.

Variable	Type	Description
API_TOKEN	string	Auto-authenticate with external services (Generated by system)

Port:

Type	Port
HTTP	8000

Code Server

Intended Use: This template offers cloud-based VS Code with GPU to train, test, and debug AI models remotely with full IDE capabilities.
Environment Variables: Some more useful environment variables are provided for container customization.

Variable	Type	Default	Description
PUID	int	0	UserID
PGID	int	0	GroupID
TZ	string	Etc/UTC	Your timezone
PROXY_DOMAIN	string	code-server.my.domain	Domain will be proxied for subdomain proxying
DEFAULT_WORKSPACE	string	/	Default folder opened when accessing code-server
PASSWORD	string	/	Password to access code-server (Generated by system)

Port:

Type	Port
HTTP	8443

Ubuntu

Intended Use: This is a minimal Ubuntu with several useful additions to improve your user experience. While the root account is available as usual, we have created a normal system user for your convenience.
Port:

Type	Port
TCP	22

Additional Software
- Docker: Docker is installed and automatically starts for you. The default user has been added to the docker group, allowing you to manage containers without requiring root privileges.
- Nvidia CUDA: Nvidia driver version 550.90.07 is preinstalled to the container providing CUDA version 12.4.

vLLM v0.10.1
- Intended Use: This vLLM container image is built and maintained by AI Factory. This template enables high throughput model inference using GPU resources with state-of-the-art engine.
- Environment Variables: Some more useful environment variables are provided for container customization.

Variable	Type	Description
HUGGING_FACE_HUB_TOKEN	string	Your Hugging Face User Access Token

Startup commands:

Command	Arguments
python	--model openai/gpt-oss-20b --dtype bfloat16 --gpu-memory-utilization 0.9 --max-model-len 8192 --api-key your_api_key
-m	/
vllm.entrypoints.openai.api_server	/

Port:

Type	Port
HTTP	8000

NVIDIA Pytorch 25.03

Port:

Type	Port
TCP	22
HTTP	8888

Startup commands:

Command	Arguments
/bin/bash	/
-c	/
/usr/sbin/sshd && jupyter lab --ip=0.0.0.0 --port=8888 --allow-root --NotebookApp.token='your_token' --NotebookApp.password='' --notebook-dir=/workspace	/

Tensorflow 2.19.0

Port:

Type	Port
TCP	22

NVIDIA CUDA 12.9.1

Port:

Type	Port
TCP	22

Custom Templates

You can use your own Docker image by clicking "Custom Template" and overriding your own image:tag. If your image is from a private Docker repository, make sure to provide your username and password for authentication.

Alt text

How to monitor container

Storage

Cookie	Duration	Description
cookielawinfo-checbox-analytics	11 months
cookielawinfo-checbox-functional	11 months
cookielawinfo-checbox-others	11 months
cookielawinfo-checkbox-necessary	11 months
cookielawinfo-checkbox-performance	11 months
viewed_cookie_policy	11 months