All documents
Step 1: Select AI Platform → Model Serving → Deployment → New Deployment.
Step 2: Enter the Model Settings information, then click Next
Model Information: AI deployment information. Select Model Type:
Model included in Image: AI Model included in Container Image
Model not included in Image: AI Model not included in Container Image
NVIDIA NGC Catalog: AI Model using NVIDIA NGC technology
If Model Type is Model included in Image, select Model Source:
Model Source: Model selection source. Select Model Source:
Model Catalog: Centralized repository of public models, shared for users to use.
Private Model: Private repository of users, can be used internally within the organization.
Custom Model: Custom model on the Internet, currently only supporting Hugging Face models.
Model URL: Path to the custom model
Model Token: User authentication token on the platform of the selected Custom Model (e.g., Hugging Face)
If you select Model Type as Model included in Image or Model not Included in Image, select Image Information:
If Model Type is NVIDIA NIM – NGC Catalog, select deployment information:
Step 3: Enter the Deployment Settings information, then click Next.
Advance Settings: Enter advanced configurations for Deployment. Click See More to configure.
Deployment Strategy: Choose a deployment strategy for K8S. Available strategies include:
Startup Command: Configure the startup command for instances
Environment Variable: Define environment variables for the instance
Nodes Selector: Select specific worker nodes/worker groups for deployment
Tags: Assign tags to the Deployment
Step 4: Enter configuration details for Traffic Settings, then click Next
Step 5: Review the entered information and click Confirm to create the Deployment cluster
Cookie | Duration | Description |
---|---|---|
cookielawinfo-checbox-analytics | 11 months | |
cookielawinfo-checbox-functional | 11 months | |
cookielawinfo-checbox-others | 11 months | |
cookielawinfo-checkbox-necessary | 11 months | |
cookielawinfo-checkbox-performance | 11 months | |
viewed_cookie_policy | 11 months |