Creating a Deployment

On this page

Initiate New Deployment
Configure Infrastructure
Adding Scaling Metrics
Deploy

Initiate New Deployment

From the main menu, select the Deployments tab

Click on the Create button to start a new deployment
- Enter Deployment Name: Provide a unique name for your deployment.
- Select Cluster: Choose the cluster where you want to deploy your model.
- Select Model: Choose the model you want to deploy from the list.

If deploying the model on your own infrastructure (BYOC), select the cluster details that were previously set up in the clusters stage.If deploying as a Private Endpoint, select Simplismart Infrastructure as the cluster.

Configure Infrastructure

Choose the appropriate accelerators for your deployment, such as GPUs/CPUs.

Adding Scaling Metrics

Specify the scaling metrics that will be used to auto-scale your deployment.
Set the threshold values for each metric to trigger scaling actions.

Deploy

Click on the Deploy Model button to initiate the deployment process.
Check the right part of the screen to see the creation status of your deployment.
Monitor the deployment status to know when the model is ready for usage.
The status will show deployed once done. Your model is now ready for use.

Adding a Custom Model Deploying a Custom Model

Get Started

Types of Inference

Playground

Model Compilation

Deployment

Benchmarking

Training

Settings

References

Initiate New Deployment

Configure Infrastructure

Adding Scaling Metrics

Deploy

Get Started

Types of Inference

Playground

Model Compilation

Deployment

Benchmarking

Training

Settings

References

​Initiate New Deployment

​Configure Infrastructure

​Adding Scaling Metrics

​Deploy

Initiate New Deployment

Configure Infrastructure

Adding Scaling Metrics

Deploy