Initiate New Deployment

From the main menu, select the Deployments tab

  • Click on the Create button to start a new deployment
    • Enter Deployment Name: Provide a unique name for your deployment.
    • Select Cluster: Choose the cluster where you want to deploy your model.
    • Select Model: Choose the model you want to deploy from the list.

If deploying the model on your own infrastructure (BYOC), select the cluster details that were previously set up in the clusters stage.

If deploying as a Private Endpoint, select Simplismart Infrastructure as the cluster.


Configure Infrastructure

  • Choose the appropriate accelerators for your deployment, such as GPUs/CPUs.


Adding Scaling Metrics

  • Specify the scaling metrics that will be used to auto-scale your deployment.
  • Set the threshold values for each metric to trigger scaling actions.


Deploy

  • Click on the Deploy Model button to initiate the deployment process.
  • Check the right part of the screen to see the creation status of your deployment.
  • Monitor the deployment status to know when the model is ready for usage.
  • The status will show deployed once done. Your model is now ready for use.