Private endpoints provide dedicated infrastructure for your model deployments, ensuring better performance and reliability.

Benefits of using a private endpoint

  • Dedicated resources: No sharing of compute resources with other users.
  • Enhanced performance: Improved response times and throughput.
  • Higher reliability: Reduced risk of downtime and performance degradation.

Deploying your model on a private endpoint

To deploy your model on a private endpoint, follow the steps below. Each step links to detailed instructions.


Model Optimisation

  1. You can either choose an available model from our model marketplace, or add your own.
  2. Optimise your model for deployment by visiting the Models page and using our optimisation tools.

Model Deployment

You can deploy your optimised model on a private endpoint by selecting Simplismart as your cloud provider. Click here for detailed deployment steps.


Inferencing

You can invoke your deployed model from the API tab of its deployment.
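
As a rough illustration of calling a deployed model over HTTP, here is a minimal Python sketch that uses only the standard library. Everything specific in it is an assumption for illustration: the endpoint URL, the bearer-token header, the `prompt` payload field, and the helper names `build_request` and `invoke` are all hypothetical. The API tab of your deployment shows the actual URL, authentication method, and request schema, so substitute those values.

```python
import json
import urllib.request

# Hypothetical placeholders: copy the real values from the API tab of your deployment.
ENDPOINT_URL = "https://example.invalid/v1/predict"
API_TOKEN = "YOUR_API_TOKEN"


def build_request(prompt: str, url: str = ENDPOINT_URL,
                  token: str = API_TOKEN) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for a deployed model."""
    # The payload shape is an assumption; use the schema shown in the API tab.
    body = json.dumps({"prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={
            "Content-Type": "application/json",
            # Bearer authentication is assumed here, not confirmed by the docs.
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def invoke(prompt: str, timeout: float = 30.0) -> dict:
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(build_request(prompt), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Keeping request construction separate from sending makes it easy to inspect the headers and payload before any network call is made. Once the placeholders are replaced, `invoke("Hello")` sends the request.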

Want the deployment to run in your own cluster? Here’s how.