
Benefits of using a Dedicated endpoint
- Dedicated resources: No sharing of compute resources with other users.
- Bring your own models: Deploy your own custom or finetuned models.
- Enhanced performance: Improved response times and throughput.
- Higher reliability: Reduced risk of downtime and performance degradation.
Deploying your model on a Dedicated Endpoint
To deploy your model on a dedicated endpoint, follow these outlined processes. Each step includes a link for detailed instructions, ensuring a smooth launch and use of your deployed model.1
Model Optimisation
- Optimise your model for deployment by visiting the Models page and using our optimisation tools.
- You can either choose a pre-optimized model from our Model Marketplace, or add your own.
Pre-optimized models are faster to deploy and support one-click deployment for quick setup and usage.
2
Model Deployment
You can deploy your optimised model on a dedicated endpoint by selecting your cloud provider as
Simplismart Cloud
, click here for detailed Deployment steps.3
Inferencing
You can invoke your deployed models from the API tab of the model deployment.
For dedicated endpoints, pricing depends on GPU usage, with different GPUs priced separately. For detailed pricing information, refer to the Pricing section on our website.