simplismart model-repos.
List Model Repos
Lists model repositories with optional filtering.

| Option | Description |
|---|---|
| --offset N | Pagination offset (default: 0) |
| --count N | Page size (default: 5) |
| --model-id UUID | Fetch a specific model repo by ID |
| --name NAME | Filter by model name (contains match) |
| --status STATUS | Filter by status |
| --model-type TYPE | Filter by model type |
| --created-by EMAIL | Filter by creator email |

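
The --offset and --count options describe offset-based pagination. The following is a minimal Python sketch of walking every page under that scheme. It assumes a hypothetical `fetch_page(offset, count)` callable standing in for whatever invokes the list command; the in-memory data and the stop-on-short-page rule are illustrative assumptions, not documented CLI behaviour.

```python
from typing import Callable, Iterator, List


def iter_all(fetch_page: Callable[[int, int], List[dict]], count: int = 5) -> Iterator[dict]:
    """Yield every item by advancing the offset one page at a time."""
    offset = 0  # matches the documented default for --offset
    while True:
        page = fetch_page(offset, count)
        yield from page
        if len(page) < count:  # assumption: a short page means no more results
            break
        offset += count


# Hypothetical stand-in for the list command, backed by in-memory data.
REPOS = [{"name": f"repo-{i}"} for i in range(12)]


def fake_fetch(offset: int, count: int) -> List[dict]:
    """Return the slice of REPOS that a page at this offset would contain."""
    return REPOS[offset:offset + count]


names = [repo["name"] for repo in iter_all(fake_fetch)]
```

With twelve items and the default page size of 5, the loop requests offsets 0, 5, and 10 and stops after the third, shorter page.
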
Status Options

| Value | Description |
|---|---|
| SUCCESS | Model repo is ready |
| FAILED | Model repo creation failed |
| DELETED | Model repo has been deleted |
| PROGRESSING | Model repo is being created |

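
When scripting against these statuses, it can help to model them explicitly. The sketch below is one way to do that in Python; the class name `RepoStatus` and the helper `should_keep_polling` are hypothetical, and it assumes PROGRESSING is the only state that can still change.

```python
from enum import Enum


class RepoStatus(str, Enum):
    """The four status values listed in the table above."""
    SUCCESS = "SUCCESS"
    FAILED = "FAILED"
    DELETED = "DELETED"
    PROGRESSING = "PROGRESSING"

    @property
    def is_terminal(self) -> bool:
        # Assumption: PROGRESSING is the only state that can still change.
        return self is not RepoStatus.PROGRESSING


def should_keep_polling(status: str) -> bool:
    """Return True while the repo is still being created.

    Raises ValueError for a status string that is not in the table.
    """
    return not RepoStatus(status).is_terminal
```
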
Get Model Repo
Gets details of a specific model repository.

| Option | Description |
|---|---|
| --model-id UUID | Model repo UUID (required) |

Get Model Profiles
Generates or resolves the model profile from a source type and path. Returns the optimization combinations available for that model (quantizations, machines, and parallelism options).

| Option | Description |
|---|---|
| --type TYPE | Source type (required) - e.g., hf, s3, azure_blob |
| --path PATH | Source path (required) |
| --secret-id UUID | Secret UUID (optional) |

Returns a profile object containing:
- model_info: type, architecture, param_size, default_quantization
- quantizations: map of quantization (e.g. float16, fp8, int4) to list of supported machines
- combinations: list of valid (quantization, machine, parallelism) combinations; each has quantization, machine, tensor_parallelism, data_parallelism, pipeline_parallelism, expert_parallelism (each a list of allowed values)

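
To make the shape of the profile object concrete, here is a small Python sketch that filters the combinations list for a given quantization and machine. The field names follow the description above; every value in the sample profile is made up for illustration, and the helper `matching_combinations` is hypothetical.

```python
from typing import Any, Dict, List

# Hypothetical profile: keys follow the documented fields, values are invented.
profile: Dict[str, Any] = {
    "model_info": {
        "type": "llm",
        "architecture": "LlamaForCausalLM",
        "param_size": "8B",
        "default_quantization": "float16",
    },
    "quantizations": {
        "float16": ["nvidia-h100"],
        "fp8": ["nvidia-h100"],
    },
    "combinations": [
        {
            "quantization": ["float16"],
            "machine": ["nvidia-h100"],
            "tensor_parallelism": [1, 2],
            "data_parallelism": [1],
            "pipeline_parallelism": [1],
            "expert_parallelism": [1],
        },
        {
            "quantization": ["fp8"],
            "machine": ["nvidia-h100"],
            "tensor_parallelism": [1],
            "data_parallelism": [1, 2],
            "pipeline_parallelism": [1],
            "expert_parallelism": [1],
        },
    ],
}


def matching_combinations(profile: Dict[str, Any], quantization: str, machine: str) -> List[Dict[str, Any]]:
    """Return the combinations that allow both the quantization and the machine."""
    return [
        combo
        for combo in profile["combinations"]
        if quantization in combo["quantization"] and machine in combo["machine"]
    ]


fp8_options = matching_combinations(profile, "fp8", "nvidia-h100")
```

Because each field in a combination is a list of allowed values, the membership test uses `in` rather than equality.
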
Bring Your Own Container Model Repo
Creates a model repository from your own container image.

| Option | Description |
|---|---|
| --name NAME | Model repo name (required) |
| --source-type TYPE | Source type (required) - docker_hub, depot, nvidiadockersecret |
| --runtime-gpus N | Runtime GPU count (required) |
| --source-secret UUID | Source secret UUID (required for registry sources) |
| --registry-path PATH | Registry path/repo (required) |
| --docker-tag TAG | Image tag (required) |
| --env KEY=VALUE | Environment variable (can be repeated) |
| --healthcheck JSON/@file | Inline JSON or file path for healthcheck |
| --ports JSON/@file | Inline JSON or file path for ports |
| --metrics-path JSON/@file | Inline JSON array or file path for metrics paths |
| --deployment-custom-configuration JSON/@file | Custom deployment config |

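
Several options above accept either inline JSON or a file reference (JSON/@file), and --env takes repeated KEY=VALUE pairs. The Python sketch below shows one plausible way such values could be interpreted. It assumes a leading `@` marks a file path and that a value may itself contain `=`; both helpers are hypothetical and are not the CLI's actual implementation.

```python
import json
from pathlib import Path
from typing import Any, Dict, List


def load_json_arg(value: str) -> Any:
    """Parse inline JSON, or read JSON from a file when the value starts with '@'."""
    if value.startswith("@"):  # assumption: '@' prefix marks a file path
        return json.loads(Path(value[1:]).read_text(encoding="utf-8"))
    return json.loads(value)


def parse_env(pairs: List[str]) -> Dict[str, str]:
    """Turn repeated KEY=VALUE strings into a dict, splitting on the first '='."""
    env: Dict[str, str] = {}
    for pair in pairs:
        key, sep, val = pair.partition("=")
        if not sep or not key:
            raise ValueError(f"expected KEY=VALUE, got {pair!r}")
        env[key] = val
    return env


ports = load_json_arg("[8000, 9090]")              # inline JSON array
env = parse_env(["LOG_LEVEL=debug", "OPTS=a=b"])   # second value keeps its '='
```

Splitting on the first `=` only keeps values such as connection strings intact.
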
Create Private Compile Model Repo
Creates a private compile model repository: the platform compiles the model from a source (e.g. Hugging Face) using config files for model, optimisation, and pipeline.

| Option | Description |
|---|---|
| --name NAME | Model repo name (required) |
| --source-type TYPE | Source type (required) - e.g. huggingface |
| --source-url URL | Source path/URL (required) - e.g. HF repo id |
| --mode MODE | Compilation mode (default: public_hf) - e.g. public_hf, private_hf, aws, gcp, public_url, simplismart |
| --model-class CLASS | Model class (required) - e.g. LlamaForCausalLM |
| --accelerator-type TYPE | Accelerator type (required) - e.g. nvidia-h100 |
| --accelerator-count N | Accelerator count (default: 0) |
| --org-id UUID | Org UUID (optional if inferred from PG token) |
| --cloud-account UUID | Cloud account UUID |
| --source-secret UUID | Secret UUID for source access |
| --lora-secret UUID | LoRA secret UUID |
| --description TEXT | Description |
| --short-description TEXT | Short description |
| --dropdown-description TEXT | Dropdown description |
| --machine-type TYPE | Machine type |
| --region REGION | Region |
| --resource-group GROUP | Resource group |
| --processing-mode MODE | One of: SYNC, ASYNC, REALTIME_ASYNC |
| --model-family FAMILY | Model family |
| --env KEY=VALUE | Environment variable (repeatable) |
| --model-config JSON/@file | Model config |
| --optimisation-config JSON/@file | Optimisation config |
| --pipeline-config JSON/@file | Pipeline config |
| --tasks JSON/@file | Tasks list |
| --tags JSON/@file | Tags object |
| --output-metadata JSON/@file | Output metadata |
| --additional-details JSON/@file | Additional details |

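
The required options, defaults, and allowed values in the table above can be mirrored in a wrapper script to catch mistakes before the CLI is called. The following is a minimal argparse sketch covering a subset of the options. It is an illustration only, not the CLI's own parser; the `prog` name and the sample argument values (including the `org/model` repo id) are placeholders.

```python
import argparse

# Hypothetical pre-flight parser mirroring a subset of the documented options.
parser = argparse.ArgumentParser(prog="private-compile-sketch")
parser.add_argument("--name", required=True)
parser.add_argument("--source-type", required=True)
parser.add_argument("--source-url", required=True)
parser.add_argument("--mode", default="public_hf")  # documented default
parser.add_argument("--model-class", required=True)
parser.add_argument("--accelerator-type", required=True)
parser.add_argument("--accelerator-count", type=int, default=0)  # documented default
parser.add_argument("--processing-mode", choices=["SYNC", "ASYNC", "REALTIME_ASYNC"])
parser.add_argument("--env", action="append", default=[], metavar="KEY=VALUE")  # repeatable

args = parser.parse_args([
    "--name", "my-llama",
    "--source-type", "huggingface",
    "--source-url", "org/model",  # placeholder repo id
    "--model-class", "LlamaForCausalLM",
    "--accelerator-type", "nvidia-h100",
    "--env", "A=1",
    "--env", "B=2",
])
```

Omitting any of the required options, or passing a --processing-mode outside the three listed values, makes `parse_args` exit with a usage error.
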
Delete Model Repo
Deletes a model repository.

| Option | Description |
|---|---|
| --model-id UUID | Model repo UUID (required) |