Skip to main content
GET
/
job
/
get
Retrieve details of a specific LLM/VLM training job
curl --request GET \
  --url https://training-suite.app.simplismart.ai/job/get/ \
  --header 'Authorization: Bearer <token>'
{
  "request_id": "<string>",
  "experiment_name": "<string>",
  "status": "QUEUED",
  "created_at": "2023-11-07T05:31:56Z",
  "updated_at": "2023-11-07T05:31:56Z",
  "started_at": "2023-11-07T05:31:56Z",
  "completed_at": "2023-11-07T05:31:56Z",
  "progress": 123,
  "dataset_config": {
    "preprocessing": {
      "lazy_tokenize": true,
      "streaming": true,
      "prompt": {
        "system": "<string>",
        "max_length": 123,
        "template": "<string>"
      }
    },
    "split": {
      "type": "random",
      "ratios": [
        123
      ]
    }
  },
  "model_details": {
    "base_model": "<string>",
    "ownership": "public",
    "source_type": "hf",
    "model_type": "llm",
    "quantization": {
      "quant_bits": 4
    }
  },
  "train_config": {
    "type": "sft",
    "torch_dtype": "float16",
    "task_type": "causal_lm",
    "train_type": "lora",
    "tuner_backend": "<string>",
    "hyperparameters": {
      "num_epochs": 123,
      "per_device_train_batch_size": 123,
      "per_device_eval_batch_size": 123,
      "gradient_checkpointing": true,
      "save_steps": 123,
      "save_total_limit": 123,
      "eval_steps": 123,
      "logging_steps": 123,
      "learning_rate": 123,
      "dataloader_num_workers": 123
    },
    "adapter_config": {
      "r": 123,
      "alpha": 123,
      "dropout": 123,
      "targets": [
        "<string>"
      ]
    },
    "distributed": {
      "type": "ddp"
    }
  },
  "dataset_details": {
    "dataset_name": "<string>",
    "dataset_path": "<string>",
    "dataset_description": "<string>",
    "dataset_type": "jsonl",
    "dataset_format": "sharegpt",
    "source_type": "s3",
    "ownership": "public",
    "secret_id": "<string>",
    "region": "<string>"
  },
  "infra_config": {
    "gpu_type": "h100",
    "gpu_count": 2,
    "infra_type": "<string>",
    "node_count": 2
  },
  "output_model_path": "<string>",
  "error_message": "<string>"
}

Authorizations

Authorization
string
header
required

JWT token for authentication

Headers

Authorization
string
required

Bearer token for authentication and authorization.

Query Parameters

org_id
string
required

Organization ID to which the training job belongs.

request_id
string
required

Unique identifier for the training job request.

Response

The requested training job details were retrieved successfully.

request_id
string

Unique identifier for the training job request

experiment_name
string

Name of the training experiment

status
enum<string>

Current status of the training job

Available options:
QUEUED,
RUNNING,
COMPLETED,
FAILED,
CANCELED
created_at
string<date-time>

Timestamp when the job was created

updated_at
string<date-time>

Timestamp when the job was last updated

started_at
string<date-time> | null

Timestamp when the job started running

completed_at
string<date-time> | null

Timestamp when the job completed

progress
number

Progress percentage of the training job (0-100)

dataset_config
object
model_details
object
train_config
object
dataset_details
object
infra_config
object
output_model_path
string | null

Path to the trained model output (when completed)

error_message
string | null

Error message if the job failed