Skip to content

JobMonitor.get_status()

Retrieves the current status of a job, along with the associated model identifier, job metadata, and results.

The job status can be one of the following:

StatusDescription
QUEUEDA job is queued and it is waiting for execution.
COMPLETEDA job has been completed (e.g., the model has been trained).
RUNNINGA job is currently beeing executed.
CANCELEDThe job has been canceled by the user.
FAILEDThe job has failed (an error occured).

The method returns a JSON object containing the following fields:

FieldDescription
job_idA unique identifier for the job. This ID can be used to retrieve the job via the JobManager’s fetch_job method.
statusThe current state of the job (e.g., queued, completed, running, canceled, failed)
started_atThe timestamp indicating when the job was created.
finished_atThe timestamp indicating when the job finished execution.
updated_atThe timestamp of the most recent status update for the job.
experiment_nameThe experiment name that uniquely identifies the dataset used in the job, following the format dataset_name_task_type_task_name.
job_typeThe type of job being executed (train, inference or train_inference). Jobs initiated by the fit() method are of type train, jobs initiated by the predict() method are of type inference, while jobs initiated by the fit_predict() method are of type train_inference.
model_run_idA unique identifier assigned to the trained model associated with the job. For train or train_inference jobs, this is the ID of the model that was trained in the job; for inference job, it is the ID of the model used for inference.
resultA dictionary containing details such as per-epoch validation metrics, training metrics, best epoch, and related information.
export_pathsA dictionary containing file paths to the exported predictions and embeddings generated by the job. This is only relevant for inference and train_inference jobs.
source_viewsA dictionary containing the secure data views of the predictions and embeddings generated by the job. This is only relevant for inference and train_inference jobs.
# Create a job and send it to the queue
train_job = trainer.fit(dataset=dataset)
train_job.get_status()