Run Dossier

2026-02-22_15-11-12_8073

Training run for SD 2.1 Joint

8 metrics · 7,468 train · 0 val

interrupted · Feb 22, 15:11 · Duration: 35h 15m · Python 3.11.6
train_joint.py --config /cluster/home/drothenpiele/models/dehaze-baseline/.euler-launches/euler_launch_95220a54-8a32-405b-b07c-5638fb896894/joint_vkitti.yaml
lr 1e-5 · batch 1 · epochs 200 · precision fp16 · model stable-diffusion-2-1 · seed 42 · wd 0.01000 · grad accum 4 · warmup 0.05000 · ema no · grad clip 1
ID 58125873 · Job sd_train_concat · Part gpuhe.120h · Node eu-g6-016 · CPU 8 · GPU 1
out/train/sd21-joint/runs/2026-02-22_15-11-12_8073
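
The hyperparameters above imply a few derived quantities that are useful when reading the charts and the checkpoint list. The following is a minimal sketch, not taken from train_joint.py. It assumes that 350 steps make up one epoch (inferred from the checkpoint list in this dossier, where epoch 9 is saved at step 3500 and epoch 19 at step 7000, with zero-based epochs) and that warmup 0.05 is a fraction of the total step count. Neither assumption is confirmed by the dossier.

```python
# Values copied from the dossier's hyperparameter and job lines.
batch_size = 1
grad_accum = 4
num_gpus = 1
epochs = 200
warmup_ratio = 0.05

# Assumption: zero-based epochs, so the checkpoint at epoch 9 / step 3500
# marks the end of the 10th epoch.
steps_per_epoch = 3500 // (9 + 1)

# Samples contributing to one optimizer update.
effective_batch = batch_size * grad_accum * num_gpus

# Planned length of the full run in steps.
total_steps = epochs * steps_per_epoch

# Assumption: warmup is expressed as a fraction of total steps.
warmup_steps = round(warmup_ratio * total_steps)

print(effective_batch, steps_per_epoch, total_steps, warmup_steps)
```

Under these assumptions the last listed checkpoint (epoch 169, step 59500) sits at 85% of the planned 70000 steps.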
Error
Signal: SIGTERM
Metrics 8/8
gpu (4)
  mem (3)
    total/gb
    used/gb
    util/pct
  util/pct
dehaze/loss
depth/loss
loss
lr

Charts (each plots the train series, smoothed and raw, for run 2026-02-22_15-11-12_8073): Loss, Dehaze Loss, Depth Loss, Gpu Mem Total Gb, Gpu Mem Used Gb, Gpu Mem Util Pct, Gpu Util Pct, Lr

No output snapshots found for this run.

Outputs are generated during training and saved to outputs/epoch_N_step_M/ directories.
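
To check by hand whether any snapshot directories exist, the naming scheme above can be matched with a short script. This is a sketch only: the function name `list_snapshots` is hypothetical, and it assumes N and M in `epoch_N_step_M` are non-negative integers.

```python
import re
from pathlib import Path

# Matches directory names of the form epoch_N_step_M (N, M assumed integers).
SNAPSHOT_RE = re.compile(r"^epoch_(\d+)_step_(\d+)$")


def list_snapshots(outputs_dir):
    """Return (epoch, step, path) tuples for snapshot dirs, sorted numerically."""
    root = Path(outputs_dir)
    if not root.is_dir():
        # A missing outputs/ directory is treated as "no snapshots".
        return []
    found = []
    for child in root.iterdir():
        match = SNAPSHOT_RE.match(child.name)
        if match and child.is_dir():
            found.append((int(match.group(1)), int(match.group(2)), child))
    # Sort by epoch, then step, so epoch_10 comes after epoch_2.
    return sorted(found, key=lambda item: (item[0], item[1]))
```

An empty list from `list_snapshots("outputs")` inside the run directory would be consistent with the message above.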

epoch 9 / step 3500 · Checkpoint #6 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-9
epoch 19 / step 7000 · Checkpoint #7 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-19
epoch 29 / step 10500 · Checkpoint #8 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-29
epoch 39 / step 14000 · Checkpoint #9 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-39
epoch 49 / step 17500 · Checkpoint #10 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-49
epoch 59 / step 21000 · Checkpoint #11 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-59
epoch 69 / step 24500 · Checkpoint #12 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-69
epoch 79 / step 28000 · Checkpoint #13 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-79
epoch 89 / step 31500 · Checkpoint #14 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-89
epoch 99 / step 35000 · Checkpoint #15 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-99
epoch 109 / step 38500 · Checkpoint #16 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-109
epoch 119 / step 42000 · Checkpoint #17 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-119
epoch 129 / step 45500 · Checkpoint #18 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-129
epoch 139 / step 49000 · Checkpoint #19 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-139
epoch 149 / step 52500 · Checkpoint #20 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-149
epoch 159 / step 56000 · Checkpoint #21 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-159
epoch 169 / step 59500 · Checkpoint #22 · 2/24/2026, 3:57:40 PM · /cluster/scratch/drothenpiele/SD21/exp_1/checkpoint-169
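
Because the run ended on SIGTERM, the most recent checkpoint is the natural starting point for any follow-up. The sketch below picks the highest-numbered `checkpoint-N` directory under an experiment folder. The helper name `latest_checkpoint` is hypothetical, the numeric suffix is assumed to be the epoch index as in the list above, and whether train_joint.py can resume from such a directory is not stated in this dossier.

```python
import re
from pathlib import Path

# Matches directory names of the form checkpoint-N, as in the list above.
CKPT_RE = re.compile(r"^checkpoint-(\d+)$")


def latest_checkpoint(exp_dir):
    """Return (epoch, path) for the highest-numbered checkpoint-N dir, or None."""
    root = Path(exp_dir)
    if not root.is_dir():
        return None
    best = None
    for child in root.iterdir():
        match = CKPT_RE.match(child.name)
        if match and child.is_dir():
            epoch = int(match.group(1))
            # Compare numerically so checkpoint-109 beats checkpoint-99.
            if best is None or epoch > best[0]:
                best = (epoch, child)
    return best
```

Called on /cluster/scratch/drothenpiele/SD21/exp_1, this would select checkpoint-169, provided the listed directories still exist on scratch storage.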
Producer launch exports are available. Manage launch-owned exports here for quick reference.

Inherited Launch Exports

These exports are published by the run's producer launch.

Published

The producer launch does not publish any exports yet.

Run-Owned Exports

Publish direct filesystem paths here.

Published

No run-owned exports are published yet.

Raw Artifacts

Run-owned exports are typically direct paths, so there are no captured artifacts to publish from here.

Euler View - ML Experiment Monitor