Run Dossier

Training run for SD 2.1 (Mari).

8 metrics · 5 train · 7 val

crashed Jun 28, 12:35
Duration: 27s
Python 3.11.7
train_joint.py --config /cluster/home/drothenpiele/models/stable_diffusion_mari/mari/.euler-launches/euler_launch_f2744a49-531c-4955-b798-dfd400c569b2/joint_vkitti.yaml

lr 4e-5
batch 16
epochs 200
precision fp16
model stable-diffusion-2-1
seed 42
wd 0
grad accum 1
warmup 0.05000
ema no
grad clip 1
ID 4821650
mari/runs/2026-06-28_12-35-36_85a3
Error
OutOfMemoryError: CUDA out of memory. Tried to allocate 90.00 MiB. GPU 0 has a total capacity of 79.25 GiB of which 43.31 MiB is free. Process 2782667 has 43.29 GiB memory in use. Including non-PyTorc...
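
The figures in the error above can be pulled out and compared with a few lines of Python. This is only a sketch: the helper name `parse_oom` and the dictionary keys are made up for illustration, and the regular expressions assume the wording shown in this particular message. Because the message is truncated on this page, the remainder computed at the end is simply the memory that the visible text does not attribute to Process 2782667; the cut-off tail may account for it.

```python
import re

# The message as shown on this page, up to the point where it is cut off.
MESSAGE = (
    "OutOfMemoryError: CUDA out of memory. Tried to allocate 90.00 MiB. "
    "GPU 0 has a total capacity of 79.25 GiB of which 43.31 MiB is free. "
    "Process 2782667 has 43.29 GiB memory in use."
)

UNIT_TO_GIB = {"MiB": 1 / 1024, "GiB": 1.0}

# Hypothetical key names; the patterns follow the wording of the message above.
PATTERNS = {
    "requested_gib": r"Tried to allocate ([\d.]+) (MiB|GiB)",
    "total_gib": r"total capacity of ([\d.]+) (MiB|GiB)",
    "free_gib": r"of which ([\d.]+) (MiB|GiB) is free",
    "process_gib": r"Process \d+ has ([\d.]+) (MiB|GiB) memory in use",
}


def parse_oom(message):
    """Return the sizes found in the message, converted to GiB."""
    stats = {}
    for key, pattern in PATTERNS.items():
        match = re.search(pattern, message)
        if match:
            value, unit = match.groups()
            stats[key] = float(value) * UNIT_TO_GIB[unit]
    return stats


stats = parse_oom(MESSAGE)

# Memory that is neither free nor attributed to the listed process.
unattributed_gib = stats["total_gib"] - stats["free_gib"] - stats["process_gib"]

for key, value in stats.items():
    print(f"{key}: {value:.3f} GiB")
print(f"unattributed_gib: {unattributed_gib:.3f} GiB")
```

The requested 90 MiB is small next to the 79.25 GiB capacity; the allocation failed because only about 43 MiB was still free on the device.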
Metrics 8/8
sys.train 4
gpu_mem_total_gb
gpu_mem_used_gb
gpu_mem_util_pct
gpu_util_pct
Legacy 4
data/camera 3
enabled/any/source
fields/emitted
rays/emitted
train/camera/valid/fraction
[Charts for run 2026-06-28_12-35-36_85a3, each with smoothed and raw series:
Gpu Mem Total Gb, Gpu Mem Used Gb, Gpu Mem Util Pct, Gpu Util Pct (train and val);
Data/Camera Enabled Any Source, Data/Camera Fields Emitted, Data/Camera Rays Emitted (val only);
Train/Camera Valid Fraction (train only).]

No output snapshots found for this run.

Outputs are generated during training and saved to outputs/epoch_N_step_M/ directories.
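
To check by hand whether a run directory contains any snapshots, a short script along these lines could be used. It is a sketch under assumptions: `find_snapshots` is a hypothetical helper, not part of the monitor, and it assumes that N and M in `epoch_N_step_M` are non-negative integers and that `outputs/` sits directly under the run directory.

```python
import re
from pathlib import Path

# Assumed naming scheme: outputs/epoch_<N>_step_<M>/ with integer N and M.
SNAPSHOT_RE = re.compile(r"^epoch_(\d+)_step_(\d+)$")


def find_snapshots(run_dir):
    """Return (epoch, step, path) tuples for snapshot directories, sorted by epoch then step."""
    outputs = Path(run_dir) / "outputs"
    if not outputs.is_dir():
        return []  # no outputs/ directory at all, e.g. the run crashed early
    found = []
    for child in outputs.iterdir():
        match = SNAPSHOT_RE.match(child.name)
        if child.is_dir() and match:
            found.append((int(match.group(1)), int(match.group(2)), child))
    # Sort numerically so that epoch_10 comes after epoch_2.
    return sorted(found, key=lambda item: (item[0], item[1]))


if __name__ == "__main__":
    # Run directory as listed on this page, relative to the current directory.
    for epoch, step, path in find_snapshots("mari/runs/2026-06-28_12-35-36_85a3"):
        print(epoch, step, path)
```

An empty result for a run that crashed after 27 seconds would be consistent with the message above.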

Producer launch exports are available. Manage launch-owned exports here for quick reference.

Inherited Launch Exports

These exports are published by the run's producer launch.

Published

The producer launch does not publish any exports yet.

Run-Owned Exports

Publish direct filesystem paths here.

Published

No run-owned exports are published yet.

Raw Artifacts

Run-owned exports are typically direct paths, so there are no captured artifacts to publish from here.

Euler View - ML Experiment Monitor