training-check

Periodically check WandB metrics during training to catch problems early (NaN, loss divergence, idle GPUs). Avoids wasting GPU hours on broken runs. Use when training is running and you want automated health checks.

Maintainer: wanshuiyin
Updated: 3/25/2026
Stars: 6131
Forks: 556
Installation and usage

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use training-check
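
The three failure modes named in the description (NaN, loss divergence, idle GPUs) can be illustrated with a small Python sketch. This is not the skill's actual implementation: the function name `check_training_health`, the thresholds, and the window size are all hypothetical, and the sketch assumes the loss and GPU-utilization samples have already been fetched (for example from WandB) into plain lists. It also assumes a positive loss that should decrease over time.

```python
import math
from typing import List, Sequence


def check_training_health(
    losses: Sequence[float],
    gpu_utilization: Sequence[float],
    divergence_factor: float = 3.0,  # hypothetical threshold
    idle_threshold: float = 5.0,     # percent utilization, hypothetical
    window: int = 5,                 # number of recent samples to inspect
) -> List[str]:
    """Return labels for the problems found in the given metric samples."""
    problems: List[str] = []

    # 1. NaN or infinite loss anywhere in the history.
    if any(not math.isfinite(x) for x in losses):
        problems.append("nan_loss")

    # 2. Divergence: the mean of the most recent window is far above
    #    the best (lowest) loss seen before that window.
    finite = [x for x in losses if math.isfinite(x)]
    if len(finite) > window:
        earlier_best = min(finite[:-window])
        recent_mean = sum(finite[-window:]) / window
        if earlier_best > 0 and recent_mean > divergence_factor * earlier_best:
            problems.append("loss_divergence")

    # 3. Idle GPU: every sample in the most recent window is below threshold.
    recent_util = list(gpu_utilization[-window:])
    if len(recent_util) == window and all(u < idle_threshold for u in recent_util):
        problems.append("idle_gpu")

    return problems


if __name__ == "__main__":
    report = check_training_health(
        losses=[1.0, 0.5, 0.4, 5.0, 6.0, 7.0, 8.0, 9.0],
        gpu_utilization=[0, 1, 0, 2, 0],
    )
    print(report)
```

Returning a list of labels rather than raising on the first problem lets a periodic check report several issues at once; an empty list means none of the three checks fired.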