home/categories/health-fitness/wanshuiyin-auto-claude-code-research-in-sleep-skills-skills-codex-training-check-skill-md
health-fitnessbusiness

training-check

Periodically check WandB metrics during training to catch problems early (NaN, loss divergence, idle GPUs). Avoids wasting GPU hours on broken runs. Use when training is running and you want automated health checks.

wanshuiyin
maintainer
wanshuiyin
更新于 3/25/2026
星标
6131
分支
556
quick start

Installation and usage

Periodically check WandB metrics during training to catch problems early (NaN, loss divergence, idle GPUs). Avoids wasting GPU hours on broken runs. Use when training is running and you want automated health checks.

安装
$ install --globalskills.sh
使用

安装后,您可以通过在终端运行以下命令来使用此技能:

skills use training-check