home/categories/machine-learning/erland366-transformers-playground-codex-skills-gradient-accumulation-deterministic-skill-md
machine-learningdata-ai

gradient-accumulation-deterministic

Implement gradient accumulation that produces bit-identical results to standard batching. Use when: comparing GA vs non-GA runs, debugging training reproducibility.

Erland366
maintainer
Erland366
更新于 1/15/2026
星标
0
分支
1
quick start

Installation and usage

Implement gradient accumulation that produces bit-identical results to standard batching. Use when: comparing GA vs non-GA runs, debugging training reproducibility.

安装
$ install --globalskills.sh
使用

安装后,您可以通过在终端运行以下命令来使用此技能:

skills use gradient-accumulation-deterministic