home/categories/machine-learning/erland366-transformers-playground-codex-skills-gradient-accumulation-deterministic-skill-md
machine-learningdata-ai

gradient-accumulation-deterministic

Implement gradient accumulation that produces bit-identical results to standard batching. Use when: comparing GA vs non-GA runs, debugging training reproducibility.

Erland366
maintainer
Erland366
更新於 1/15/2026
星標
0
分支
1
quick start

Installation and usage

Implement gradient accumulation that produces bit-identical results to standard batching. Use when: comparing GA vs non-GA runs, debugging training reproducibility.

安裝
$ install --globalskills.sh
使用

安裝後,您可以透過在終端機執行以下指令來使用此技能:

skills use gradient-accumulation-deterministic