home/categories/machine-learning/erland366-transformers-playground-codex-skills-gradient-accumulation-deterministic-skill-md
machine-learningdata-ai
gradient-accumulation-deterministic
Implement gradient accumulation that produces bit-identical results to standard batching. Use when: comparing GA vs non-GA runs, debugging training reproducibility.
maintainer
Erland366
์
๋ฐ์ดํธ๋จ 1/15/2026
์คํ
0
ํฌํฌ
1
quick start
Installation and usage
Implement gradient accumulation that produces bit-identical results to standard batching. Use when: comparing GA vs non-GA runs, debugging training reproducibility.
์ค์น
$ install --globalskills.sh
์ฌ์ฉ๋ฒ
์ค์น ํ ํฐ๋ฏธ๋์์ ๋ค์ ๋ช ๋ น์ ์คํํ์ฌ ์ด ์คํฌ์ ์ฌ์ฉํ ์ ์์ต๋๋ค:
skills use gradient-accumulation-deterministic