home/categories/machine-learning/davila7-claude-code-templates-cli-tool-components-skills-ai-research-distributed-training-deepspeed-skill-md
machine-learningdata-ai
deepspeed
Expert guidance for distributed training with DeepSpeed - ZeRO optimization stages, pipeline parallelism, FP16/BF16/FP8, 1-bit Adam, sparse attention
maintainer
davila7
Updated 1/20/2026
Stars
17577
Forks
1576
quick start
Installation and usage
Expert guidance for distributed training with DeepSpeed - ZeRO optimization stages, pipeline parallelism, FP16/BF16/FP8, 1-bit Adam, sparse attention
Installation
$ install --globalskills.sh
Usage
Once installed, you can use this skill by running the following command in your terminal:
skills use deepspeed