framework-internals
development
torch-tensor-parallelism
Guidance for implementing tensor parallelism in PyTorch, including ColumnParallelLinear and RowParallelLinear layers. This skill should be used when implementing distributed tensor parallel operations, sharding linear layers across multiple GPUs, or simulating collective operations like all-gather and all-reduce for parallel computation.
maintainer
letta-ai
Updated 1/19/2026
Stars
31
Forks
5
quick start
Installation and usage
Install
$ install --globalskills.sh
Usage
After installation, you can use this skill by running the following command in your terminal:
skills use torch-tensor-parallelism
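To make the description concrete, here is a minimal single-process sketch of the column-parallel and row-parallel pattern it mentions. It is not taken from the skill itself. The sizes, the `world_size` of 2, and every variable name are assumptions chosen for illustration. The collectives are simulated with `torch.cat` (all-gather) and a sum (all-reduce), with no `torch.distributed` process group.

```python
import torch

torch.manual_seed(0)

# Hypothetical sizes, chosen only for illustration.
world_size = 2                      # number of simulated ranks
batch, d_in, d_hidden, d_out = 4, 8, 16, 6

x = torch.randn(batch, d_in, dtype=torch.float64)
# Weights use the nn.Linear layout: (out_features, in_features).
w1 = torch.randn(d_hidden, d_in, dtype=torch.float64)
w2 = torch.randn(d_out, d_hidden, dtype=torch.float64)

# Unsharded reference computation.
ref_hidden = x @ w1.T
ref_out = ref_hidden @ w2.T

# Column-parallel layer: shard w1 along out_features (dim 0).
# Each simulated rank produces a slice of the hidden features.
w1_shards = torch.chunk(w1, world_size, dim=0)
hidden_shards = [x @ w.T for w in w1_shards]

# Simulated all-gather: concatenate the per-rank slices on the feature dim.
gathered_hidden = torch.cat(hidden_shards, dim=-1)

# Row-parallel layer: shard w2 along in_features (dim 1).
# Each rank consumes its own hidden slice and yields a partial sum.
w2_shards = torch.chunk(w2, world_size, dim=1)
partial_outs = [h @ w.T for h, w in zip(hidden_shards, w2_shards)]

# Simulated all-reduce: sum the partial outputs across ranks.
out = torch.stack(partial_outs).sum(dim=0)

col_ok = torch.allclose(gathered_hidden, ref_hidden)
row_ok = torch.allclose(out, ref_out)
print(col_ok, row_ok)
```

Feeding each rank's hidden slice straight into the matching row-parallel shard means the intermediate all-gather is needed only to check the column-parallel result. In a real multi-GPU setup, the sum would be replaced by an actual all-reduce across processes.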