home/categories/framework-internals/letta-ai-skills-letta-benchmarks-trajectory-feedback-torch-tensor-parallelism-skill-md
framework-internalsdevelopment

torch-tensor-parallelism

Guidance for implementing tensor parallelism in PyTorch, including ColumnParallelLinear and RowParallelLinear layers. This skill should be used when implementing distributed tensor parallel operations, sharding linear layers across multiple GPUs, or simulating collective operations like all-gather and all-reduce for parallel computation.

letta-ai
maintainer
letta-ai
更新于 1/19/2026
星标
31
分支
5
quick start

Installation and usage

Guidance for implementing tensor parallelism in PyTorch, including ColumnParallelLinear and RowParallelLinear layers. This skill should be used when implementing distributed tensor parallel operations, sharding linear layers across multiple GPUs, or simulating collective operations like all-gather and all-reduce for parallel computation.

安装
$ install --globalskills.sh
使用

安装后,您可以通过在终端运行以下命令来使用此技能:

skills use torch-tensor-parallelism