home/categories/framework-internals/letta-ai-skills-letta-benchmarks-trajectory-feedback-torch-tensor-parallelism-skill-md
framework-internalsdevelopment

torch-tensor-parallelism

Guidance for implementing tensor parallelism in PyTorch, including ColumnParallelLinear and RowParallelLinear layers. This skill should be used when implementing distributed tensor parallel operations, sharding linear layers across multiple GPUs, or simulating collective operations like all-gather and all-reduce for parallel computation.

letta-ai
maintainer
letta-ai
Updated 1/19/2026
Stars
31
Forks
5
quick start

Installation and usage

Guidance for implementing tensor parallelism in PyTorch, including ColumnParallelLinear and RowParallelLinear layers. This skill should be used when implementing distributed tensor parallel operations, sharding linear layers across multiple GPUs, or simulating collective operations like all-gather and all-reduce for parallel computation.

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use torch-tensor-parallelism