home/categories/machine-learning/davila7-claude-code-templates-cli-tool-components-skills-ai-research-mechanistic-interpretability-saelens-skill-md
machine-learningdata-ai

sparse-autoencoder-training

Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use when discovering interpretable features, analyzing superposition, or studying monosemantic representations in language models.

davila7
maintainer
davila7
Updated 1/20/2026
Stars
17577
Forks
1576
quick start

Installation and usage

Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use when discovering interpretable features, analyzing superposition, or studying monosemantic representations in language models.

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use sparse-autoencoder-training