home/categories/bioinformatics/orchestra-research-ai-research-skills-04-mechanistic-interpretability-saelens-skill-md
bioinformaticsresearch
sparse-autoencoder-training
Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use when discovering interpretable features, analyzing superposition, or studying monosemantic representations in language models.
maintainer
Orchestra-Research
Updated 12/17/2025
Stars
6563
Forks
515
quick start
Installation and usage
Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use when discovering interpretable features, analyzing superposition, or studying monosemantic representations in language models.
Installation
$ install --globalskills.sh
Usage
Once installed, you can use this skill by running the following command in your terminal:
skills use sparse-autoencoder-training