cqa-06-assembly-scope
Validates assembly scope limits for nested assemblies and total includes. Use when assemblies grow too large.
ab-testing
When the user wants to plan, design, or implement an A/B test or experiment. Also use when the user mentions "A/B test," "split test," "experiment," "test this change," "variant copy," "multivariate test," or "hypothesis." For tracking implementation, see analytics-tracking.
review-tests
Assess the current coverage and patterns across all tests in the project. Use when asked to identify opportunities to improve how well the system's behaviour is confirmed by its tests.
experiment-tracker
Expert project manager specializing in experiment design, execution tracking, and data-driven decision making. Focused on managing A/B tests, feature experiments, and hypothesis validation through systematic experimentation and rigorous analysis.
experiment-review
Analyze A/B tests and experiments with statistical rigor, assess significance, perform segment analysis, and produce a clear ship/kill/extend recommendation.
monitor-experiments
Monitors all active and recently completed experiments across Amplitude projects, triages them by importance, then runs deep analysis and reporting on the most impactful ones. Use when the user asks to "check on experiments", "experiment status", "experiment review", "what experiments are running", or wants a periodic experiment health report.
analyze-experiments
Designs A/B tests with proper metrics and variants, analyzes running or completed experiments, and interprets results with statistical rigor. Use when setting up experiments, checking experiment status, analyzing results, or making ship decisions.
scan-reviewer
Post-diagnostic verification: checks that findings address the original concern, skills ran as planned, and results are internally consistent.
particle-filter
Sequential Monte Carlo (particle filters) for real-time probability updating. Bootstrap filter with systematic resampling, ESS monitoring, logit-space state evolution, and credible intervals. For live event tracking like election night or real-time market monitoring.
ab-test-dashboard
Build and analyze A/B test dashboards, calculate statistical significance, and track experiment results.
scientific-validation
Scientific method for validating claims with pre-registration, power analysis, statistical rigor, and Bayesian methods. Use when testing hypotheses, running experiments, or validating claims from papers. TRIGGER when: validate, hypothesis, experiment, backtest, evidence, statistical test. DO NOT TRIGGER when: routine coding, config changes, documentation, non-experimental tasks.
irt-psychometrics-edu
Use this Skill for educational measurement with IRT: 2PL/3PL calibration, item information function, DIF detection, and test equating (Stocking-Lord).
data-version-control
Data version control with DVC covering pipeline tracking, remote storage, experiment comparison, and reproducible ML workflows for research.
librosa-audio
Music information retrieval with librosa: tempo, chroma, MFCCs, spectral features, onset detection, harmonic-percussive separation, pitch, and k-NN similarity search.
music21-score
Use this Skill for computational musicology with music21: score analysis, harmonic reduction, melodic contour, counterpoint checking, and corpus comparison.
audio-quality-check
Analyze audio recording quality: echo detection, loudness, speech intelligibility, SNR, spectral analysis. Use when the user wants to check a recording's quality, detect echo or duplication in audio files, measure speech clarity, compare original vs processed audio, diagnose why a recording sounds bad, or analyze audio tracks from Blackbox or any call recording app. Triggers on audio quality, recording analysis, echo detection, check recording, sound quality, analyze audio, speech quality, PESQ, STOI, loudness, SNR, audio diagnostics, recording sounds bad, echo in recording, audio duplication.
experimental-design
Design rigorous experiments: sample size calculation, randomization strategies, pre-registration, and A/B test duration estimation.