load-anomaly-guard
Detects unsafe training load spikes (>20-30% week-over-week) and emits safety flags. Use in nightly background jobs or when reviewing weekly training volume with conservative adjustment recommendations.
atft-training
Run and monitor ATFT-GAT-FAN training loops, hyper-parameter sweeps, and safety modes on A100 GPUs.
train-debug
Interactive diagnostic workflow for training problems. Use when training is failing, loss is stuck, gradients explode, NaN occurs, or convergence is poor.
gradient-accumulation-deterministic
Implement gradient accumulation that produces bit-identical results to standard batching. Use when: comparing GA vs non-GA runs, debugging training reproducibility.
memory-write
Persist a decision or learning to Mother-Harness long-term memory
speckit-specify
Create or update the feature specification from a natural language feature description.
experiment-logger
Log ML experiments with hyperparameters, metrics, and plots; human interprets results and plans next experiments
categorical-encoder
Эксперт categorical encoding. Используй для ML feature engineering, one-hot, target encoding и embeddings.
machine-learning-engineer
Use when user needs ML model deployment, production serving infrastructure, optimization strategies, and real-time inference systems. Designs and implements scalable ML systems with focus on reliability and performance.
paper-replication
深度学习论文复现的skill。可以读取pdf并解析其中的图片、公式、表格等内容,然后参考下面的prompts。触发词包括"帮我复现这篇论文"、"论文复现"、"实现这个模型",或当用户提供深度学习论文需要转化为PyTorch代码时。
calibration
Applies decision thresholds for high-confidence inputs or enforces conservative safety margins for low-confidence cases
moai-lang-python
Python best practices with modern frameworks, AI/ML integration, and performance optimization for 2025
ground-truth-management
Comprehensive guide to creating, managing, and maintaining ground truth datasets for AI evaluation including annotation, quality control, and versioning
scaling-analysis
Run scaling experiments to understand model/data/compute relationships. Use when investigating scaling laws, compute-optimal training, or model size decisions.
quant-resource-patterns
Follow these patterns when implementing quant domain resources like Dataset, Signal, Alpha, Portfolio, Strategy, Universe, Backtest, or MonitoringRun in OptAIC. Use for creating DB models, DTOs, services, and tests for trading-specific entities.
time-series-models
Bayesian time series models including AR, MA, ARMA, state-space models, and dynamic linear models in Stan and JAGS.
production-api-tester
Live testing and validation of production research API for strategy optimization loops
feature-extraction
Extracts vector embeddings for fairness analysis using MobileNetV3 with FairDisCo disentanglement
markov-regime-features
Debugging constant Markov regime features in RL observations - when HMM probabilities show uniform values instead of dynamic regime estimates
mlops-patterns
Follow these patterns when implementing MLOps features in OptAIC. Use for ML model definitions (5-component structure), model instances, training/inference pipelines, model registry, and monitoring. Covers signal models, macro regime models, relevance models, and signal combining/filtering models.