Machine Learning

Training models and neural networks.

1987 skills

machine-learning

llm-ai-coding-agent

Optimized for model inference, regeneration, and debugging

glennfriend

data-ai

open

machine-learning

gradient-accumulation-deterministic

Implement gradient accumulation that produces bit-identical results to standard batching. Use when: comparing GA vs non-GA runs, debugging training reproducibility.

Erland366

data-ai

open

machine-learning

vision

Vision model fine-tuning with FastVisionModel. Covers Pixtral, Ministral VL training, UnslothVisionDataCollator, image+text datasets, and vision-specific LoRA configuration.

atrawog

data-ai

open

machine-learning

memory-write

Persist a decision or learning to Mother-Harness long-term memory

rcmiller01

data-ai

open

machine-learning

speckit-specify

Create or update the feature specification from a natural language feature description.

Obsidian-Owl

data-ai

open

machine-learning

experiment-logger

Log ML experiments with hyperparameters, metrics, and plots; a human interprets the results and plans the next experiments

hmyuuu

data-ai

open

machine-learning

categorical-encoder

Expert in categorical encoding. Use for ML feature engineering, one-hot encoding, target encoding, and embeddings.

dengineproblem

data-ai

open

machine-learning

Use when user needs ML model deployment, production serving infrastructure, optimization strategies, and real-time inference systems. Designs and implements scalable ML systems with focus on reliability and performance.

404kidwiz

data-ai

open

machine-learning

paper-replication

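A skill for replicating deep learning papers. It can read a PDF and parse the figures, formulas, tables, and other content in it, then refer to the prompts below. Trigger phrases include "help me replicate this paper", "paper replication", and "implement this model"; it also applies when the user provides a deep learning paper that needs to be turned into PyTorch code.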

bahayonghang

data-ai

open

machine-learning

calibration

Applies decision thresholds for high-confidence inputs or enforces conservative safety margins for low-confidence cases

do-ops885

data-ai

open

machine-learning

moai-lang-python

Python best practices with modern frameworks, AI/ML integration, and performance optimization for 2025

kivo360

data-ai

open

machine-learning

ground-truth-management

Comprehensive guide to creating, managing, and maintaining ground truth datasets for AI evaluation including annotation, quality control, and versioning

AmnadTaowsoam

data-ai

open

machine-learning

train

Execute a neural network training run with mandatory monitoring and best-practice defaults. Use when user wants to train a model, start training, or run a training job.

rHedBull

data-ai

open

machine-learning

scaling-analysis

Run scaling experiments to understand model/data/compute relationships. Use when investigating scaling laws, compute-optimal training, or model size decisions.

rHedBull

data-ai

open

machine-learning

quant-resource-patterns

Follow these patterns when implementing quant domain resources like Dataset, Signal, Alpha, Portfolio, Strategy, Universe, Backtest, or MonitoringRun in OptAIC. Use for creating DB models, DTOs, services, and tests for trading-specific entities.

colingwuyu

data-ai

open

machine-learning

time-series-models

Bayesian time series models including AR, MA, ARMA, state-space models, and dynamic linear models in Stan and JAGS.

choxos

data-ai

open

machine-learning

production-api-tester

Live testing and validation of production research API for strategy optimization loops

mberto10

data-ai

open

machine-learning

feature-extraction

Extracts vector embeddings for fairness analysis using MobileNetV3 with FairDisCo disentanglement

do-ops885

data-ai

open

machine-learning

markov-regime-features

Debugging constant Markov regime features in RL observations - when HMM probabilities show uniform values instead of dynamic regime estimates

smith6jt-cop

data-ai

open

machine-learning

mlops-patterns

Follow these patterns when implementing MLOps features in OptAIC. Use for ML model definitions (5-component structure), model instances, training/inference pipelines, model registry, and monitoring. Covers signal models, macro regime models, relevance models, and signal combining/filtering models.

colingwuyu

data-ai

open

machine-learning

exps-logistic

Documentation for the logistic regression MI estimation experiment (exps_logistic)

TerryTong-Git

data-ai

open

machine-learning

network-architecture-sizing

PPO network architecture sizing for trading models. Trigger: (1) model files are unexpectedly small/large, (2) choosing hidden_dims for training, (3) balancing model capacity vs inference speed.

smith6jt-cop

data-ai

open

machine-learning

refactorscikit-learn

Refactor Scikit-learn and machine learning code to improve maintainability, reproducibility, and adherence to best practices. This skill transforms working ML code into production-ready pipelines that prevent data leakage and ensure reproducible results. It addresses preprocessing outside pipelines, missing random_state parameters, improper cross-validation, and custom transformers not following sklearn API conventions. Implements proper Pipeline and ColumnTransformer patterns, systematic hyperparameter tuning, and appropriate evaluation metrics.

SnakeO

data-ai

open

machine-learning

training-archive-gating

Mandatory training archive with model gating (APPROVED/REVIEW/DROP). Trigger when: (1) training run completes, (2) need to decide which models to deploy, (3) want historical training reference, (4) need checkpoint recommendations for overfitting.

smith6jt-cop

data-ai

open
