Data & AI

Machine learning, LLMs, and data processing.

9743 skills

machine-learning

mhc

Implements Manifold-Constrained Hyper-Connections (mHC) to solve residual connection issues using Doubly Stochastic Matrices.

yonesuke

data-ai

open

machine-learning

Detects unsafe training load spikes (>20-30% week-over-week) and emits safety flags. Use in nightly background jobs or when reviewing weekly training volume with conservative adjustment recommendations.

nadavyigal

data-ai

open

machine-learning

atft-training

Run and monitor ATFT-GAT-FAN training loops, hyper-parameter sweeps, and safety modes on A100 GPUs.

wer-inc

data-ai

open

machine-learning

train-debug

Interactive diagnostic workflow for training problems. Use when training is failing, loss is stuck, gradients explode, NaN occurs, or convergence is poor.

rHedBull

data-ai

open

machine-learning

llm-ai-coding-agent

Optimized for model inference, regeneration, and debugging.

glennfriend

data-ai

open

machine-learning

gradient-accumulation-deterministic

Implement gradient accumulation that produces bit-identical results to standard batching. Use when: comparing GA vs non-GA runs, debugging training reproducibility.

Erland366

data-ai

open

machine-learning

vision

Vision model fine-tuning with FastVisionModel. Covers Pixtral, Ministral VL training, UnslothVisionDataCollator, image+text datasets, and vision-specific LoRA configuration.

atrawog

data-ai

open

machine-learning

memory-write

Persist a decision or learning to Mother-Harness long-term memory

rcmiller01

data-ai

open

machine-learning

speckit-specify

Create or update the feature specification from a natural language feature description.

Obsidian-Owl

data-ai

open

machine-learning

experiment-logger

Log ML experiments with hyperparameters, metrics, and plots; human interprets results and plans next experiments

hmyuuu

data-ai

open

machine-learning

categorical-encoder

Categorical encoding expert. Use for ML feature engineering, one-hot encoding, target encoding, and embeddings.

dengineproblem

data-ai

open

machine-learning

machine-learning-engineer

Use when user needs ML model deployment, production serving infrastructure, optimization strategies, and real-time inference systems. Designs and implements scalable ML systems with focus on reliability and performance.

404kidwiz

data-ai

open

machine-learning

paper-replication

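A skill for replicating deep learning papers. It can read a PDF and parse its figures, equations, tables, and other content, then refer to the prompts below. Trigger phrases include "帮我复现这篇论文" ("help me replicate this paper"), "论文复现" ("paper replication"), and "实现这个模型" ("implement this model"); it also applies when the user provides a deep learning paper that needs to be converted into PyTorch code.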

bahayonghang

data-ai

open

machine-learning

calibration

Applies decision thresholds for high-confidence inputs or enforces conservative safety margins for low-confidence cases

do-ops885

data-ai

open

machine-learning

moai-lang-python

Python best practices with modern frameworks, AI/ML integration, and performance optimization for 2025

kivo360

data-ai

open

machine-learning

ground-truth-management

Comprehensive guide to creating, managing, and maintaining ground truth datasets for AI evaluation including annotation, quality control, and versioning

AmnadTaowsoam

data-ai

open

machine-learning

train

Execute a neural network training run with mandatory monitoring and best-practice defaults. Use when user wants to train a model, start training, or run a training job.

rHedBull

data-ai

open

machine-learning

scaling-analysis

Run scaling experiments to understand model/data/compute relationships. Use when investigating scaling laws, compute-optimal training, or model size decisions.

rHedBull

data-ai

open

machine-learning

quant-resource-patterns

Follow these patterns when implementing quant domain resources like Dataset, Signal, Alpha, Portfolio, Strategy, Universe, Backtest, or MonitoringRun in OptAIC. Use for creating DB models, DTOs, services, and tests for trading-specific entities.

colingwuyu

data-ai

open

machine-learning

time-series-models

Bayesian time series models including AR, MA, ARMA, state-space models, and dynamic linear models in Stan and JAGS.

choxos

data-ai

open

machine-learning

production-api-tester

Live testing and validation of production research API for strategy optimization loops

mberto10

data-ai

open

machine-learning

feature-extraction

Extracts vector embeddings for fairness analysis using MobileNetV3 with FairDisCo disentanglement

do-ops885

data-ai

open

machine-learning

markov-regime-features

Debugging constant Markov regime features in RL observations - when HMM probabilities show uniform values instead of dynamic regime estimates

smith6jt-cop

data-ai

open

machine-learning

mlops-patterns

Follow these patterns when implementing MLOps features in OptAIC. Use for ML model definitions (5-component structure), model instances, training/inference pipelines, model registry, and monitoring. Covers signal models, macro regime models, relevance models, and signal combining/filtering models.

colingwuyu

data-ai

open
