domain cluster

Data & AI

Machine learning, LLMs, and data processing.

9743 مهارةall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
data-analysis
0

generating-ralph-charts

Generates RALPH charts (factor-level analysis diagrams) for HAYST method test design. Use when designing combination tests, identifying test coverage gaps, or analyzing system behavior factors.

sato-dev1234
sato-dev1234
data-ai
open
data-engineering
0

data-quality-audit

Comprehensive data quality assessment against defined business rules and constraints. Use when validating data against expected schemas, checking referential integrity across tables, or auditing data pipeline outputs before production use.

nimrodfisher
nimrodfisher
data-ai
open
data-analysis
0

marimo-editor

This skill should be used when working with marimo reactive notebooks for data science and analytics. Triggers include: - Creating new marimo notebooks - Converting Jupyter notebooks to marimo - Editing existing marimo notebooks - Implementing reactive patterns and UI components - Building interactive data visualizations with marimo

dakesan
dakesan
data-ai
open
data-analysis
0

analyzing-nba-stats

Fetches and processes NBA player and team statistics. Use when the user wants to analyze basketball data for the sports picker model.

KevinGastelum
KevinGastelum
data-ai
open
data-analysis
0

data-analysis

分析结构化数据并生成统计报告和可视化建议

w2112515
w2112515
data-ai
open
data-analysis
0

matlab

MATLAB and GNU Octave numerical computing for matrix operations, data analysis, visualization, and scientific computing. Use when writing MATLAB/Octave scripts for linear algebra, signal processing, image processing, differential equations, optimization, statistics, or creating scientific visualizations. Also use when the user needs help with MATLAB syntax, functions, or wants to convert between MATLAB and Python code. Scripts can be executed with MATLAB or the open-source GNU Octave interpreter.

MAF2414
MAF2414
data-ai
open
data-engineering
0

wcdb

Use when working with wcdb

MemoryReload
MemoryReload
data-ai
open
data-engineering
0

python-data-transform

Transform, clean, and reshape data using pandas and numpy for ETL and data preprocessing. WHEN: Manipulating DataFrames, cleaning datasets, reshaping data (pivot, melt), merging/joining tables, data normalization, CSV/Excel processing. WHEN NOT: Creating Excel files with formatting (use python-xlsx), building APIs (use python-backend), statistical modeling.

LounisBou
LounisBou
data-ai
open
data-engineering
0

convex-migration

guidance on how to properly do data migrations in Convex

ianwatts22
ianwatts22
data-ai
open
data-engineering
0

lineage-and-provenance

See the main Data Lineage skill for comprehensive coverage of data lineage tracking and provenance.

AmnadTaowsoam
AmnadTaowsoam
data-ai
open
data-engineering
0

polaris-catalog

ALWAYS USE when configuring Polaris catalog, managing namespaces, or setting up credentials in floe-platform. Use IMMEDIATELY when integrating DuckDB via dbt-duckdb plugin, configuring PyIceberg REST catalog, or debugging access control issues. Provides research steps for REST API, OAuth2 authentication, and multi-engine coordination with DuckDB, dbt, and Dagster.

Obsidian-Owl
Obsidian-Owl
data-ai
open
data-engineering
0

agentdb-state-manager

Persistent state management using AgentDB (DuckDB) for workflow analytics and checkpoints. Provides read-only analytics cache synchronized from TODO_*.md files, enabling: - Complex dependency graph queries - Historical workflow metrics - Context checkpoint storage/recovery - State transition analysis Use when: Data gathering and analysis for workflow state tracking Triggers: "analyze workflow", "query state", "checkpoint", "workflow metrics"

stharrold
stharrold
data-ai
open
data-engineering
0

altinity-expert-clickhouse-replication

Diagnose ClickHouse replication health, Keeper connectivity, replica lag, and queue issues. Use for replication lag and readonly replica problems.

Altinity
Altinity
data-ai
open
data-engineering
0

dbt-patterns

Comprehensive guide to dbt (data build tool) patterns, modeling best practices, testing strategies, and production workflows for modern data transformation

AmnadTaowsoam
AmnadTaowsoam
data-ai
open
data-engineering
0

hive-scheduler

How to create scheduled jobs in Hive framework

paralect
paralect
data-ai
open
data-engineering
0

data-governance-and-quality

Data governance strategy, quality validation rules, and data dictionary management for vehicle insurance platform. Use when defining data quality standards, implementing validation rules, managing field mappings, resolving data conflicts, or establishing data governance processes. Covers data cleaning standards, quality metrics, and mapping management.

alongor666
alongor666
data-ai
open
data-engineering
0

execplans

Write and maintain self-contained ExecPlans (execution plans) that a novice can follow end-to-end; use when planning or implementing non-trivial repo changes.

leynos
leynos
data-ai
open
data-engineering
0

analyzing-objectstar

Skill for understanding, editing, analyzing, and migrating TIBCO Objectstar (Object Service Broker) code used in mainframe OTP and batch applications. Activate when user is working with Objectstar rules, asks about mainframe modernization, or legacy 4GL code involving GET, FORALL, or EXCEPTION blocks.

JohnnyVicious
JohnnyVicious
data-ai
open
data-engineering
0

executive-cdo

Executive CDO Agent. 데이터 전략, 데이터 거버넌스, AI/ML 전략을 담당합니다.

shaul1991
shaul1991
data-ai
open
data-engineering
0

test-data-generation-validation

Generate real Cassandra 5.0 test data using Docker containers, export SSTables with proper directory structure, validate parsing against sstabledump, and manage test datasets. Use when working with test data generation, dataset creation, SSTable export, validation, fixture management, or sstabledump comparison.

pmcfadin
pmcfadin
data-ai
open
data-engineering
0

hive-handler

How to create event handlers in Hive framework

paralect
paralect
data-ai
open
data-engineering
0

data-quality-checks-and-validation

Implementing comprehensive data quality checks across the data pipeline to ensure accuracy, completeness, and reliability.

AmnadTaowsoam
AmnadTaowsoam
data-ai
open
data-engineering
0

csv-validator

Validates and fixes BOM CSV files for ECIR tool compatibility. Use when users need to check CSV files before running ECIR comparisons, fix CSV formatting issues, ensure required columns exist, or diagnose why ECIR tool fails to process a CSV file.

CBoser
CBoser
data-ai
open
Previous
Page 343 / 406
Next