domain cluster

Data & AI

Machine learning, LLMs, and data processing.

9743 skillsall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
data-engineering
185

executive-data-storytelling

Transform data into compelling executive narratives using the What/Why/Next framework from Gartner research

majiayu000
majiayu000
data-ai
open
data-engineering
185

faker

Use when writing Vague (.vague) files that need realistic test data using faker generators for names, emails, addresses, dates, and more

majiayu000
majiayu000
data-ai
open
data-engineering
185

flowerpower

Create and manage data pipelines using the FlowerPower framework with Hamilton DAGs and uv. Use when users request creating flowerpower projects, pipelines, Hamilton dataflows, or ask about flowerpower configuration, execution, or CLI commands.

majiayu000
majiayu000
data-ai
open
data-engineering
185

flyway-migrations

Flyway database migrations - use for schema changes, data migrations, version management, and PostgreSQL DDL

majiayu000
majiayu000
data-ai
open
data-engineering
185

frappe-data-migration-generator

Generate data migration scripts for Frappe. Use when migrating data from legacy systems, transforming data structures, or importing large datasets.

majiayu000
majiayu000
data-ai
open
data-engineering
185

fvtt-data-migrations

This skill should be used when moving data between storage locations, changing data structures, renaming fields, or removing deprecated data. Covers schema versioning, safe migration methods, the Foundry unset operator, and idempotent migrations.

majiayu000
majiayu000
data-ai
open
data-engineering
185

gcp-bq-data-loading

Use when loading data into BigQuery from CSV, JSON, Avro, Parquet files, Cloud Storage, or local files. Covers bq load command, source formats, schema detection, incremental loading, and handling parsing errors.

majiayu000
majiayu000
data-ai
open
data-engineering
185

go-docker

Docker containerization for Go applications

majiayu000
majiayu000
data-ai
open
data-engineering
185

grey-haven-data-modeling

Design database schemas for Grey Haven multi-tenant SaaS - SQLModel models, Drizzle schema, multi-tenant isolation with tenant_id and RLS, timestamp fields, foreign keys, indexes, migrations, and relationships. Use when creating database tables.

majiayu000
majiayu000
data-ai
open
data-engineering
185

grey-haven-data-validation

Comprehensive data validation using Pydantic v2 with data quality monitoring and schema alignment for PlanetScale PostgreSQL. Use when implementing API validation, database schema alignment, or data quality assurance. Triggers: 'validation', 'Pydantic', 'schema', 'data quality'.

majiayu000
majiayu000
data-ai
open
data-engineering
185

implement-connector

Implement a Python connector that conforms to the LakeflowConnect interface for data ingestion.

majiayu000
majiayu000
data-ai
open
data-engineering
185

implementing-query-pagination

Implement cursor-based or offset pagination for Prisma queries. Use for datasets 100k+, APIs with page navigation, or infinite scroll/pagination mentions.

majiayu000
majiayu000
data-ai
open
data-engineering
185

instrumentation

Use when defining events, fields, and governance for GTM analytics pipelines.

majiayu000
majiayu000
data-ai
open
data-engineering
185

java-docker

Containerize Java applications - Dockerfile optimization, JVM settings, security

majiayu000
majiayu000
data-ai
open
data-engineering
185

julien-infra-jokers

Complete management for Jokers Hockey website - deployment, build checks, database migrations (Drizzle ORM), and PM2 process management. Use for any Jokers site operation.

majiayu000
majiayu000
data-ai
open
data-engineering
185

laravel-data-writer

Skill for creating and editing Spatie Laravel Data classes following Prowi conventions. Use when working with Data classes, DTOs, or data transfer objects. Enforces proper constructor-based properties, annotation-based validation, and Collection usage.

majiayu000
majiayu000
data-ai
open
data-engineering
185

litestream-coder

This skill guides configuring Litestream for continuous SQLite backup in Rails 8+ apps. Use when setting up production backups for SQLite databases (Solid Queue, Solid Cache, Solid Cable).

majiayu000
majiayu000
data-ai
open
data-engineering
185

local-first

Enforces local-first architecture principles for Breath of Now. Use this skill when working with data, state management, or sync features. Ensures IndexedDB (Dexie.js) is always the source of truth.

majiayu000
majiayu000
data-ai
open
data-engineering
185

lockplane

Use Lockplane for safe database schema management - define schemas in .lp.sql files, validate, and apply with shadow DB testing

majiayu000
majiayu000
data-ai
open
data-engineering
185

azure-kusto

Query and analyze data in Azure Data Explorer (Kusto/ADX) using KQL for log analytics, telemetry, and time series analysis. WHEN: KQL queries, Kusto database queries, Azure Data Explorer, ADX clusters, log analytics, time series data, IoT telemetry, anomaly detection.

microsoft
microsoft
data-ai
open
machine-learning
185

account-aware-training

Add account state (P&L, win rate, drawdown) to RL observations + drawdown penalty in rewards. Trigger when: (1) model needs account awareness, (2) training should penalize drawdowns, (3) upgrading obs_dim 5300→5600.

majiayu000
majiayu000
data-ai
open
machine-learning
185

add-awesome-tool

This skill should be used when analyzing a link to an AI tool and adding it to the awesome-ai-tools readme with proper categorization

majiayu000
majiayu000
data-ai
open
machine-learning
185

agent-model-selection

Guidelines for selecting appropriate AI model (Sonnet vs Haiku) based on task complexity, ensuring cost efficiency while maintaining quality. Use when assigning work.

majiayu000
majiayu000
data-ai
open
Previous
Page 172 / 406
Next