skills.homes capability registry

Data & AI

Machine learning, LLMs, and data processing.

9743 skills

data-engineering

1

field-extraction-parsing

Extract structured fields from unstructured log data using OPAL parsing functions. Covers extract_regex() for pattern matching with type casting, split() for delimited data, parse_json() for JSON logs, and JSONPath for navigating parsed structures. Use when you need to convert raw log text into queryable fields for analysis, filtering, or aggregation.

rustomax

data-ai

data-engineering

1

url-parameter-parser

Parse URLs in CSV files and extract query parameters as new columns. Use when working with CSV files containing URLs that need parameter extraction and analysis.

feed-mob

data-ai

data-engineering

1

data-migration-expert

Use this agent when reviewing PRs that touch database migrations, data backfills, or any code that transforms production data. This agent validates ID mappings against production reality, checks for swapped values, verifies rollback safety, and ensures data integrity during schema changes. Essential for any migration that involves ID mappings, column renames, or data transformations. <example>Context: The user has a PR with database migrations that involve ID mappings. user: "Review this PR that migrates from action_id to action_module_name" assistant: "I'll use the data-migration-expert agent to validate the ID mappings and migration safety" <commentary>Since the PR involves ID mappings and data migration, use the data-migration-expert to verify the mappings match production and check for swapped values.</commentary></example> <example>Context: The user has a migration that transforms enum values. user: "This migration converts status integers to string enums" assistant: "Let me have the data-migration-exper

i3ringit

data-ai

data-engineering

1

data-engineering

Master data engineering, ETL/ELT, data warehousing, SQL optimization, and analytics. Use when building data pipelines, designing data systems, or working with large datasets.

pluginagentmarketplace

data-ai

data-engineering

1

data-warehousing

Snowflake, BigQuery, Redshift, dimensional modeling, and modern data warehouse architecture

pluginagentmarketplace

data-ai

data-engineering

1

supabase-realtime

Comprehensive guide for implementing Supabase Realtime features with best practices, scalable patterns, and migration strategies. Use when building realtime features in Supabase applications including messaging, notifications, presence, live updates, collaborative features, or migrating from postgres_changes to broadcast. Covers client setup, database triggers with realtime.broadcast_changes, RLS authorization, naming conventions, and performance optimization.

antonpme

data-ai

data-engineering

1

tg-validation

Data validation patterns for the World of Darkness Django application including database constraints, model validators, and atomic transactions. Use when implementing XP/freebie spending transactions, adding database constraints to models, writing clean() validation methods, or ensuring data integrity for character stats.

charlesmsiegel

data-ai

data-engineering

1

airflow-expert

Expert-level Apache Airflow orchestration, DAGs, operators, sensors, XComs, task dependencies, and scheduling

personamanagmentlayer

data-ai

data-engineering

1

databricks-expert

Expert-level Databricks platform, Apache Spark, Delta Lake, MLflow, notebooks, and cluster management

personamanagmentlayer

data-ai

data-engineering

1

postgresql-replication

PostgreSQL streaming replication - setup, monitoring, failover

pluginagentmarketplace

data-ai

data-engineering

1

schema-patterns

Effect Schema conventions and patterns. Triggers on Schema class creation, tagged unions, enums, type guards, or test fixtures using Effect Schema.

jasonkuhrt

data-ai

data-engineering

1

polars

Fast in-memory DataFrame library for datasets that fit in RAM. Use when pandas is too slow but data still fits in memory. Lazy evaluation, parallel execution, Apache Arrow backend. Best for 1-100GB datasets, ETL pipelines, faster pandas replacement. For larger-than-RAM data use dask or vaex.

hxk622

data-ai

data-engineering

1

data-warehousing

Master dimensional data modeling including star schema design, slowly changing dimensions, fact tables, and data warehouse architecture

pluginagentmarketplace

data-ai

data-engineering

1

db-query

This skill enables querying Spanner databases through the AfterShip DSP API. It uses the go-admin-automizely-cli library to obtain authentication tokens and execute SQL queries against Spanner databases in different environments.

virgoC0der

data-ai

data-engineering

1

openspec-sync-specs

Sync delta specs from a change to main specs. Use when the user wants to update main specs with changes from a delta spec, without archiving the change.

pproenca

data-ai

data-engineering

1

db-migration

Generate Firestore data migration scripts for schema changes, field additions, and data transformations. Use when migrating data, adding fields, or restructuring collections.

JanSzewczyk

data-ai

data-engineering

1

etl-tools

Apache Airflow, dbt, Prefect, Dagster, and modern data orchestration for production data pipelines

pluginagentmarketplace

data-ai

data-engineering

1

domino-distributed-computing

Work with distributed computing frameworks in Domino including Apache Spark, Ray, and Dask clusters. Covers cluster configuration, on-demand clusters, choosing between frameworks, PySpark usage, and scaling workloads. Use when processing large datasets, parallel ML training, or running distributed compute jobs.

jvdomino

data-ai

data-engineering

1

zarr-python

Chunked N-D arrays for cloud storage. Compressed arrays, parallel I/O, S3/GCS integration, NumPy/Dask/Xarray compatible, for large-scale scientific computing pipelines.

hxk622

data-ai

data-engineering

1

replay-dead-letters

Replay failed observations after bug fixes or database issues. Shows which observations succeed and fail, helping recover from failures and validate fixes work on real data.

harehimself

data-ai

data-engineering

1

workflow-logging

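Log format specification and write patterns for workflow processes. Defines the log structure, event types, level definitions, and write methods for two formats: JSONL and plain text.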

penkzhou

data-ai

data-engineering

1

new-scanner

Use this skill ONLY when creating a new data scanner (e.g., Twitter scanner, Bloomberg scanner). Do not use for agents or strategies.

kimrejstrom

data-ai

data-engineering

1

gcp-bq-data-export

Use when exporting BigQuery data to Cloud Storage, extracting tables to CSV, JSON, Avro, or Parquet formats, or using EXPORT DATA statements. Covers bq extract command, format options, compression, and wildcard exports.

FunnelEnvy

data-ai

data-engineering

1

zrl-bundle-creator

This skill should be used when creating custom Zipline data bundles from CSV, API, or database sources. It provides deterministic patterns for ingesting OHLCV data, registering bundles, and validating data integrity for backtesting.

JeanBaissari

data-ai
