home/categories/data-engineering

category focus

Data Eng.

ETL pipelines and big data infrastructure.

1541 skillsall categories

sorting

stars

current ordering strategy

query

all entries

refine the visible subset

data-engineering

185

flowerpower

Create and manage data pipelines using the FlowerPower framework with Hamilton DAGs and uv. Use when users request creating flowerpower projects, pipelines, Hamilton dataflows, or ask about flowerpower configuration, execution, or CLI commands.

majiayu000

data-ai

open

data-engineering

185

flyway-migrations

Flyway database migrations - use for schema changes, data migrations, version management, and PostgreSQL DDL

majiayu000

data-ai

open

data-engineering

185

frappe-data-migration-generator

Generate data migration scripts for Frappe. Use when migrating data from legacy systems, transforming data structures, or importing large datasets.

majiayu000

data-ai

open

data-engineering

185

fvtt-data-migrations

This skill should be used when moving data between storage locations, changing data structures, renaming fields, or removing deprecated data. Covers schema versioning, safe migration methods, the Foundry unset operator, and idempotent migrations.

majiayu000

data-ai

open

data-engineering

185

gcp-bq-data-loading

Use when loading data into BigQuery from CSV, JSON, Avro, Parquet files, Cloud Storage, or local files. Covers bq load command, source formats, schema detection, incremental loading, and handling parsing errors.

majiayu000

data-ai

open

data-engineering

185

go-docker

Docker containerization for Go applications

majiayu000

data-ai

open

data-engineering

185

go-sync-primitives

sync.WaitGroup and sync.Mutex patterns

majiayu000

data-ai

open

data-engineering

185

grey-haven-data-modeling

Design database schemas for Grey Haven multi-tenant SaaS - SQLModel models, Drizzle schema, multi-tenant isolation with tenant_id and RLS, timestamp fields, foreign keys, indexes, migrations, and relationships. Use when creating database tables.

majiayu000

data-ai

open

data-engineering

185

grey-haven-data-validation

Comprehensive data validation using Pydantic v2 with data quality monitoring and schema alignment for PlanetScale PostgreSQL. Use when implementing API validation, database schema alignment, or data quality assurance. Triggers: 'validation', 'Pydantic', 'schema', 'data quality'.

majiayu000

data-ai

open

data-engineering

185

implement-connector

Implement a Python connector that conforms to the LakeflowConnect interface for data ingestion.

majiayu000

data-ai

open

data-engineering

185

implementing-query-pagination

Implement cursor-based or offset pagination for Prisma queries. Use for datasets 100k+, APIs with page navigation, or infinite scroll/pagination mentions.

majiayu000

data-ai

open

data-engineering

185

instrumentation

Use when defining events, fields, and governance for GTM analytics pipelines.

majiayu000

data-ai

open

data-engineering

185

java-docker

Containerize Java applications - Dockerfile optimization, JVM settings, security

majiayu000

data-ai

open

data-engineering

185

julien-infra-jokers

Complete management for Jokers Hockey website - deployment, build checks, database migrations (Drizzle ORM), and PM2 process management. Use for any Jokers site operation.

majiayu000

data-ai

open

data-engineering

185

laravel-data-writer

Skill for creating and editing Spatie Laravel Data classes following Prowi conventions. Use when working with Data classes, DTOs, or data transfer objects. Enforces proper constructor-based properties, annotation-based validation, and Collection usage.

majiayu000

data-ai

open

data-engineering

185

litestream-coder

This skill guides configuring Litestream for continuous SQLite backup in Rails 8+ apps. Use when setting up production backups for SQLite databases (Solid Queue, Solid Cache, Solid Cable).

majiayu000

data-ai

open

data-engineering

185

local-first

Enforces local-first architecture principles for Breath of Now. Use this skill when working with data, state management, or sync features. Ensures IndexedDB (Dexie.js) is always the source of truth.

majiayu000

data-ai

open

data-engineering

185

lockplane

Use Lockplane for safe database schema management - define schemas in .lp.sql files, validate, and apply with shadow DB testing

majiayu000

data-ai

open

data-engineering

185

azure-kusto

Query and analyze data in Azure Data Explorer (Kusto/ADX) using KQL for log analytics, telemetry, and time series analysis. WHEN: KQL queries, Kusto database queries, Azure Data Explorer, ADX clusters, log analytics, time series data, IoT telemetry, anomaly detection.