Data Eng.

ETL pipelines and big data infrastructure.

1541 skills
data-engineering
185

flowerpower

Create and manage data pipelines using the FlowerPower framework with Hamilton DAGs and uv. Use when users request creating flowerpower projects, pipelines, Hamilton dataflows, or ask about flowerpower configuration, execution, or CLI commands.

majiayu000
data-ai
data-engineering
185

flyway-migrations

Flyway database migrations - use for schema changes, data migrations, version management, and PostgreSQL DDL

majiayu000
data-ai
data-engineering
185

frappe-data-migration-generator

Generate data migration scripts for Frappe. Use when migrating data from legacy systems, transforming data structures, or importing large datasets.

majiayu000
data-ai
data-engineering
185

fvtt-data-migrations

This skill should be used when moving data between storage locations, changing data structures, renaming fields, or removing deprecated data. Covers schema versioning, safe migration methods, the Foundry unset operator, and idempotent migrations.

majiayu000
data-ai
data-engineering
185

gcp-bq-data-loading

Use when loading data into BigQuery from CSV, JSON, Avro, Parquet files, Cloud Storage, or local files. Covers bq load command, source formats, schema detection, incremental loading, and handling parsing errors.

majiayu000
data-ai
data-engineering
185

go-docker

Docker containerization for Go applications

majiayu000
data-ai
data-engineering
185

grey-haven-data-modeling

Design database schemas for Grey Haven multi-tenant SaaS - SQLModel models, Drizzle schema, multi-tenant isolation with tenant_id and RLS, timestamp fields, foreign keys, indexes, migrations, and relationships. Use when creating database tables.

majiayu000
data-ai
data-engineering
185

grey-haven-data-validation

Comprehensive data validation using Pydantic v2 with data quality monitoring and schema alignment for PlanetScale PostgreSQL. Use when implementing API validation, database schema alignment, or data quality assurance. Triggers: 'validation', 'Pydantic', 'schema', 'data quality'.

majiayu000
data-ai
data-engineering
185

implement-connector

Implement a Python connector that conforms to the LakeflowConnect interface for data ingestion.

majiayu000
data-ai
data-engineering
185

implementing-query-pagination

Implement cursor-based or offset pagination for Prisma queries. Use for datasets of 100k+ records, APIs with page navigation, or when infinite scroll or pagination is mentioned.

majiayu000
data-ai
data-engineering
185

instrumentation

Use when defining events, fields, and governance for GTM analytics pipelines.

majiayu000
data-ai
data-engineering
185

java-docker

Containerize Java applications - Dockerfile optimization, JVM settings, security

majiayu000
data-ai
data-engineering
185

julien-infra-jokers

Complete management for Jokers Hockey website - deployment, build checks, database migrations (Drizzle ORM), and PM2 process management. Use for any Jokers site operation.

majiayu000
data-ai
data-engineering
185

laravel-data-writer

Skill for creating and editing Spatie Laravel Data classes following Prowi conventions. Use when working with Data classes, DTOs, or data transfer objects. Enforces proper constructor-based properties, annotation-based validation, and Collection usage.

majiayu000
data-ai
data-engineering
185

litestream-coder

This skill guides configuring Litestream for continuous SQLite backup in Rails 8+ apps. Use when setting up production backups for SQLite databases (Solid Queue, Solid Cache, Solid Cable).

majiayu000
data-ai
data-engineering
185

local-first

Enforces local-first architecture principles for Breath of Now. Use this skill when working with data, state management, or sync features. Ensures IndexedDB (Dexie.js) is always the source of truth.

majiayu000
data-ai
data-engineering
185

lockplane

Use Lockplane for safe database schema management - define schemas in .lp.sql files, validate, and apply with shadow DB testing

majiayu000
data-ai
data-engineering
185

azure-kusto

Query and analyze data in Azure Data Explorer (Kusto/ADX) using KQL for log analytics, telemetry, and time series analysis. WHEN: KQL queries, Kusto database queries, Azure Data Explorer, ADX clusters, log analytics, time series data, IoT telemetry, anomaly detection.

microsoft
data-ai
data-engineering
183

aleph

/aleph - External-memory workflow for large local data.

Hmbown
data-ai
data-engineering
177

tddhypershift

TDD workflow with HyperShift cluster - real-time debugging with full cluster access

kagenti
data-ai
data-engineering
177

tddkind

TDD workflow with Kind cluster - fast local iteration for Kagenti development

kagenti
data-ai
data-engineering
177

hypershiftcluster

Create and destroy HyperShift clusters on AWS for testing Kagenti platform. Manages ephemeral OpenShift clusters.

kagenti
data-ai
data-engineering
177

hypershift

Manage HyperShift clusters on AWS for Kagenti testing. Create, destroy, debug clusters and check quotas.

kagenti
data-ai