home/categories/data-engineering
category focus

Data Eng.

ETL pipelines and big data infrastructure.

1541 skillsall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
data-engineering
0

adding-api-sources

Use when implementing a new data source adapter for metapyle, before writing any source code

stabilefrisur
stabilefrisur
data-ai
open
data-engineering
0

etl-pipeline-agent

Designs and implements Extract, Transform, Load pipelines for data processing

Unicorn
Unicorn
data-ai
open
data-engineering
0

numpy-datetime

Date and time handling with datetime64 and timedelta64, including business day offsets and naive time parsing. Triggers: datetime64, timedelta64, busday, time series, naive time.

cuba6112
cuba6112
data-ai
open
data-engineering
0

ef-core-configuration

Use this skill when configuring Entity Framework Core entities using the Fluent API.

michaellperry
michaellperry
data-ai
open
data-engineering
0

fluid-model

在 Windows 环境下按算例(001-264,默认001)修改控制CSV(Boundary等)、就地patch批量job-config、运行 real_predict 并汇总/对比输出。

zly7
zly7
data-ai
open
data-engineering
0

pytest-testing

Pytest testing patterns for data pipelines, Airflow DAGs, and BigQuery queries. Use when writing unit tests for DAG tasks, integration tests with BigQuery, property-based tests with Hypothesis, data contract validation with Pydantic/JSON schemas, performance tests with pytest-benchmark, or snapshot/visual regression tests for data outputs.

ilorozco11
ilorozco11
data-ai
open
data-engineering
0

masterdata-csv-validator

GLOWマスタデータCSVの検証スキル。作成したCSVファイルがDB投入可能か、server/client実装と整合性があるかをテンプレート、DBスキーマと照合してチェックします。マスタデータ、CSV検証、バリデーション、チェックで使用します。

Wonderplanet
Wonderplanet
data-ai
open
data-engineering
0

hive-middleware

How to create and use middlewares in Hive framework

paralect
paralect
data-ai
open
data-engineering
0

aurora-criteria

Aurora Criteria Pattern - Complete guide for QueryStatement usage in Aurora/NestJS. Trigger: When implementing queries, filters, searches, pagination, or complex data retrieval.

avvale
avvale
data-ai
open
data-engineering
0

pca-integrator

Constroi servidores MCP (Model Context Protocol) para conectar o Claude ao banco de dados PostgreSQL e APIs do PCA Camocim.

narcisolcf
narcisolcf
data-ai
open
data-engineering
0

use-or-subclass-existing-component

Discover, use, or subclass existing Dagster integration components (dbt, Looker, PowerBI, Fivetran, etc.). Handles configuration-file-based components (dbt, Sling) and API-based components (Fivetran, PowerBI) appropriately. Use only when an existing Dagster component exists within Dagster's integration libraries.

cnolanminich
cnolanminich
data-ai
open
data-engineering
0

cobol-migration-analyzer

Analyzes legacy COBOL programs and JCL jobs to assist with migration to modern Java applications. Extracts business logic, identifies dependencies, generates migration reports, and creates Java implementation strategies. Use when working with mainframe migration, COBOL analysis, legacy system modernization, JCL workflows, or when users mention COBOL to Java conversion, analyzing .cbl/.CBL/.cob files, working with copybooks, or planning Java service implementations from COBOL programs.

DauQuangThanh
DauQuangThanh
data-ai
open
data-engineering
0

data-validator

Validates data against specified rules including required field checks, email format validation, and numeric type verification. Use when the user needs to verify data integrity, validate form inputs, or check data against business rules.

egermano
egermano
data-ai
open
data-engineering
0

data-quality-rules

See the main Data Validation Rules skill for comprehensive coverage of data quality rule implementation.

AmnadTaowsoam
AmnadTaowsoam
data-ai
open
data-engineering
0

data-engineering

Use when "data pipelines", "ETL", "data warehousing", "data lakes", or asking about "Airflow", "Spark", "dbt", "Snowflake", "BigQuery", "data modeling"

eyadsibai
eyadsibai
data-ai
open
data-engineering
0

pm-04-clean-filter

Apply cleaning and filtering actions based on data quality decisions and generate filtered log artefacts.

Wattysaid
Wattysaid
data-ai
open
data-engineering
0

docs-reconcile

Reconcile normalized CI/PL/BL fields and output mismatch report

macho715
macho715
data-ai
open
data-engineering
0

data-validation-rules

Implementing comprehensive validation rules across database, application, and pipeline layers to ensure data integrity.

AmnadTaowsoam
AmnadTaowsoam
data-ai
open
data-engineering
0

senior-data-engineer

World-class data engineering skill for building scalable data pipelines, ETL/ELT systems, and data infrastructure. Expertise in Python, SQL, Spark, Airflow, dbt, Kafka, and modern data stack. Includes data modeling, pipeline orchestration, data quality, and DataOps. Use when designing data architectures, building data pipelines, optimizing data workflows, or implementing data governance.

LRYuChi
LRYuChi
data-ai
open
data-engineering
0

altinity-expert-clickhouse-dictionaries

Analyze ClickHouse external dictionaries including configuration, memory usage, reload status, and performance. Use for dictionary issues and load failures.

Altinity
Altinity
data-ai
open
data-engineering
0

aviation-spatial

SpatiaLite database operations for aviation data. Use when building spatial indexes, running proximity queries, importing airspace GeoJSON, or performing geometric calculations on aviation features like finding airports within radius, airspace containment, or obstacle corridor searches.

KitfoxPilot
KitfoxPilot
data-ai
open
data-engineering
0

dara-dataset-expert

Warehouse-Prozess-Analyse mit 207 Labels, 47 Prozessen, 8 Szenarien, 10 Triggern. Vollständige Expertise für DaRa Datensatz + REFA-Methodik + Validierungslogik + Szenarioerkennung. 100% faktenbasiert ohne Halluzinationen.

mpone1909
mpone1909
data-ai
open
data-engineering
0

vkc-drizzle-schema-migration

Standardize Drizzle schema/migration/seed workflow for Viet K-Connect. Use when adding or changing DB tables, especially DB-driven visa rulesets and document templates (no hardcoding).

LEE-SANG-BOK
LEE-SANG-BOK
data-ai
open
data-engineering
0

plan-replayer-testing

Expertise in adding new test cases for the TiDB plan replayer. Use when the user provides a plan replayer zip file and wants to create a new test.

hawkingrei
hawkingrei
data-ai
open
Previous
Page 59 / 65
Next