home/categories/data-engineering
category focus

Data Eng.

ETL pipelines and big data infrastructure.

1541 スキルall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
data-engineering
0

pm-07-conformance

Evaluate conformance of the event log to discovered models and generate deviation artefacts.

Wattysaid
Wattysaid
data-ai
open
data-engineering
0

dbf-data-analysis

This skill should be used when the user asks to "analyze DBF files", "read DBF data", "query DBF database", "convert DBF to Parquet", "analyze Thai accounting data", "explore legacy database", "run DuckDB queries on DBF", or mentions DBF, Parquet conversion, or Thai legacy accounting systems. Provides comprehensive guidance for reading, converting, and analyzing Thai legacy DBF accounting databases.

ninyawee
ninyawee
data-ai
open
data-engineering
0

historical-backfill-execution

Execute chunked historical blockchain data backfills using canonical 1-year pattern. Use when loading multi-year historical data, filling gaps in ClickHouse, or preventing OOM failures on Cloud Run. Keywords chunked_backfill.sh, BigQuery historical, gap filling, memory-safe backfill.

terrylica
terrylica
data-ai
open
data-engineering
0

dba-architect

DBA Architect Agent. 스키마 설계, 정규화, 데이터 모델링을 담당합니다.

shaul1991
shaul1991
data-ai
open
data-engineering
0

clickhouse-cloud-service-setup

Fetch ClickHouse Cloud service details from API (organization ID, service endpoints, configuration). Use when setting up new ClickHouse Cloud services, discovering endpoints, or validating service metadata for gapless-crypto-clickhouse project.

terrylica
terrylica
data-ai
open
data-engineering
0

ontology-phase-3-synthesize

Phase 3 of Ontology Builder Pipeline. Synthesizes analysis results into complete Domain Requirement Document (DRD). Use after Phase 2 analysis is complete to generate structured DRD.

a4b-corporation
a4b-corporation
data-ai
open
data-engineering
0

drift-fpdart

Flutter + fpdart環境でのDriftデータベース管理の包括的なスキル。以下の場合に使用:(1) 型安全なクエリを持つDriftデータベースのセットアップ、(2) EitherとTaskEitherを使った関数型エラーハンドリングの実装、(3) fpdartの合成を使ったリポジトリパターンの作成、(4) データベースマイグレーションとスキーマ変更の管理、(5) 関数型変換を使ったリアクティブストリームの記述、(6) 関数型ラッパーを使ったトランザクション管理の実装、(7) 例外を使わない関数型データベースエラーハンドリング、(8) 依存性注入を使ったテスト可能なデータベースレイヤーの作成、(9) Driftのクエリビルダーを使った複雑なクエリの構築、(10) リレーションシップ管理の実装、(11) JSONカラムを使った柔軟なデータ構造の保存と取得

northox5825
northox5825
data-ai
open
data-engineering
0

snowflake-semanticview

Create, alter, and validate Snowflake semantic views using Snowflake CLI (snow). Use when asked to build or troubleshoot semantic views/semantic layer definitions with CREATE/ALTER SEMANTIC VIEW, to validate semantic-view DDL against Snowflake via CLI, or to guide Snowflake CLI installation and connection setup.

Ditto190
Ditto190
data-ai
open
data-engineering
0

pm-02-ingest-profile

Ingest the event log, normalise schema, and generate an initial data profile with notebook and manifest updates.

Wattysaid
Wattysaid
data-ai
open
data-engineering
0

data-io-impl

Implement data I/O with immediate DTO conversion using ILoader/ISaver protocols. Use when: building data loaders, result savers, file handlers, model persistence, or any external I/O operations. Triggers: "loader", "saver", "I/O", "file", "parquet", "json", "csv", "persistence", "データ読込", "保存", "永続化", "読み書き". NOT for: data transformation (use processor-impl), business logic (use domain-impl).

sunbluesome
sunbluesome
data-ai
open
data-engineering
0

data-validation

Comprehensive data validation framework for testing schema compliance, data quality, and referential integrity. Validates databases, APIs, data pipelines, and file formats. Generates data quality scorecards with anomaly detection across completeness, accuracy, and consistency.

chaserbreitenbach
chaserbreitenbach
data-ai
open
data-engineering
0

athena-queries

Run AWS Athena queries against telemetry data. Use when executing SQL against telemetry-parser-db (raw Parquet from telemetry-parser-service) or telemetry_alerts (DBT-transformed tables). Also for Glue catalog exploration, partition debugging, or filtering by $path pseudo-column.

asimihsan
asimihsan
data-ai
open
data-engineering
0

data-import

车险经营数据导入技能,支持CSV/JSON解析、验证、优先级管理

alongor666
alongor666
data-ai
open
data-engineering
0

local-clickhouse

Install, configure, and validate local ClickHouse for gapless-crypto-clickhouse development and backtesting. Use when setting up local development environment, enabling offline mode, improving query performance for backtesting, or running E2E validation. Includes mise/Homebrew/apt installation, mode detection, connection validation, and E2E workflow scripts.

terrylica
terrylica
data-ai
open
data-engineering
0

fiftyone

FiftyOne dataset visualization and curation tool via Podman Quadlet. Multi-container architecture with MongoDB sidecar for dataset persistence. GPU-accelerated for ML workflows. Use when users need to configure, start, or manage FiftyOne for dataset analysis.

atrawog
atrawog
data-ai
open
data-engineering
0

data-validator

验证车险CSV数据的完整性和正确性,检查26个必需字段,验证数据类型、枚举值和业务规则。当用户提到"验证数据"、"检查数据"、"数据导入"、"CSV"时使用。

alongor666
alongor666
data-ai
open
data-engineering
0

dataform-engineering-fundamentals

Use when developing BigQuery Dataform transformations, SQLX files, source declarations, or troubleshooting pipelines - enforces TDD workflow (tests first), ALWAYS use ${ref()} never hardcoded table paths, comprehensive columns:{} documentation, safety practices (--schema-suffix dev, --dry-run), proper ref() syntax, .sqlx for new declarations, no schema config in operations/tests, and architecture patterns that prevent technical debt under time pressure

ihistand
ihistand
data-ai
open
data-engineering
0

fvtt-data-migrations

This skill should be used when moving data between storage locations, changing data structures, renaming fields, or removing deprecated data. Covers schema versioning, safe migration methods, the Foundry unset operator, and idempotent migrations.

ImproperSubset
ImproperSubset
data-ai
open
data-engineering
0

learning

Updates local AgentDB vector store with new case patterns while monitoring for bias introduction

do-ops885
do-ops885
data-ai
open
data-engineering
0

prisma-data-model

Understand the specific data model, entities, and database operations for Project SENTINEL. Use when writing database queries, creating migrations, or explaining the data architecture.

ArtisanClarinets
ArtisanClarinets
data-ai
open
data-engineering
0

neo4j-graphiti-migration

Assists with migrating data from Neo4j Memory to Graphiti temporal knowledge graphs

donbr
donbr
data-ai
open
data-engineering
0

altinity-expert-clickhouse-errors

Investigate ClickHouse query failures, exceptions, crashes, and error patterns. Use for error analysis and failure investigation.

Altinity
Altinity
data-ai
open
Previous
Page 62 / 65
Next