home/categories/data-engineering
category focus

Data Eng.

ETL pipelines and big data infrastructure.

1541 skillsall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
data-engineering
483

azure-table-storage

Expert knowledge for Azure Table Storage development including best practices, architecture & design patterns, limits & quotas, security, configuration, and integrations & coding patterns. Use when managing Entra ID/RBAC access, monitoring metrics/logs, tuning partitions/keys, or scripting tables via PowerShell, and other Azure Table Storage related development tasks. Not for Azure Cosmos DB (use azure-cosmos-db), Azure Blob Storage (use azure-blob-storage), Azure Queue Storage (use azure-queue-storage), Azure Files (use azure-files).

MicrosoftDocs
MicrosoftDocs
data-ai
open
data-engineering
471

bio-clinical-biostatistics-cdisc-data

Reads and prepares CDISC SDTM clinical trial data for analysis. Handles domain tables (DM, AE, EX, VS, LB), USUBJID-based joins, event-to-subject aggregation, and SUPPQUAL pivoting. Use when working with clinical trial datasets in CDISC/SDTM format or .xpt files.

GPTomics
GPTomics
data-ai
open
data-engineering
471

bio-workflows-clinical-trial-pipeline

End-to-end clinical trial analysis workflow from CDISC data loading through statistical testing to regulatory-compliant reporting. Covers data preparation, logistic regression, categorical tests, subgroup analysis, and Table 1 generation. Use when performing a complete analysis of clinical trial data.

GPTomics
GPTomics
data-ai
open
data-engineering
456

clickhouse-io

ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.

vibeeval
vibeeval
data-ai
open
data-engineering
456

docker-ops

Dockerfile best practices, multi-stage builds, docker-compose, container networking, volume management, and image optimization.

vibeeval
vibeeval
data-ai
open
data-engineering
456

harvest-structured

Structured data extraction - tables, pricing, products, API endpoints with schema

vibeeval
vibeeval
data-ai
open
data-engineering
456

morph-search

Fast codebase search via WarpGrep (20x faster than grep)

vibeeval
vibeeval
data-ai
open
data-engineering
456

redis-patterns

Data structure selection, pub/sub patterns, Lua scripting, pipelining, and cluster topology strategies.

vibeeval
vibeeval
data-ai
open
data-engineering
456

tdd-migrate

TDD workflow for migrations - orchestrate agents, zero main context growth

vibeeval
vibeeval
data-ai
open
data-engineering
456

tdd-migration-pipeline

Orchestrator-only workflow for migrating/rewriting codebases with full TDD and agent delegation

vibeeval
vibeeval
data-ai
open
data-engineering
456

terraform-patterns

Module composition, state management, workspace strategy, provider versioning, and infrastructure-as-code best practices.

vibeeval
vibeeval
data-ai
open
data-engineering
453

academic-delivery

8-step pipeline for academic deliverables — essays, reports, analysis, capstones. Encodes the full intake-to-delivery workflow with automatic red-team triggering.

winstonkoh87
winstonkoh87
data-ai
open
data-engineering
451

create-dsc-resource

Create a complete, accurate DSC resource in this repository following the provided guidelines and design patterns.

PowerShell
PowerShell
data-ai
open
data-engineering
448

gum-tool-save-classes

Reference guide for Gum's save/load data model. Load this when working with GumProjectSave, ScreenSave, ComponentSave, StandardElementSave, ElementSave, StateSave, VariableSave, InstanceSave, BehaviorSave, or any serialization/deserialization of Gum project files.

vchelaru
vchelaru
data-ai
open
data-engineering
448

gum-tool-variable-grid

Reference guide for Gum's Variables tab and DataUiGrid system. Load this when working on the Variables tab, DataUiGrid control, MemberCategory, InstanceMember, category population, property grid refresh, or category expansion state persistence.

vchelaru
vchelaru
data-ai
open
data-engineering
438

skill-presentation

Presentation extraction and slide generation routing

benbrastmckie
benbrastmckie
data-ai
open
data-engineering
438

skill-presentation

Presentation extraction and slide generation routing

benbrastmckie
benbrastmckie
data-ai
open
data-engineering
432

comp-sheet

Build an industry comp sheet Excel model with deep operational KPIs

daloopa
daloopa
data-ai
open
data-engineering
432

generic-max-supply

Multi-period supply chain planning model: data files, BOM structure, variable/constraint reference for the max-supply base model.

NVIDIA
NVIDIA
data-ai
open
data-engineering
429

bsl-query-expert

Query BSL semantic models with group_by, aggregate, filter, and visualizations. Use for data analysis from existing semantic tables.

boringdata
boringdata
data-ai
open
data-engineering
429

bsl-model-builder

Build BSL semantic models with dimensions, measures, joins, and YAML config. Use for creating/modifying data models.

boringdata
boringdata
data-ai
open
data-engineering
414

agent-clickhouse-io

ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.

Dokhacgiakhoa
Dokhacgiakhoa
data-ai
open
Previous
Page 25 / 65
Next