Data Engineering

ETL pipelines and big data infrastructure.

1541 skills
Sorted by stars
data-engineering
5.9K

openspec-sync-specs

Sync delta specs from a change to main specs. Use when the user wants to update main specs with changes from a delta spec, without archiving the change.

PBH-BTN
data-ai
data-engineering
5.9K

data-layer

This skill provides patterns for working with the data-layer module. Use when creating/editing files in src/data-layer/, src/lib/data/, or adding new data sources.

ethereum
data-ai
data-engineering
5.7K

breakup-pr

Break up a large PR into vertical feature slices delivered incrementally via Graphite stacked PRs. All verticals share a single feature flag so the entire feature ships atomically. Use when: splitting a large PR, breaking up a diff, vertical slicing, incremental delivery, phased rollout, or when a PR is too large to review.

lightdash
data-ai
data-engineering
5.6K

model-doc-sync

Keep anomalib model READMEs, docs pages, image assets, and benchmark/result references in sync

open-edge-platform
data-ai
data-engineering
5.4K

daft-distributed-scaling

Scale Daft workflows to distributed Ray clusters. Invoke when optimizing performance or handling large data.

Eventual-Inc
data-ai
data-engineering
5.1K

geo-schema

Schema.org structured data audit and generation optimized for AI discoverability — detect, validate, and generate JSON-LD markup

zubair-trabzada
data-ai
data-engineering
4.9K

incremental-audio-workflow

Step-by-step audio production with per-stem verification, timing alignment, and incremental quality gates

HKUDS
data-ai
data-engineering
4.9K

audio-track-production

End-to-end audio production workflow with stems, effects, archiving, and verification

HKUDS
data-ai
data-engineering
4.9K

incremental-excel-build

Build complex Excel files through staged, verifiable steps with intermediate CSV outputs for debugging

HKUDS
data-ai
data-engineering
4.9K

pptx-debug-workflow

Systematic debugging workflow for python-pptx presentation generation

HKUDS
data-ai
data-engineering
4.9K

resilient-research-workflow

Unified workflow that delegates failed web searches to shell_agent for resilient data gathering, then applies anchored spreadsheet proof gates for verified Excel output

HKUDS
data-ai
data-engineering
4.9K

panel-component-aggregator

Dashboard panel component pattern for aggregator/summary panels that accept pushed partial data updates via updateData(), defer rendering until first data arrives, and incrementally re-render metrics without re-fetching — ideal for panels that synthesize data already fetched by other panels.

HKUDS
data-ai
data-engineering
4.9K

virtual-list

Implement efficient virtual scrolling for rendering large lists with DOM recycling, chunk-based rendering, and performance optimizations

HKUDS
data-ai
data-engineering
4.9K

ff-oce-dashboard

Generate the OCE shift status dashboard. Triggers on: 'generate shift dashboard', 'show dashboard', 'shift status', 'status dashboard', 'what's going on', or any request for a NON-SPECIFIC overview of current OCE status (incidents, pipelines, errors).

microsoft
data-ai
data-engineering
4.5K

seo-schema

Detect, validate, and generate Schema.org structured data. JSON-LD format preferred. Use when user says "schema", "structured data", "rich results", "JSON-LD", or "markup".

AgriciDaniel
data-ai
data-engineering
4.2K

building-ioc-defanging-and-sharing-pipeline

Build an automated pipeline to defang indicators of compromise (URLs, IPs, domains, emails) for safe sharing and distribute them in STIX format through TAXII feeds and threat intelligence platforms.

mukul975
data-ai
data-engineering
4.2K

implementing-cloud-dlp-for-data-protection

Implementing Cloud Data Loss Prevention (DLP) using Amazon Macie, Azure Information Protection, and Google Cloud DLP API to discover, classify, and protect sensitive data across cloud storage, databases, and data pipelines.

mukul975
data-ai
data-engineering
4.2K

implementing-endpoint-dlp-controls

Implements endpoint Data Loss Prevention (DLP) controls to detect and prevent sensitive data exfiltration through email, USB, cloud storage, and printing. Use when deploying DLP agents, creating content inspection policies, or preventing unauthorized data movement from endpoints. Activates for requests involving DLP, data exfiltration prevention, content inspection, or sensitive data protection on endpoints.

mukul975
data-ai
data-engineering
4.2K

implementing-threat-intelligence-lifecycle-management

Implement a structured threat intelligence lifecycle encompassing planning, collection, processing, analysis, dissemination, and feedback stages to produce actionable intelligence for organizational decision-making.

mukul975
data-ai
data-engineering
4K

team-project-management

Use when a task requires multiple workers to collaborate, when tasks have dependencies (parallel/serial), or when you need DAG-based task orchestration within your team.

agentscope-ai
data-ai
data-engineering
4K

indigo-dex

Interact with decentralized exchanges on Cardano through the Indigo Protocol ecosystem.

openclaw
data-ai
data-engineering
4K

indigo-ipfs

Store and retrieve data on IPFS and query collector UTXOs for the Indigo Protocol.

openclaw
data-ai
data-engineering
4K

team

The Autonomous Orchestration Framework (AOF). Defines the hierarchy, collaboration logic, and reward distribution for human-agent hybrid organizations and task-force swarms.

openclaw
data-ai