home/categories/data-engineering
category focus

Data Eng.

ETL pipelines and big data infrastructure.

1541 skillsall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
data-engineering
538

event-loop

Expert skill for high-performance event-driven I/O programming and optimization

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

protocol-parser

Specialized skill for binary and text protocol parsing and serialization. Design and validate protocol message formats, generate parser code from specifications, implement state machine parsing, and handle endianness and byte alignment.

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

serialization

Expert skill for binary and text serialization formats, schema design, and optimization

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

gatling-load-testing

Expert skill for Gatling simulation development, load test execution, and performance analysis. Write Gatling simulations in Scala DSL, configure injection profiles and feeders, define assertions, analyze HTML reports, and integrate with Gatling Enterprise.

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

cucumber-bdd-testing

Cucumber/Gherkin BDD testing for behavior-driven development workflows

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

key-management-orchestrator

Cryptographic key lifecycle management orchestration including generation, rotation, and destruction across key management systems

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

docker-web

Docker containerization for web apps, multi-stage builds, and optimization.

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

mongodb

MongoDB schema design, aggregation pipelines, indexing strategies, and performance.

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

react-query

TanStack Query (React Query) patterns for server state management, caching, mutations, optimistic updates, and infinite queries.

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

turborepo

Turborepo configuration, caching, and pipeline optimization.

a5c-ai
a5c-ai
data-ai
open
data-engineering
533

dump

Dump Kurtosis state for debugging and sharing. Export enclave state including service logs, configurations, and file artifacts to a local directory. Use when you need to capture state for offline analysis or to share with others for debugging.

kurtosis-tech
kurtosis-tech
data-ai
open
data-engineering
529

database-architect

Full pipeline for DB design. An agent team collaborates to perform data modeling, migration, indexing, query optimization, and security verification. Use this skill for any database design task including 'design a database', 'database modeling', 'table design', 'ERD', 'migration', 'query optimization', 'index design', 'SQL schema', 'PostgreSQL design', 'MySQL design', etc. Also supports optimization and security auditing for existing schemas. Note: actual DB server installation/operation, cloud infrastructure provisioning, and monitoring dashboard setup are outside the scope of this skill.

revfactory
revfactory
data-ai
open
data-engineering
529

query-optimization-catalog

SQL query optimization catalog. An extension skill for performance-analyst that provides index strategies (B-Tree/Hash/GIN/GiST), execution plan analysis, N+1 problem resolution, partitioning strategies, and per-pattern optimization techniques for slow queries. Use when performing DB performance analysis involving 'query optimization', 'index design', 'execution plans', 'N+1 problems', 'partitioning', 'slow queries', etc. Note: data modeling and security configuration are outside the scope of this skill.

revfactory
revfactory
data-ai
open
data-engineering
529

legacy-modernizer

A full pipeline for transforming legacy codebases into modern architectures. An agent team collaborates to perform technical debt analysis, refactoring strategy formulation, code migration, and regression testing. Use this skill for requests like 'modernize legacy code', 'create a refactoring strategy', 'code migration', 'technical debt analysis', 'legacy system upgrade', 'framework migration', 'code modernization', 'refactoring plan', and other legacy code modernization tasks. Also supports strategy formulation and migration when existing analysis reports are available. Note: actual production deployment, CI/CD pipeline execution, and infrastructure provisioning are outside the scope of this skill.

revfactory
revfactory
data-ai
open
data-engineering
529

dag-orchestration-patterns

Airflow DAG pattern, of , retry strategy, etc. , strategy etc. data pipeline orchestration guide. 'Airflow DAG', 'DAG ', 'of', 'retry strategy', 'etc.', '', 'pipeline orchestration', 'Dagster', 'Prefect' etc. pipeline scheduling this for. scheduler-engineerof DAG -ize. , data rule of monitoring dashboard this of scope .

revfactory
revfactory
data-ai
open
data-engineering
529

data-pipeline

data pipelineof count, transformation, -based, verification, monitoring inthisbefore teamthis to ·lower pipeline. 'data pipeline ', 'ETL pipeline ', 'data count automatic-ize', 'data lower pipeline', 'ELT ', ' pipeline', 'tree pipeline', 'Airflow DAG only', 'dbt model ', 'data verification ' etc. data pipeline · beforein this for. existing pipelineof verificationthis monitoringonly necessary inalso supported. , real-time tree (Flink/Spark Streaming) direct execution, infrastructure provisioning, database administrator(DBA) this of scope .

revfactory
revfactory
data-ai
open
data-engineering
529

data-quality-framework

data (accuracy, completeness, timeliness, consistency etc.)per verification rule and Great Expectations, dbt tests etc.of also for guide. 'data ', 'verification rule', 'Great Expectations', 'dbt test', 'data profiling', 'or more detection', 'data ' etc. data this for. data-quality-managerof verification -ize. , pipeline schedulingthis before architecture this of scope .

revfactory
revfactory
data-ai
open
data-engineering
529

data-analysis

A full analysis pipeline where an agent team collaborates to perform exploratory data analysis (EDA), data cleaning, statistical analysis, visualization, and report writing. Use this skill for 'analyze this data', 'do EDA', 'exploratory analysis', 'statistical analysis', 'data visualization', 'write an analysis report', 'analyze CSV', 'extract data insights', 'data cleaning', 'outlier analysis', and other data analysis tasks. Note: real-time data streaming, ML model training/deployment, and BI dashboard server construction are outside this skill's scope.

revfactory
revfactory
data-ai
open
data-engineering
529

data-migration

Full migration pipeline where an agent team collaborates to perform source analysis, schema mapping, transformation script generation, validation query design, and rollback planning. Use this skill for requests like 'data migration', 'DB migration', 'data transfer', 'schema conversion', 'database migration plan', 'ETL scripts', 'data transition', 'DB migration validation', 'system cutover', etc. Note: real-time CDC streaming setup, cloud infrastructure provisioning, and application code migration are outside the scope of this skill.

revfactory
revfactory
data-ai
open
Previous
Page 22 / 65
Next