home/categories/data-engineering
category focus

Data Eng.

ETL pipelines and big data infrastructure.

1541 skillsall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
data-engineering
565

data-reconciliation-exceptions

Reconciles data sources using stable identifiers (Pay Number, driving licence, driver card, and driver qualification card numbers), producing exception reports and “no silent failure” checks. Use when you need weekly matching with explicit reasons for non-joins and mismatches.

sundial-org
sundial-org
data-ai
open
data-engineering
538

value-stream-mapping

Create and analyze value stream maps with waste identification and process efficiency metrics

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

container-images

Docker and OCI container image expertise for building, optimizing, and securing container images

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

bpmn-generator

Generate and validate BPMN 2.0 diagrams from process descriptions

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

context-engineering

Dynamic context injection, mode switching (dev/review/research), selective loading, and strategic compaction for token optimization.

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

verification-suite

Plan structure validation, phase completeness checks, reference integrity verification, and artifact existence confirmation. Provides the structured verification layer ensuring GSD artifacts are well-formed and complete.

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

chroma-integration

Chroma local vector database setup and operations for development and production

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

haystack-pipeline

Haystack NLP pipeline configuration for document processing and QA

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

milvus-integration

Milvus distributed vector database configuration for large-scale RAG applications

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

rag-embedding-generation

Batch embedding generation with caching, rate limiting, and multiple provider support

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

weaviate-integration

Weaviate vector database setup with GraphQL queries and hybrid search

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

dp-state-designer

Assist in designing optimal DP states and transitions

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

cloud-cost-estimator

Estimate cloud costs for migration targets with resource sizing and optimization recommendations

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

dependency-scanner

Comprehensive dependency scanning, inventory generation, and SBOM creation for migration readiness assessment

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

etl-pipeline-builder

Build and manage ETL pipelines for data migration with transformation, CDC, and monitoring

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

logging-migrator

Migrate logging infrastructure with format standardization, structured logging, and aggregation setup

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

schema-comparator

Compare database schemas between source and target environments for migration planning

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

gas-optimization

Advanced gas optimization techniques for EVM smart contracts. Covers storage packing, memory vs calldata optimization, assembly/Yul, efficient data structures, batch operations, and benchmark-driven optimization strategies.

a5c-ai
a5c-ai
data-ai
open
data-engineering
538

airflow-dag-analyzer

Analyzes, validates, and optimizes Apache Airflow DAGs for reliability, performance, and best practices adherence.

a5c-ai
a5c-ai
data-ai
open
Previous
Page 19 / 65
Next