home/categories/data-engineering
category focus

Data Eng.

ETL pipelines and big data infrastructure.

1541 اسکلزall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
data-engineering
32.5K

implementing-warehouse-sources

Implement and extend PostHog Data warehouse import sources. Use when adding a new source under posthog/temporal/data_imports/sources, adding datasets/endpoints to an existing source, or adding incremental sync support, pagination, credentials validation, and source tests.

PostHog
PostHog
data-ai
open
data-engineering
32.1K

schema-markup

Design, validate, and optimize schema.org structured data for eligibility, correctness, and measurable SEO impact.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

airflow-dag-patterns

Build production Apache Airflow DAGs with best practices for operators, sensors, testing, and deployment. Use when creating data pipelines, orchestrating workflows, or scheduling batch jobs.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

dbt-transformation-patterns

Production-ready patterns for dbt (data build tool) including model organization, testing strategies, documentation, and incremental processing.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

network-engineer

Expert network engineer specializing in modern cloud networking, security architectures, and performance optimization.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

scala-pro

Master enterprise-grade Scala development with functional programming, distributed systems, and big data processing. Expert in Apache Pekko, Akka, Spark, ZIO/Cats Effect, and reactive architectures.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

unity-ecs-patterns

Production patterns for Unity's Data-Oriented Technology Stack (DOTS) including Entity Component System, Job System, and Burst Compiler.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

azure-ai-transcription-py

Azure AI Transcription SDK for Python. Use for real-time and batch speech-to-text transcription with timestamps and diarization.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

azure-cosmos-ts

Azure Cosmos DB JavaScript/TypeScript SDK (@azure/cosmos) for data plane operations. Use for CRUD operations on documents, queries, bulk operations, and container management.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

azure-eventhub-py

Azure Event Hubs SDK for Python streaming. Use for high-throughput event ingestion, producers, consumers, and checkpointing.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

azure-mgmt-fabric-py

Azure Fabric Management SDK for Python. Use for managing Microsoft Fabric capacities and resources.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

azure-servicebus-dotnet

Azure Service Bus SDK for .NET. Enterprise messaging with queues, topics, subscriptions, and sessions.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

azure-storage-file-datalake-py

Azure Data Lake Storage Gen2 SDK for Python. Use for hierarchical file systems, big data analytics, and file/directory operations.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

azure-storage-queue-py

Azure Queue Storage SDK for Python. Use for reliable message queuing, task distribution, and asynchronous processing.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

azure-storage-queue-ts

Azure Queue Storage JavaScript/TypeScript SDK (@azure/storage-queue) for message queue operations. Use for sending, receiving, peeking, and deleting messages in queues.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

cc-skill-clickhouse-io

ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

cloudflare-workers-expert

Expert in Cloudflare Workers and the Edge Computing ecosystem. Covers Wrangler, KV, D1, Durable Objects, and R2 storage.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

data-engineer

Build scalable data pipelines, modern data warehouses, and real-time streaming architectures. Implements Apache Spark, dbt, Airflow, and cloud-native data platforms.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

data-engineering-data-pipeline

You are a data pipeline architecture expert specializing in scalable, reliable, and cost-effective data pipelines for batch and streaming data processing.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

data-quality-frameworks

Implement data quality validation with Great Expectations, dbt tests, and data contracts. Use when building data quality pipelines, implementing validation rules, or establishing data contracts.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

database-admin

Expert database administrator specializing in modern cloud databases, automation, and reliability engineering.

sickn33
sickn33
data-ai
open
data-engineering
32.1K

database-architect

Expert database architect specializing in data layer design from scratch, technology selection, schema modeling, and scalable database architectures.

sickn33
sickn33
data-ai
open
Previous
Page 2 / 65
Next