home/categories/data-engineering
category focus

Data Eng.

ETL pipelines and big data infrastructure.

1541টি স্কিলall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
data-engineering
31.2K

swarm-advanced

Advanced swarm orchestration patterns for research, development, testing, and complex distributed workflows

ruvnet
ruvnet
data-ai
open
data-engineering
31.2K

v3-deep-integration

Deep agentic-flow@alpha integration implementing ADR-001. Eliminates 10,000+ duplicate lines by building claude-flow as specialized extension rather than parallel implementation.

ruvnet
ruvnet
data-ai
open
data-engineering
31.2K

agentdb-advanced-features

Master advanced AgentDB features including QUIC synchronization, multi-database management, custom distance metrics, hybrid search, and distributed systems integration. Use when building distributed AI systems, multi-agent coordination, or advanced vector search applications.

ruvnet
ruvnet
data-ai
open
data-engineering
31.2K

agentdb-performance-optimization

Optimize AgentDB performance with quantization (4-32x memory reduction), HNSW indexing (150x faster search), caching, and batch operations. Use when optimizing memory usage, improving search speed, or scaling to millions of vectors.

ruvnet
ruvnet
data-ai
open
data-engineering
31.2K

stream-chain

Stream-JSON chaining for multi-agent pipelines, data transformation, and sequential workflows

ruvnet
ruvnet
data-ai
open
data-engineering
31.2K

swarm-advanced

Advanced swarm orchestration patterns for research, development, testing, and complex distributed workflows

ruvnet
ruvnet
data-ai
open
data-engineering
31.2K

v3-deep-integration

Deep agentic-flow@alpha integration implementing ADR-001. Eliminates 10,000+ duplicate lines by building claude-flow as specialized extension rather than parallel implementation.

ruvnet
ruvnet
data-ai
open
data-engineering
31.2K

v3-memory-unification

Unify 6+ memory systems into AgentDB with HNSW indexing for 150x-12,500x search improvements. Implements ADR-006 (Unified Memory Service) and ADR-009 (Hybrid Memory Backend).

ruvnet
ruvnet
data-ai
open
data-engineering
29.3K

snowflake-semanticview

Create, alter, and validate Snowflake semantic views using Snowflake CLI (snow). Use when asked to build or troubleshoot semantic views/semantic layer definitions with CREATE/ALTER SEMANTIC VIEW, to validate semantic-view DDL against Snowflake via CLI, or to guide Snowflake CLI installation and connection setup.

github
github
data-ai
open
data-engineering
29.3K

powerbi-modeling

Power BI semantic modeling assistant for building optimized data models. Use when working with Power BI semantic models, creating measures, designing star schemas, configuring relationships, implementing RLS, or optimizing model performance. Triggers on queries about DAX calculations, table relationships, dimension/fact table design, naming conventions, model documentation, cardinality, cross-filter direction, calculation groups, and data model best practices. Always connects to the active model first using power-bi-modeling MCP tools to understand the data structure before providing guidance.

github
github
data-ai
open
data-engineering
29.3K

az-cost-optimize

Analyze Azure resources used in the app (IaC files and/or resources in a target rg) and optimize costs - creating GitHub issues for identified optimizations.

github
github
data-ai
open
data-engineering
29.3K

bigquery-pipeline-audit

Audits Python + BigQuery pipelines for cost safety, idempotency, and production readiness. Returns a structured report with exact patch locations.

github
github
data-ai
open
data-engineering
29.3K

create-specification

Create a new specification file for the solution, optimized for Generative AI consumption.

github
github
data-ai
open
data-engineering
29.3K

dataverse-python-production-code

Generate production-ready Python code using Dataverse SDK with error handling, optimization, and best practices

github
github
data-ai
open
data-engineering
29.3K

dotnet-timezone

.NET timezone handling guidance for C# applications. Use when working with TimeZoneInfo, DateTimeOffset, NodaTime, UTC conversion, daylight saving time, scheduling across timezones, cross-platform Windows/IANA timezone IDs, or when a .NET user needs the timezone for a city, address, region, or country and copy-paste-ready C# code.

github
github
data-ai
open
data-engineering
29.3K

planning-oracle-to-postgres-migration-integration-testing

Creates an integration testing plan for .NET data access artifacts during Oracle-to-PostgreSQL database migrations. Analyzes a single project to identify repositories, DAOs, and service layers that interact with the database, then produces a structured testing plan. Use when planning integration test coverage for a migrated project, identifying which data access methods need tests, or preparing for Oracle-to-PostgreSQL migration validation.

github
github
data-ai
open
data-engineering
24.5K

spacetimedb-cli

SpacetimeDB CLI reference for initializing projects, building modules, publishing databases, querying data, and managing servers

clockworklabs
clockworklabs
data-ai
open
data-engineering
24.5K

spacetimedb-concepts

Understand SpacetimeDB architecture and core concepts. Use when learning SpacetimeDB or making architectural decisions.

clockworklabs
clockworklabs
data-ai
open
data-engineering
24.5K

spacetimedb-csharp

Build C# modules and clients for SpacetimeDB. Covers server-side module development and client SDK integration.

clockworklabs
clockworklabs
data-ai
open
data-engineering
24.5K

spacetimedb-rust

Develop SpacetimeDB server modules in Rust. Use when writing reducers, tables, or module logic.

clockworklabs
clockworklabs
data-ai
open
data-engineering
20.7K

pipeline

Configurable pipeline orchestrator for sequencing stages

Yeachan-Heo
Yeachan-Heo
data-ai
open
data-engineering
18.2K

cdc

Change Data Capture - architecture, entrypoints, bytecode emission, sync engine integration, tests

tursodatabase
tursodatabase
data-ai
open
Previous
Page 4 / 65
Next