Data Engineering

ETL pipelines and big data infrastructure.

1541 skills
data-engineering
1K

bigquery-view-generator

Auto-activating skill in the GCP Skills category. Use when working with BigQuery view generator functionality. Trigger with phrases like "bigquery view generator", "bigquery generator", or "bigquery".

jeremylongshore
data-ai
data-engineering
1K

pivot-table-creator

Auto-activating skill in the Data Analytics category. Use when working with pivot table creator functionality. Trigger with phrases like "pivot table creator", "pivot creator", or "pivot".

jeremylongshore
data-ai
data-engineering
1K

cte-query-builder

CTE Query Builder: auto-activating skill in the Data Analytics category. Triggers on "cte query builder".

jeremylongshore
data-ai
data-engineering
1K

window-function-generator

Window Function Generator: auto-activating skill in the Data Analytics category. Triggers on "window function generator".

jeremylongshore
data-ai
data-engineering
1K

validating-database-integrity

Use when you need to ensure database integrity through comprehensive data validation. This skill validates data types, ranges, formats, referential integrity, and business rules. Trigger with phrases like "validate database data", "implement data validation rules", "enforce data integrity constraints", or "validate data formats".

jeremylongshore
data-ai
data-engineering
1K

schema-validator

Auto-activating skill in the Data Pipelines category. Use when working with schema validator functionality. Trigger with phrases like "schema validator" or "schema".

jeremylongshore
data-ai
data-engineering
1K

firecrawl-data-handling

Implement FireCrawl PII handling, data retention, and GDPR/CCPA compliance patterns. Use when handling sensitive data, implementing data redaction, configuring retention policies, or ensuring compliance with privacy regulations for FireCrawl integrations. Trigger with phrases like "firecrawl data", "firecrawl PII", "firecrawl GDPR", "firecrawl data retention", "firecrawl privacy", "firecrawl CCPA".

jeremylongshore
data-ai
data-engineering
1K

fairdb-backup-manager

Automatically manages PostgreSQL backups with pgBackRest and Wasabi S3 storage when working with FairDB databases.

jeremylongshore
data-ai
data-engineering
1K

archiving-databases

This skill automates database archival processes. It helps reduce primary database size by moving historical records to archive tables or cold storage solutions like S3, Azure Blob, or GCS. The plugin supports PostgreSQL and MySQL, implementing automated retention policies, compression, compliance tracking, and zero-downtime migration. Use this when the user mentions "database archival", "archive old database records", "retention policies", "cold storage", or "reduce database size." It is particularly useful for handling requests related to data lifecycle management and compliance requirements in database systems.

jeremylongshore
data-ai
data-engineering
1K

kafka-stream-processor

Auto-activating skill in the Data Pipelines category. Use when working with Kafka stream processor functionality. Trigger with phrases like "kafka stream processor", "kafka processor", or "kafka".

jeremylongshore
data-ai
data-engineering
1K

kafka-producer-consumer

Auto-activating skill in the Backend Development category. Use when working with Kafka producer and consumer functionality. Trigger with phrases like "kafka producer consumer", "kafka consumer", or "kafka".

jeremylongshore
data-ai
data-engineering
1K

kafka-stream-processor

Kafka Stream Processor: auto-activating skill in the Data Pipelines category. Triggers on "kafka stream processor".

jeremylongshore
data-ai
data-engineering
1K

preprocessing-data-with-automated-pipelines

This skill empowers Claude to preprocess and clean data using automated pipelines. It is designed to streamline data preparation for machine learning tasks, implementing best practices for data validation, transformation, and error handling. Claude should use this skill when the user requests data preprocessing, data cleaning, ETL tasks, or mentions the need for automated pipelines for data preparation. Trigger terms include "preprocess data", "clean data", "ETL pipeline", "data transformation", and "data validation". The skill ensures data quality and prepares it for effective analysis and model training.

jeremylongshore
data-ai
data-engineering
1K

pyspark-transformer

PySpark Transformer: auto-activating skill in the Data Pipelines category. Triggers on "pyspark transformer".

jeremylongshore
data-ai
data-engineering
1K

dbt-model-generator

Auto-activating skill in the Data Pipelines category. Use when working with dbt model generator functionality. Trigger with phrases like "dbt model generator", "dbt generator", or "dbt".

jeremylongshore
data-ai
data-engineering
1K

pyspark-transformer

Auto-activating skill in the Data Pipelines category. Use when working with PySpark transformer functionality. Trigger with phrases like "pyspark transformer" or "pyspark".

jeremylongshore
data-ai
data-engineering
1K

fairdb-backup-manager

Automatically manages PostgreSQL backups with pgBackRest and Wasabi S3 storage when working with FairDB databases. Activates when you request "fairdb backup manager" functionality.

jeremylongshore
data-ai
data-engineering
1K

compression-optimizer

Compression Optimizer: auto-activating skill in the Data Pipelines category. Triggers on "compression optimizer".

jeremylongshore
data-ai
data-engineering
1K

schema-validator

Schema Validator: auto-activating skill in the Data Pipelines category. Triggers on "schema validator".

jeremylongshore
data-ai