python-data-engineering

Comprehensive Python data engineering patterns for AWS Data Lake, including PySpark, Pandas, Apache Airflow, AWS Glue, ETL pipelines, data quality, schema management, performance optimization, FastAPI services, streaming with Kafka/Kinesis, data validation with Great Expectations, testing strategies, error handling, logging, and production deployment on AWS EMR and Glue.

Ver código fuente data-engineering

maintainer

b3-competition

Actualizado 11/4/2025

Estrellas

Forks

quick start

Installation and usage

Instalación

$ install --globalskills.sh

Uso

Después de instalarlo, puedes usar este skill ejecutando el siguiente comando en tu terminal:

skills use python-data-engineering