apache-spark-optimizer
Analyzes and optimizes Apache Spark jobs for performance, cost, and resource utilization
Implements Change Data Capture patterns for real-time data integration
Designs and optimizes Apache Kafka topics and configurations
Designs and optimizes One Big Table (OBT) patterns
Manages schema evolution and compatibility across data systems
Analyzes and optimizes SQL queries across data warehouse platforms (Snowflake, BigQuery, Redshift, Databricks) with platform-specific recommendations
Designs optimal windowing strategies for stream processing
Arize AI skill for production ML monitoring, embedding drift, and performance analysis
Dataset versioning skill using DVC for tracking data changes, managing data pipelines, and ensuring reproducibility
Data quality validation skill using Great Expectations for schema validation, expectation suites, data documentation, and automated data quality checks in ML pipelines
Kubeflow Pipelines skill for ML workflow orchestration, component management, and Kubernetes-native ML
Sets up cross-platform file system watching with debouncing and efficient change detection
Automated dock appointment scheduling skill with inbound flow optimization and receiving efficiency management
AI-driven warehouse slotting skill to optimize product placement based on velocity, pick frequency, and operational efficiency
Discrete event simulation skill for warehouse design validation and capacity planning
Automated wave planning and pick path optimization skill to maximize warehouse throughput and order accuracy
Drum-Buffer-Rope scheduling skill for constraint-based production pacing with buffer management
Dun & Bradstreet data quality and firmographic enrichment
TAM/SAM/SOM calculation with data source integration (CB Insights, PitchBook, etc.)
Expert FEA skill for aerospace structural analysis workflows
GA4GH standards compliance skill for data sharing and interoperability
Genome in a Bottle benchmark validation skill for pipeline accuracy assessment