data-catalog-enricher
Enriches data catalog entries with automated metadata
NVIDIA Collective Communications Library integration for multi-GPU operations. Initialize NCCL communicators, execute collective operations, configure communication topologies, profile collective performance, and support RCCL for AMD compatibility.
High-performance kernel template libraries and DSLs. Generate CUTLASS GEMM configurations, implement Triton kernel definitions, configure epilogue operations, tune tile sizes and warp arrangements, and benchmark against cuBLAS.
Deep integration with the NVIDIA CUDA toolkit for kernel development, compilation, and debugging. Execute nvcc compilation with analysis of optimization flags, generate and validate CUDA kernel code, analyze PTX/SASS assembly output, and configure execution parameters.
Cross-vendor OpenCL runtime management and kernel development. Query platforms/devices, generate portable OpenCL C kernel code, handle vendor-specific extensions, manage contexts and command queues, compile and cache programs.
Expert skill for CUDA Graph capture and optimization for reduced launch overhead. Capture CUDA operations into graphs, instantiate and execute graph instances, update graph node parameters, profile graph vs stream execution, design graph-friendly kernel patterns, and optimize launch latency for inference.
Expert integration with NVIDIA GPU-accelerated math libraries. Configure cuBLAS tensor core operations, generate cuBLAS GEMM calls, integrate cuDNN layers, handle algorithm selection, and support mixed-precision operations.
Exploit development automation using the pwntools framework
Console optimization skill for memory constraints and technical certification requirements (TCRs).
Expert skill for FPGA place and route optimization and physical implementation
Vendor IP core configuration and integration expertise for FPGA designs
Expert skill for C/C++ to RTL conversion using High-Level Synthesis tools
Expert skill for AMBA AXI protocol implementation and verification in FPGA designs
Embedded wireless protocol implementation (LoRa, Zigbee, Thread, Matter)
Over-the-air (OTA) firmware update implementation for embedded systems. Covers delta updates, image signing, update protocols, partition management, and rollback mechanisms.
Motor control algorithms and driver implementation
Cambridge Quantum (Quantinuum) t|ket> compiler skill for platform-independent circuit optimization
Quantum error correction code implementation skill for fault-tolerant quantum computing
Google Cirq integration skill for quantum circuit design and execution on Google quantum processors
Quantum circuit optimization skill for gate reduction, depth minimization, and hardware-aware compilation
Define specifications for program synthesis using various paradigms
SSA-form transformations and optimizations for compiler development