domain cluster

Data & AI

Machine learning, LLMs, and data processing.

9743 skills
machine-learning
2

convex-development

Apply Convex database best practices for cost optimization, performance, security, and architecture. Use when: building Convex backends, optimizing queries, handling embeddings/vector search, reviewing Convex code, designing schemas, planning migrations, or discussing Convex architecture. Keywords: Convex, real-time database, queries, mutations, actions, indexes, pagination, vector search, embeddings, schema, migrations, ctx.auth, convex-helpers, bandwidth.

phrazzld
data-ai
open
data-engineering
2

duck-agent

DuckDB file discovery agent with verified absolute paths

plurigrid
data-ai
open
data-engineering
2

oracle

Use the @steipete/oracle CLI to bundle a prompt plus the right files and get a second-model review (API or browser) for debugging, refactors, design checks, or cross-validation.

LarsEckart
data-ai
open
data-engineering
2

pulse-mcp-stream

Layer 1 Real-Time Social Stream Monitoring via MCP with DuckDB persistence

plurigrid
data-ai
open
data-engineering
marketplace
2

adapter-assistant

Complete adapter lifecycle assistant for LimaCharlie. Supports External Adapters (cloud-managed), Cloud Sensors (SaaS/cloud integrations), and On-prem USP adapters. Dynamically researches adapter types from local docs and GitHub usp-adapters repo. Creates, validates, deploys, and troubleshoots adapter configurations. Handles parsing rules (Grok, regex), field mappings, credential setup, and multi-adapter configs. Use when setting up new data sources (Okta, S3, Azure Event Hub, syslog, webhook, etc.), troubleshooting ingestion issues, or managing adapter deployments.

refractionPOINT
data-ai
open
data-engineering
2

scalardb-sizing-estimator

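Skill for estimating the architecture, sizing, and configuration of ScalarDB Cluster and ScalarDB Analytics. From performance requirements, availability requirements, and the cloud environment, it estimates the overall configuration, including the number of ScalarDB Cluster pods, the Kubernetes configuration, backend databases, API gateway, and monitoring systems. When ScalarDB Analytics is used, it also covers EMR/Databricks sizing. Use when: - "I want to estimate ScalarDB sizing", "I want to build a ScalarDB environment" - "I want to decide on a ScalarDB Cluster configuration", "I want to calculate ScalarDB costs" - "ScalarDB configuration for development/test/staging/production environments" - Production environment design including CI/CD, Blue/Green, and Canary Deployment - "I want to use ScalarDB Analytics", "I want to build an analytical query environment" - "I want to estimate EMR/Databricks sizing" Output: estimate results in Markdown format plus a report in HTML format. Costs: quoted in both USD and JPY (exchange rate stated).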

wfukatsu
data-ai
open
data-engineering
2

airflow

Airflow DAG patterns, KubernetesPodOperator, and debugging. Use on 'dag', 'airflow', 'task', 'operator', 'KPO', 'scheduler', 'XCom'.

pypeaday
data-ai
open
data-engineering
2

dst-data

Fetch actual data from Danmarks Statistik API and store in DuckDB. Use when user wants to download and store specific DST table data for analysis.

mikkelkrogsholm
data-ai
open
data-engineering
2

spark-basics

PySpark fundamentals for distributed data processing.

timequity
data-ai
open
data-engineering
2

anonymise

Anonymise CSV files by removing personal identifying information and adding datetime stamps. Use when user wants to process a new CSV file or strip PII from data.

sofer
data-ai
open
data-engineering
2

cobol-migration-analyzer

Analyzes legacy COBOL programs and JCL jobs to assist with migration to modern Java applications. Extracts business logic, identifies dependencies, generates migration reports, and creates Java implementation strategies. Use when working with mainframe migration, COBOL analysis, legacy system modernization, JCL workflows, or when users mention COBOL to Java conversion, analyzing .cbl/.CBL/.cob files, working with copybooks, or planning Java service implementations from COBOL programs.

DauQuangThanh
data-ai
open
data-engineering
2

build-graph

GraphDB construction agent - builds a RyuGraph database from the ubiquitous language and code analysis results. Invoke with /build-graph [target path].

wfukatsu
data-ai
open
data-engineering
2

entropy-sequencer

Layer 5 Interaction Interleaving for Maximum Information Gain with DuckDB

plurigrid
data-ai
open
data-engineering
2

reduce-orchestrator

MapReduce root/orchestrator with a mandatory parallel Verify phase, narrative-first reduction, deterministic artifact lifecycle management (.rlm run/archives), and concurrency safety (per-run locks + cleanup lock). Use when coordinating many parallel map-worker tasks under optional hint_paths, then synthesizing narrative reports into a decision to iterate or finish.

hyophyop
data-ai
open
data-engineering
2

spring-kafka-integration

[Extends backend-developer] Kafka specialist for Spring/Reactor. Use for Kafka producers/consumers, DLT, retry mechanisms, transactional outbox, event sourcing. Covers Spring Kafka 4.x and Reactor Kafka 1.3.x. Invoke alongside backend-developer.

olehsvyrydov
data-ai
open
data-engineering
2

koan-performance

Streaming, pagination, count strategies, bulk operations

sylin-org
data-ai
open
data-engineering
2

say-ducklake-xor

Parallel thread/DuckLake discovery with XOR uniqueness from gay_seed. Finds "say" or MCP usage, cross-refs with all DuckDB sources, launches bounded parallel ops.

plurigrid
data-ai
open
data-engineering
2

lcp-execplan

Create and maintain ExecPlans for complex work (design-to-implementation) following the repo's ExecPlan standard.

YusukeShimizu
data-ai
open
data-engineering
2

golden-dataset-validation

Validation rules, schema checks, duplicate detection, and coverage analysis for golden dataset integrity

yonatangross
data-ai
open
data-engineering
2

parquet-optimization

Proactively analyzes Parquet file operations and suggests optimization improvements for compression, encoding, row group sizing, and statistics. Activates when users are reading or writing Parquet files or discussing Parquet performance.

EmilLindfors
data-ai
open
data-engineering
2

golden-dataset-management

Backup, restore, and validate golden datasets for AI/ML systems - ensuring test data integrity and preventing catastrophic data loss

yonatangross
data-ai
open
data-engineering
2

amp-api-awareness

Extract hidden Amp API patterns from local thread data via DuckDB analysis

plurigrid
data-ai
open