إعلان[ For iPhone / iPad ] 🔥 خلفيات تفاعلية مذهلة من Nugget. لم ترَ شيئًا كهذا من قبل!

skills.homescapability registry البحث

home/categories/data-ai

domain cluster

Data & AI

Machine learning, LLMs, and data processing.

9743 مهارةall categories

sorting

stars

current ordering strategy

query

all entries

refine the visible subset

machine-learning

2

convex-development

Apply Convex database best practices for cost optimization, performance, security, and architecture. Use when: building Convex backends, optimizing queries, handling embeddings/vector search, reviewing Convex code, designing schemas, planning migrations, or discussing Convex architecture. Keywords: Convex, real-time database, queries, mutations, actions, indexes, pagination, vector search, embeddings, schema, migrations, ctx.auth, convex-helpers, bandwidth.

phrazzld

data-ai

data-engineering

2

duck-agent

DuckDB file discovery agent with verified absolute paths

plurigrid

data-ai

data-engineering

2

oracle

Use the @steipete/oracle CLI to bundle a prompt plus the right files and get a second-model review (API or browser) for debugging, refactors, design checks, or cross-validation.

LarsEckart

data-ai

data-engineering

2

pulse-mcp-stream

Layer 1 Real-Time Social Stream Monitoring via MCP with DuckDB persistence

plurigrid

data-ai

data-engineeringmarketplace

2

adapter-assistant

Complete adapter lifecycle assistant for LimaCharlie. Supports External Adapters (cloud-managed), Cloud Sensors (SaaS/cloud integrations), and On-prem USP adapters. Dynamically researches adapter types from local docs and GitHub usp-adapters repo. Creates, validates, deploys, and troubleshoots adapter configurations. Handles parsing rules (Grok, regex), field mappings, credential setup, and multi-adapter configs. Use when setting up new data sources (Okta, S3, Azure Event Hub, syslog, webhook, etc.), troubleshooting ingestion issues, or managing adapter deployments.

$refractionPOINT$

refractionPOINT

data-ai

data-engineering

2

scalardb-sizing-estimator

ScalarDB Cluster および ScalarDB Analytics のアーキテクチャ、サイジング、構成を見積もるスキル。性能要件、可用性要件、クラウド環境からScalarDB Cluster Pod数、Kubernetes構成、バックエンドDB、API Gateway、監視システム等の全体構成を見積もる。 ScalarDB Analyticsを使用する場合はEMR/Databricksのサイジングも含む。使用タイミング: - 「ScalarDBのサイジングを見積もりたい」「ScalarDB環境を構築したい」 - 「ScalarDB Clusterの構成を決めたい」「ScalarDBの費用を算出したい」 - 「開発/テスト/ステージング/本番環境のScalarDB構成」 - CI/CD、Blue/Green、Canary Deploymentを含む本番環境設計 - 「ScalarDB Analyticsを使いたい」「分析クエリ環境を構築したい」 - 「EMR/Databricksのサイジングを見積もりたい」出力: Markdown形式の見積もり結果 + HTML形式のレポート費用: USD/JPY両建て（為替レート明記）

wfukatsu

data-ai

data-engineering

2

airflow

Airflow DAG patterns, KubernetesPodOperator, and debugging. Use on 'dag', 'airflow', 'task', 'operator', 'KPO', 'scheduler', 'XCom'.

pypeaday

data-ai

data-engineering

2

dst-data

Fetch actual data from Danmarks Statistik API and store in DuckDB. Use when user wants to download and store specific DST table data for analysis.

mikkelkrogsholm

data-ai

data-engineering

2

spark-basics

PySpark fundamentals for distributed data processing.

timequity

data-ai

data-engineering

2

anonymise

Anonymise CSV files by removing personal identifying information and adding datetime stamps. Use when user wants to process a new CSV file or strip PII from data.

sofer

data-ai

data-engineering

2

add-dlt-data-source

Scaffold new DLT pipeline for data ingestion to MotherDuck

nf-core

data-ai

data-engineering

2

cobol-migration-analyzer

Analyzes legacy COBOL programs and JCL jobs to assist with migration to modern Java applications. Extracts business logic, identifies dependencies, generates migration reports, and creates Java implementation strategies. Use when working with mainframe migration, COBOL analysis, legacy system modernization, JCL workflows, or when users mention COBOL to Java conversion, analyzing .cbl/.CBL/.cob files, working with copybooks, or planning Java service implementations from COBOL programs.

DauQuangThanh

data-ai

data-engineering

2

build-graph

GraphDB構築エージェント - ユビキタス言語とコード解析結果からRyuGraphデータベースを構築。/build-graph [対象パス] で呼び出し。

wfukatsu

data-ai

data-engineering

2

entropy-sequencer

Layer 5 Interaction Interleaving for Maximum Information Gain with DuckDB

plurigrid

data-ai

data-engineering

2

reduce-orchestrator

MapReduce root/orchestrator with a mandatory parallel Verify phase, narrative-first reduction, deterministic artifact lifecycle management (.rlm run/archives), and concurrency safety (per-run locks + cleanup lock). Use when coordinating many parallel map-worker tasks under optional hint_paths, then synthesizing narrative reports into a decision to iterate or finish.

hyophyop

data-ai

data-engineering

2

spring-kafka-integration

[Extends backend-developer] Kafka specialist for Spring/Reactor. Use for Kafka producers/consumers, DLT, retry mechanisms, transactional outbox, event sourcing. Covers Spring Kafka 4.x and Reactor Kafka 1.3.x. Invoke alongside backend-developer.

olehsvyrydov

data-ai

data-engineering

2

koan-performance

Streaming, pagination, count strategies, bulk operations

sylin-org

data-ai

data-engineering

2

say-ducklake-xor

Parallel thread/DuckLake discovery with XOR uniqueness from gay_seed. Finds "say" or MCP usage, cross-refs with all DuckDB sources, launches bounded parallel ops.

plurigrid

data-ai

data-engineering

2

lcp-execplan

Create and maintain ExecPlans for complex work (design-to-implementation) following the repo's ExecPlan standard.

YusukeShimizu

data-ai

data-engineering

2

golden-dataset-validation

Validation rules, schema checks, duplicate detection, and coverage analysis for golden dataset integrity

yonatangross

data-ai

data-engineering

2

parquet-optimization

Proactively analyzes Parquet file operations and suggests optimization improvements for compression, encoding, row group sizing, and statistics. Activates when users are reading or writing Parquet files or discussing Parquet performance.

EmilLindfors

data-ai

data-engineering

2

golden-dataset-management

Backup, restore, and validate golden datasets for AI/ML systems - ensuring test data integrity and preventing catastrophic data loss

yonatangross

data-ai

data-engineering

2

bluesky-jetstream

Bluesky Jetstream Firehose Skill

plurigrid

data-ai

data-engineering

2

amp-api-awareness

Extract hidden Amp API patterns from local thread data via DuckDB analysis

plurigrid

data-ai

Page 273 / 406