domain cluster

Data & AI

Machine learning, LLMs, and data processing.

9743 skills

machine-learning
3.1K

kaggle-learner

This skill should be used when the user asks to "learn from Kaggle", "study Kaggle solutions", "analyze Kaggle competitions", or mentions Kaggle competition URLs. Provides access to extracted knowledge from winning Kaggle solutions across NLP, CV, time series, tabular, and multimodal domains.

Galaxy-Dawn
data-ai
open
machine-learning
3.1K

debug

Diagnose training issues with Tinker — slow steps, hanging sessions, output mismatches, error messages, renderer problems, and deployment issues. Use this skill whenever a user reports that training is slow, steps take too long, sessions are hanging, model outputs differ between Tinker and external engines (vLLM, SGLang), they get a confusing error message, training quality is poor (high KL, bad outputs), or they suspect something is wrong. Also trigger when users ask "is this a Tinker issue or my issue?", "is Tinker down?", report unexpected wait times, see output quality regressions, get opaque errors, or want to profile/debug their training or deployment pipeline. This skill walks through systematic triage to determine root cause.

thinking-machines-lab
data-ai
open
data-engineering
3K

muapi-seedance-2

Expert Cinema Director skill for Seedance 2.0 (ByteDance) — high-fidelity video generation using technical camera grammar and multimodal references. Supports text-to-video, image-to-video, video extension, beat-matching, dialogue, and e-commerce patterns.

SamurAIGPT
data-ai
open
data-analysis
2.7K

xlsx

Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. Use when Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc.) for: (1) creating new spreadsheets with formulas and formatting, (2) reading or analyzing data, (3) modifying existing spreadsheets while preserving formulas, (4) data analysis and visualization in spreadsheets, or (5) recalculating formulas.

davepoon
data-ai
open
data-analysis
2.7K

used-car-price-search

Compares major Korean car rental companies, then looks up used-car prices and buyout prices from the SK Rent-a-Car Direct 타고BUY inventory snapshot.

NomaDamas
data-ai
open
data-engineering
2.7K

aiox-data-engineer

Database Architect & Operations Engineer (Dara). Use for database design, schema architecture, Supabase configuration, RLS policies, migrations, query optimization, data modelin...

SynkraAI
data-ai
open
data-engineering
2.7K

chdb-datastore

Drop-in pandas replacement with ClickHouse performance. Use `import chdb.datastore as pd` (or `from datastore import DataStore`) and write standard pandas code — same API, 10-100x faster on large datasets. Supports 16+ data sources (MySQL, PostgreSQL, S3, MongoDB, ClickHouse, Iceberg, Delta Lake, etc.) and 10+ file formats (Parquet, CSV, JSON, Arrow, ORC, etc.) with cross-source joins. Use this skill when the user wants to analyze data with pandas-style syntax, speed up slow pandas code, query remote databases or cloud storage as DataFrames, or join data across different sources — even if they don't explicitly mention chdb or DataStore. Do NOT use for raw SQL queries, ClickHouse server administration, or non-Python languages.

chdb-io
data-ai
open
data-engineering
2.7K

dataset-annotation

AI-assisted dataset annotation with COCO export — bbox, SAM2, DINOv3 methods

SharpAI
data-ai
open
data-engineering
2.7K

annotation-data

Dataset annotation management — COCO labels, sequences, export, and Kaggle upload

SharpAI
data-ai
open
machine-learning
2.7K

model-training

Agent-driven YOLO fine-tuning — annotate, train, export, deploy

SharpAI
data-ai
open
data-analysis
2.6K

research

Researches a topic by breaking it into subtopics, gathering factual information with reasoning, and producing a structured summary with key findings and open questions. Use when the user asks to research, investigate, look up, summarize a topic, or says 'what is known about...' or 'learn about...'

open-gitagent
data-ai
open
data-engineering
2.5K

slackdump

Collects Slack conversation data from the Slackdump Archive format.

rusq
data-ai
open
data-engineering
2.5K

jazz-schema-design

Design and implement collaborative data schemas using the Jazz framework. Use this skill when building or working with Jazz apps to define data structures using CoValues. This skill focuses exclusively on schema definition and data modeling logic.

garden-co
data-ai
open
machine-learning
2.5K

skill-prd

AI-optimized PRD creation with 100-point scoring framework

nyldn
data-ai
open
data-engineering
2.5K

yaml-pipeline-transfer

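YAML pipeline conversion guide, covering bidirectional conversion between YAML and Model, PAC (Pipeline as Code) implementation, template references, and trigger configuration. Use when the user needs to parse YAML pipelines, implement the PAC pattern, handle pipeline templates, or validate YAML syntax.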

TencentBlueKing
data-ai
open
machine-learning
2.5K

pipeline-model-architecture

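Detailed guide to the BK-CI pipeline core model (Model) architecture, covering the four-layer Pipeline/Stage/Container/Task structure, model serialization, version management, and model validation. Use when the user needs to understand pipeline data structures, develop pipeline features, handle model conversion, or extend the model.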

TencentBlueKing
data-ai
open
data-engineering
2.5K

23-database-sharding

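Database sharding guide, covering sharding strategy design, shard key selection, cross-shard queries, data migration, and shard routing rules. Use when the user is designing database sharding, choosing a shard key, handling cross-shard queries, or migrating sharded data.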

TencentBlueKing
data-ai
open
data-engineering
2.5K

22-yaml-pipeline-transfer

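YAML pipeline conversion guide, covering bidirectional conversion between YAML and Model, PAC (Pipeline as Code) implementation, template references, and trigger configuration. Use when the user needs to parse YAML pipelines, implement the PAC pattern, handle pipeline templates, or validate YAML syntax.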

TencentBlueKing
data-ai
open
llm-ai
2.5K

43-agent-module-architecture

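Architecture guide for the Agent build-machine module (Go), covering the Agent startup flow, heartbeat mechanism, task claiming and execution, upgrades, and interaction with Dispatch. Use when the user is developing Agent features, modifying heartbeat logic, handling task execution, or implementing Agent upgrades.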

TencentBlueKing
data-ai
open
llm-ai
2.5K

05-go-agent-development

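Go Agent development guide, covering Agent architecture design, the heartbeat mechanism, task execution, log reporting, the upgrade flow, and interaction with the Dispatch module. Use when the user is developing a build-machine Agent, implementing task execution logic, handling Agent communication, or doing Go development.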

TencentBlueKing
data-ai
open
llm-ai
2.5K

48-skill-writer

Guides users in creating Agent Skills for CodeBuddy. Use when the user wants to create, write, or design a new Skill, or needs help writing a SKILL.md file, frontmatter, or skill structure.

TencentBlueKing
data-ai
open
machine-learning
2.5K

28-pipeline-model-architecture

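Detailed guide to the BK-CI pipeline core model (Model) architecture, covering the four-layer Pipeline/Stage/Container/Task structure, model serialization, version management, and model validation. Use when the user needs to understand pipeline data structures, develop pipeline features, handle model conversion, or extend the model.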

TencentBlueKing
data-ai
open
machine-learning
2.5K

mf

All-in-one Module Federation skill. Use when the user asks anything about MF — concepts, configuration, runtime API, shared dependencies, type errors, runtime error code troubleshooting, slow builds, Bridge integration, or adding MF to an existing project.

module-federation
data-ai
open