home/categories/data-analysis
category focus

Data Analysis

Statistical analysis and data visualization.

1490 スキルall categories
sorting
stars
current ordering strategy
query
all entries
refine the visible subset
data-analysis
93

hypothesis-generation

Structured hypothesis formulation from observations. Use when you have experimental observations or data and need to formulate testable hypotheses with predictions, propose mechanisms, and design experiments to test them. Follows scientific method framework. For open-ended ideation use scientific-brainstorming; for automated LLM-driven hypothesis testing on datasets use hypogenic.

jaechang-hits
jaechang-hits
data-ai
open
data-analysis
93

seaborn-statistical-plots

Statistical visualization library built on matplotlib with native pandas DataFrame support. Automatic aggregation, confidence intervals, and grouping for distribution plots (histplot, kdeplot), categorical comparisons (boxplot, violinplot, stripplot), relational plots (scatterplot, lineplot), regression plots (regplot, lmplot), matrix plots (heatmap, clustermap), and multi-variable grids (pairplot, jointplot, FacetGrid). Use seaborn for statistical summaries with minimal code; use matplotlib for fine-grained figure control; use plotly for interactive HTML output.

jaechang-hits
jaechang-hits
data-ai
open
data-analysis
93

seaborn-statistical-visualization

Statistical visualization built on matplotlib with pandas integration. Distribution plots (histplot, kdeplot, violinplot, boxplot), relational plots (scatterplot, lineplot), categorical comparisons, regression, correlation heatmaps. Automatic aggregation and CI. For interactive plots use plotly; for low-level control use matplotlib.

jaechang-hits
jaechang-hits
data-ai
open
data-analysis
93

networkx-graph-analysis

Graph and network analysis toolkit: create, manipulate, and analyze complex networks. Four graph types (directed, undirected, multi-edge), centrality measures, shortest paths, community detection, graph generators, I/O (GraphML, GML, edge list, pandas, NumPy), visualization with matplotlib. For large-scale graphs (100K+ nodes) use igraph or graph-tool; for graph neural networks use PyG.

jaechang-hits
jaechang-hits
data-ai
open
data-analysis
93

gwas-database

NHGRI-EBI GWAS Catalog REST API for SNP-trait associations from published genome-wide association studies. Query studies, associations, variants, traits, genes, and summary statistics. Build polygenic risk score candidates, analyze variant pleiotropy, download summary statistics for Manhattan plots. No authentication required.

jaechang-hits
jaechang-hits
data-ai
open
data-analysis
93

multiqc-qc-reports

Aggregates QC outputs from 150+ bioinformatics tools into a single interactive HTML report. Scans directories for FastQC, samtools, STAR, HISAT2, Trim Galore, featureCounts, Kallisto, Salmon, Picard, and GATK logs; merges statistics across samples with interactive plots. Essential for NGS pipeline QC review. Use FastQC directly instead for single-sample initial assessment; MultiQC is for multi-sample pipeline-wide reporting.

jaechang-hits
jaechang-hits
data-ai
open
data-analysis
93

matlab-scientific-computing

MATLAB/GNU Octave numerical computing for matrix operations, linear algebra, differential equations, signal processing, optimization, statistics, and scientific visualization. Code examples in MATLAB syntax (runs on both MATLAB and Octave). For Python-based scientific computing use numpy/scipy; for statistical modeling use statsmodels.

jaechang-hits
jaechang-hits
data-ai
open
data-analysis
93

sympy-symbolic-math

Symbolic mathematics in Python: exact algebra, calculus (derivatives, integrals, limits), equation solving, symbolic matrices, differential equations, code generation (lambdify, C/Fortran). Use when exact symbolic results are needed, not numerical approximations. For numerical computing use numpy/scipy; for statistical modeling use statsmodels.

jaechang-hits
jaechang-hits
data-ai
open
data-analysis
92

data-analysis

Conduct exploratory data analysis and statistical testing with test selection guidance. Use when exploring datasets, selecting statistical tests, performing power analysis, or preparing results for publication.

ChicagoHAI
ChicagoHAI
data-ai
open
data-analysis
92

memory

Structured daily and weekly learning memory with dual graph snapshots.

MathClaw-ruc
MathClaw-ruc
data-ai
open
data-analysis
91

calculator

A simple calculator that can add, subtract, multiply, and divide numbers. Use when the user needs to perform basic arithmetic operations.

EXboys
EXboys
data-ai
open
data-analysis
91

data-analysis

Analyze CSV/JSON data with statistics, filtering, and aggregation. Powered by pandas and numpy.

EXboys
EXboys
data-ai
open
data-analysis
90

reasoning-counterfactual

Evaluate alternative scenarios by simulating interventions on past decisions or hypothetical futures. Use when assessing decisions in hindsight, planning scenarios, or comparing paths not taken. Produces comparative analysis with probability-weighted outcomes.

aiskillstore
aiskillstore
data-ai
open
data-analysis
90

d3js-visualization

Professional data visualization creation using D3.js with support for interactive charts, custom visualizations, animations, and responsive design. Use for: (1) Creating custom interactive charts, (2) Building dashboards, (3) Network/graph visualizations, (4) Geographic data mapping, (5) Time series analysis, (6) Real-time data visualization, (7) Complex multi-dimensional data displays

aiskillstore
aiskillstore
data-ai
open
data-analysis
90

working-with-spreadsheets

Creates and edits Excel spreadsheets with formulas, formatting, and financial modeling standards. Use when working with .xlsx files, financial models, data analysis, or formula-heavy spreadsheets. Covers formula recalculation, color coding standards, and common pitfalls.

aiskillstore
aiskillstore
data-ai
open
data-analysis
90

desmos-graphing

Create interactive Desmos graphs in Obsidian using desmos-graph code blocks. Use when visualizing functions, parametric curves, inequalities, or mathematical relationships with customizable styling and settings.

aiskillstore
aiskillstore
data-ai
open
data-analysis
89

narwhals

Effectively use Narwhals to write dataframe-agnostic code that works seamlessly across multiple Python dataframe libraries. Write correct type annotations for code using Narwhals.

anam-org
anam-org
data-ai
open
data-analysis
89

daily-report-generator

Automatically generate daily construction reports from field data, worker inputs, weather, and progress photos. Creates professional PDF reports.

datadrivenconstruction
datadrivenconstruction
data-ai
open
data-analysis
89

cwicr-report-generator

Generate professional cost estimation reports from CWICR calculations. HTML, PDF, Excel outputs with charts and breakdowns.

datadrivenconstruction
datadrivenconstruction
data-ai
open
data-analysis
89

ifc-qto-extraction

Extract quantities from IFC/Revit models for quantity takeoff. Uses DDC converters to get element counts, areas, volumes, lengths with grouping and reporting.

datadrivenconstruction
datadrivenconstruction
data-ai
open
data-analysis
89

labor-rate

Calculate construction labor rates with overhead, benefits, and productivity factors. Regional rate databases and crew composition.

datadrivenconstruction
datadrivenconstruction
data-ai
open
data-analysis
89

cattoken-report

Generate detailed token usage report with threshold analysis and recommendations

cowwoc
cowwoc
data-ai
open
data-analysis
89

pdf-report-generator

Automatically generate PDF reports from construction data. Create formatted project reports with charts and tables.

datadrivenconstruction
datadrivenconstruction
data-ai
open
data-analysis
89

data-visualization

Create visualizations for construction data. Generate charts, graphs, heatmaps, and interactive dashboards using Matplotlib, Seaborn, and Plotly for project analysis and reporting.

datadrivenconstruction
datadrivenconstruction
data-ai
open
Previous
Page 37 / 63
Next