By pymc-labs
Build, test, and compare Bayesian models with PyMC 6+ and ArviZ, including prior selection, spline regression, model comparison via LOO-CV and Bayes factors, and unit testing with pytest. Also create and convert marimo reactive notebooks for exploratory data analysis.
ALWAYS use when: creating/editing marimo notebooks, working with any .py file containing @app.cell decorators, building reactive Python notebooks, doing exploratory data analysis in notebook form, converting Jupyter (.ipynb) to marimo, or when user mentions "marimo", "reactive notebook", or asks for an interactive Python notebook. Covers marimo CLI (edit, run, convert, export), UI components (mo.ui.*), layout functions, SQL integration, caching, state management, and wigglystuff widgets. If a task involves notebooks and Python, invoke this skill first.
Load when the user is comparing Bayesian models, computing LOO-CV / ELPD, calling arviz_stats.loo or arviz_stats.compare, doing model stacking/averaging, or computing Bayes factors. Covers the ArviZ 1.1 LOO/ELPD/stacking APIs exclusively (no waic). Triggers include: model comparison, LOO, ELPD, compare, loo_expectations, loo_metrics, loo_r2, Pareto k, stacking, Bayes factor, cross-validation, predictive accuracy, information criterion.
Load when the user is choosing priors, running prior predictive checks, calling find_constrained_prior, using PreliZ, or otherwise eliciting domain knowledge into a Bayesian model. Covers weakly informative priors, constrained priors, sensitivity analysis, and elicitation workflows. Triggers include: prior selection, elicitation, find_constrained_prior, PreliZ, prior predictive, expert/informative priors, weakly informative priors, constrained priors.
Load when the user is working with pymc-extras (pmx) features: splines / BSplineBasis, distributional regression / GAMLSS, R2D2M2CP or horseshoe priors, discrete variable marginalization, or Laplace approximation via fit_laplace. Triggers include: pymc_extras, pymc-extras, pmx, splines, BSplineBasis, distributional regression, GAMLSS, R2D2, horseshoe (regularized/Finnish), marginalize, fit_laplace, penalized splines.
Load whenever the user is working on code that imports pymc, pytensor, or arviz, or asks about Bayesian modeling, MCMC, priors, posteriors, sampling, or model diagnostics. Covers PyMC 6+, PyTensor 3+, ArviZ 1.1+ (DataTree API), pymc-bart, pymc-extras, nutpie, and JAX/NumPyro backends. Use for building probabilistic models, specifying priors, running MCMC, diagnosing convergence, or comparing models. Triggers include: Bayesian inference, posterior sampling, hierarchical/multilevel models, GLMs, time series, Gaussian processes, HSGP, BART, mixture models, prior/posterior predictive checks, MCMC diagnostics, LOO-CV, model comparison, causal inference with do/observe, and any PyTensor Op or graph work.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
A plugin for Claude Code and other AI coding platforms providing Agent Skills for Bayesian modeling and reactive Python notebooks. Packages specialized knowledge for PyMC and marimo into skills that Claude loads on demand.
| Skill | Description |
|---|---|
| pymc-modeling | Bayesian statistical modeling with PyMC 6+, PyTensor 3+, and ArviZ 1.1+. Covers model specification, inference, diagnostics, hierarchical models, GLMs, GPs/HSGPs, BART, time series, mixtures, causal models, priors, and custom likelihoods. |
| prior-elicitation | Prior selection, prior predictive checks, constrained priors, PreliZ workflows, expert priors, weakly informative priors, and prior sensitivity analysis. |
| pymc-extras | pymc-extras guidance for splines, distributional regression, R2D2/horseshoe priors, marginalization, and Laplace approximation. |
| pymc-testing | Testing PyMC models with pytest. Covers pymc.testing.mock_sample, fixtures, structure-only tests, and slow posterior inference tests. |
| model-evaluation | Bayesian model comparison and predictive evaluation with ArviZ 1.1 LOO/ELPD, stacking, model averaging, Bayes factors, and Pareto-k diagnostics. |
| marimo-notebook | Reactive Python notebooks with marimo. Covers CLI usage, @app.cell notebooks, UI components, layout, SQL integration, caching, state management, notebook conversion, templates, and wigglystuff widgets. |
npx skills add pymc-labs/python-analytics-skills
One command, works with Claude Code, Cursor, Gemini CLI, and 15+ other agents.
Two-step process using Claude Code slash commands:
/plugin marketplace add pymc-labs/python-analytics-skills
/plugin install analytics@pymc-labs-python-analytics-skills
Installs all skills plus the keyword-suggestion hook. Supports /plugin update for future updates.
Oh My Pi can install this repository as a marketplace plugin using the Claude-compatible .claude-plugin/marketplace.json catalog:
/marketplace add pymc-labs/python-analytics-skills
/marketplace install analytics@python-analytics-skills
CLI equivalent:
omp plugin marketplace add pymc-labs/python-analytics-skills
omp plugin install analytics@python-analytics-skills
For a project-local install, add --scope project to the install command.
git clone https://github.com/pymc-labs/python-analytics-skills.git
cd python-analytics-skills
./install.sh claude # Claude Code
./install.sh all # All platforms
./install.sh claude -- pymc-modeling # Specific skill only
# List available skills with descriptions
./install.sh --list
# Validate skill structure
./install.sh --validate
./scripts/validate-skills.sh
The repo includes a PyMC skill benchmark in benchmark/pymc-modeling/. It compares Claude outputs with and without skills/pymc-modeling/SKILL.md injected via --append-system-prompt.
Benchmark contents:
tasks.yaml — five Bayesian modeling taskssrc/ — runner, scorer, analysis, and CLI modulestests/ — pytest coverage for the benchmark harnessdata/ — small input datasets used by tasksscripts/prepare_data.py — deterministic data preparationpixi.toml / pixi.lock — benchmark environmentRun benchmark commands from the benchmark directory:
cd benchmark/pymc-modeling
pixi run validate
pixi run test
pixi run run-all
pixi run score-all
pixi run analyze
Generated benchmark outputs such as results/, .nc files, figures, and archived runs are not package source.
| Platform | Install Location | Auto-Discovered |
|---|---|---|
| Claude Code | ~/.claude/skills/ | Yes |
| OpenCode | ~/.config/opencode/skills/ | Yes |
| Gemini CLI | ~/.gemini/skills/ | Yes |
| Cursor | ~/.cursor/skills/ | Yes |
| VS Code Copilot | ~/.copilot/skills/ | Yes |
npx claudepluginhub pymc-labs/python-analytics-skills --plugin analyticsPyMC 6+ / PyTensor 3+ / ArviZ 1.0+ Bayesian modeling skills, agents, hooks, and tools for Claude Code
Comprehensive model evaluation with multiple metrics
Evaluate and compare ML model performance metrics
Self-documenting, self-improving framework for analytical repositories
Editorial "Business Analyst" bundle for Claude Code from Antigravity Awesome Skills.
Multi-agent workflow framework for building, testing, and shipping statistical software packages
Autonomous research loops with 10 commands. Generalizes Karpathy's autoresearch loop to any domain with mechanical evaluation, overnight persistence, and zero dependencies.