From devops-practices
Metrics, logs, traces (observability); choosing what to measure, dashboards, and incident response.
How this skill is triggered — by the user, by Claude, or both
Slash command
/devops-practices:monitoring-instrumentationThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Observing systems to understand behavior and detect problems.
Observing systems to understand behavior and detect problems.
You are planning observability. Measure what matters; enable debugging.
npx claudepluginhub sethdford/claude-skills --plugin engineer-devops-practicesProvides observability patterns for metrics, logging, tracing, alerting, dashboards, and infrastructure monitoring in production systems with Prometheus, Grafana, OpenTelemetry.
Designs production-grade monitoring, logging, and tracing systems with SLI/SLO management, alerting, and incident response workflows.
Design monitoring and alerting that catches production issues fast without creating alert fatigue. Use when establishing observability or improving incident response.