name: aggregating-performance-metrics
description: Aggregate and centralize performance metrics from applications, systems, databases, caches, and services. Use when consolidating monitoring data from multiple sources. Trigger with phrases like "aggregate metrics", "centralize monitoring", or "collect performance data".
version: 1.0.0
allowed-tools:
- Read
- Write
- Bash(prometheus:*)
- Bash(metrics:*)
- Bash(monitoring:*)
- Grep
license: MIT
Overview
This skill empowers Claude to streamline performance monitoring by aggregating metrics from diverse systems into a unified view. It simplifies the process of collecting, centralizing, and analyzing performance data, leading to improved insights and faster issue resolution.
How It Works
- Metrics Taxonomy Design: Claude assists in defining a clear and consistent naming convention for metrics across all systems.
- Aggregation Tool Selection: Claude helps select the appropriate metrics aggregation tool (e.g., Prometheus, StatsD, CloudWatch) based on the user's environment and requirements.
- Configuration and Integration: Claude guides the configuration of the chosen aggregation tool and its integration with various data sources.
- Dashboard and Alert Setup: Claude helps set up dashboards for visualizing metrics and defining alerts for critical performance indicators.
When to Use This Skill
This skill activates when you need to:
- Centralize performance metrics from multiple applications and systems.
- Design a consistent metrics naming convention.
- Choose the right metrics aggregation tool for your needs.
- Set up dashboards and alerts for performance monitoring.
Examples
Example 1: Centralizing Application and System Metrics
User request: "Aggregate application and system metrics into Prometheus."
The skill will:
- Guide the user in defining metrics for applications (e.g., request latency, error rates) and systems (e.g., CPU usage, memory utilization).
- Help configure Prometheus to scrape metrics from the application and system endpoints.
Example 2: Setting Up Alerts for Database Performance
User request: "Centralize database metrics and set up alerts for slow queries."
The skill will:
- Help the user define metrics for database performance (e.g., query execution time, connection pool usage).
- Guide the user in configuring the aggregation tool to collect these metrics from the database.
- Assist in setting up alerts in the aggregation tool to notify the user when query execution time exceeds a defined threshold.
Best Practices
- Naming Conventions: Use a consistent and well-defined naming convention for all metrics to ensure clarity and ease of analysis.
- Granularity: Choose an appropriate level of granularity for metrics to balance detail and storage requirements.
- Retention Policies: Define retention policies for metrics to manage storage space and ensure data is available for historical analysis.
Integration
This skill integrates with other plugins that manage infrastructure, deploy applications, and monitor system health. For example, it can be used in conjunction with a deployment plugin to automatically configure metrics collection after a new application deployment.
Prerequisites
- Access to metrics collection tools (Prometheus, StatsD, CloudWatch)
- Network connectivity to metric sources
- Metrics storage configuration in {baseDir}/metrics/
- Understanding of metrics taxonomy
Instructions
- Design consistent metrics naming convention
- Select appropriate aggregation tool for environment
- Configure metric collection from all sources
- Set up centralized storage and retention policies
- Create dashboards for visualization
- Define alerts for critical metrics
Output
- Metrics aggregation configuration files
- Unified naming convention documentation
- Dashboard definitions for key metrics
- Alert rules for performance thresholds
- Integration guides for metric sources
Error Handling
If metrics aggregation fails:
- Verify network connectivity to sources
- Check authentication credentials
- Validate metrics format compatibility
- Review storage capacity and retention
- Ensure aggregation tool configuration
Resources
- Prometheus aggregation documentation
- StatsD protocol specifications
- CloudWatch metrics API reference
- Metrics naming best practices