Application Metrics
Garot Conklin edited this page Feb 6, 2025 · 1 revision
Example configurations for monitoring application performance and health.

version: "1.0"
dashboards:
  - name: "Web Application Monitoring"
    description: "Key performance indicators for web applications"
    layout_type: "ordered"
    template_variables:
      - name: "service"
        prefix: "service"
        default: "web-api"
    widgets:
      - title: "Request Rate"
        type: "timeseries"
        query: "sum:http.requests{service:$service} by {endpoint}.as_rate()"
      - title: "Error Rate"
        type: "timeseries"
        query: "sum:http.errors{service:$service} by {endpoint}.as_rate()"
      - title: "Response Time (p95)"
        type: "timeseries"
        query: "p95:http.response.time{service:$service} by {endpoint}"
      - title: "Active Users"
        type: "query_value"
        query: "sum:users.active{service:$service}"

version: "1.0"
dashboards:
  - name: "Database Performance"
    description: "Database performance metrics"
    layout_type: "ordered"
    template_variables:
      - name: "db"
        prefix: "database"
        default: "main"
    widgets:
      - title: "Query Response Time"
        type: "timeseries"
        query: "avg:database.query.time{database:$db} by {query_type}"
      - title: "Active Connections"
        type: "query_value"
        query: "avg:database.connections.active{database:$db}"
      - title: "Cache Hit Ratio"
        type: "timeseries"
        query: "avg:database.cache.hit_ratio{database:$db}"

version: "1.0"
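dashboards:
  - name: "Service Health"
    description: "Service health and availability metrics"
    layout_type: "ordered"
    # $service is referenced by the widgets below, so it is declared here
    # in the same way as in the Web Application Monitoring example.
    template_variables:
      - name: "service"
        prefix: "service"
        default: "web-api"
    widgets:
      - title: "Service Status"
        type: "check_status"
        check: "service.up"
        grouping: "cluster"
      - title: "Error Logs"
        type: "log_stream"
        query: "status:error service:$service"
      - title: "Dependency Map"
        type: "service_map"
        service: "$service"

Best practices:
- Use appropriate time aggregations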
- Monitor error rates and latencies
- Track user experience metrics
- Set up meaningful alerts
- Use template variables for flexibility

Application metrics:
- Request rates
- Error rates
- Response times
- User sessions

Database metrics:
- Query performance
- Connection pools
- Cache efficiency
- Storage metrics

Service metrics:
- Availability
- Error rates
- Resource usage
- Dependency health
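
The advice to use template variables can be backed by a small pre-deployment check that every `$name` a widget references is also declared under `template_variables`. The sketch below is only an illustration: it assumes the configuration has already been parsed into a Python dict (for example by a YAML loader) with the same keys as the examples on this page. The function name `undeclared_variables` and the sample dashboard are hypothetical and not part of any tool.

```python
import re

# Matches $name placeholders such as $service or $db.
_VAR_PATTERN = re.compile(r"\$([A-Za-z_][A-Za-z0-9_]*)")


def undeclared_variables(dashboard: dict) -> set:
    """Return template variable names used by widgets but never declared."""
    declared = {var["name"] for var in dashboard.get("template_variables", [])}
    used = set()
    for widget in dashboard.get("widgets", []):
        # Scan every string field (query, service, ...) for $name references.
        for value in widget.values():
            if isinstance(value, str):
                used.update(_VAR_PATTERN.findall(value))
    return used - declared


# Hypothetical parsed dashboard that mixes a declared and an undeclared variable.
dashboard = {
    "name": "Database Performance",
    "template_variables": [
        {"name": "db", "prefix": "database", "default": "main"},
    ],
    "widgets": [
        {
            "title": "Active Connections",
            "type": "query_value",
            "query": "avg:database.connections.active{database:$db}",
        },
        {
            "title": "Error Logs",
            "type": "log_stream",
            "query": "status:error service:$service",
        },
    ],
}

missing = undeclared_variables(dashboard)
if missing:
    print("Undeclared template variables:", ", ".join(sorted(missing)))
```

Running a check like this before deployment catches a widget that silently queries a literal `$service` string because the variable was never declared on that dashboard.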