Executive Overview Dashboard
A single-screen dashboard for management: health score, active problems, SLO status, and key metrics.
Tile 1: Active Problems (Single Value)
fetch events, from:now()-24h
| filter event.kind == "DAVIS_PROBLEM"
| filter event.status == "ACTIVE"
| summarize cnt=count()
Tile 2: Host Health (Single Value)
fetch dt.entity.host
| summarize total=count(), healthy=countIf(state == "RUNNING")
| fieldsAdd pct = 100.0 * healthy / total
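As written, the percentage can show a long decimal tail and divides by zero if no hosts are returned. A minimal sketch of a more defensive variant follows. It assumes that `state` has to be added explicitly with fieldsAdd in your environment; the one-decimal rounding and the 0.0 fallback are illustrative choices, not part of the original tile.

```dql
fetch dt.entity.host
// Assumption: attributes other than id and entity.name may need to be added explicitly
| fieldsAdd state
| summarize total = count(), healthy = countIf(state == "RUNNING")
// Guard against an empty host list and round to one decimal for display
| fieldsAdd pct = if(total > 0, round(100.0 * healthy / total, decimals: 1), else: 0.0)
```

In the tile settings, pick pct as the displayed value so the single-value visualization shows the percentage rather than the raw counts.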
Tile 3: Service Error Rate (Line Chart)
timeseries avg(dt.service.request.failure_rate), by:{dt.entity.service}
Tile 4: CPU Usage (Line Chart)
timeseries avg(dt.host.cpu.usage), by:{dt.entity.host}
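With many hosts, one line per host can make this chart hard to read on an executive screen. One possible way to narrow it down is sketched here, assuming that showing only the busiest hosts is acceptable; the field names cpu and avg_cpu and the cut-off of 5 are arbitrary.

```dql
timeseries cpu = avg(dt.host.cpu.usage), by:{dt.entity.host}
// Average each host's series over the selected timeframe
| fieldsAdd avg_cpu = arrayAvg(cpu)
// Keep only the hosts with the highest average CPU usage
| sort avg_cpu desc
| limit 5
```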
Tile 5: Log Errors (Single Value)
fetch logs, from:now()-24h
| filter loglevel == "ERROR"
| summarize cnt=count()
Tile 6: Recent Problems (Table)
fetch events, from:now()-7d
| filter event.kind == "DAVIS_PROBLEM"
| fields display_id, event.name, event.status, timestamp
| sort timestamp desc
| limit 5
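If a problem is stored as several event records (for example, one per status update), the query above may list the same problem more than once. The sketch below collapses the records to one row per display_id, assuming the newest record carries the current status; the field names name, status, and last_update are illustrative.

```dql
fetch events, from:now()-7d
| filter event.kind == "DAVIS_PROBLEM"
// Newest record first, so takeFirst() picks each problem's latest values
| sort timestamp desc
| summarize {name = takeFirst(event.name), status = takeFirst(event.status), last_update = takeFirst(timestamp)}, by:{display_id}
| sort last_update desc
| limit 5
```

The same grouping step, followed by a filter on status == "ACTIVE" and a count, could be used for the Active Problems tile if duplicates inflate its number.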
Layout
┌──────────────┬──────────────┬──────────────┐
│ Active       │ Host Health  │ Log Errors   │
│ Problems     │ 100%         │ 24h          │
│ (single val) │ (single val) │ (single val) │
├──────────────┴──────────────┴──────────────┤
│ Service Error Rate (line chart, full width)│
├──────────────┬─────────────────────────────┤
│ CPU Usage    │ Recent Problems (table)     │
│ (line chart) │                             │
└──────────────┴─────────────────────────────┘
💡 Keep executive dashboards to 6-8 tiles at most. If viewers need more detail, link to a dedicated service or infrastructure dashboard.
Try it: Open Dashboards → "+ Dashboard" → add a tile → select "Query Grail" → paste fetch dt.entity.host | summarize total=count(), healthy=countIf(state == "RUNNING") → choose "Single value" visualization. You just built your first executive KPI tile.