Fleet Geolocation Map in hub Alert Management Overview page#2040
Conversation
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
|
@jgbernalp @saswatamcode Please review this enhancement. |
4036659 to
95d242c
Compare
saswatamcode
left a comment
There was a problem hiding this comment.
To be quite honest here, I don't really see a strict need for this? Explained more in comments below
| 4. The Perses geomap panel on the alerting overview page renders the cluster as a dot on the world map, colored by health status and sized by the selected metric. | ||
| 5. The administrator toggles between heatmap and map views to assess fleet health spatially. | ||
|
|
||
| For cloud-provisioned clusters (Phase 3), the geolocation is auto-derived from the `topology.kubernetes.io/region` label — no manual action required. |
There was a problem hiding this comment.
You can set this to auto right?
There was a problem hiding this comment.
What do you mean?
It doesnt replace the original label, it adds a new one if missing, that is derived from the topology.kubernetes.io/region label
| see-also: | ||
| - "/enhancements/monitoring/alerts-ui-management.md" | ||
| --- | ||
| # Fleet Geolocation Map |
There was a problem hiding this comment.
I, of course, agree with the utility of knowing what region and AZ an alert is firing in, but I don't understand why we have to visualize this in a map.
Usually, admin/SRE personas don't exactly care about seeing alerts on a map, as there is virtually no utility in having that spatial context.
I think what might be useful is grouping together alerts by regions and AZs, which will give an SRE the same info, without a complex map view.
However, doing that automatically from our side is something I would caution against for a couple of reasons,
- Region/AZs from a cloud provider are something we can be aware of, but if a customer is running their own on-prem infra, we aren't really aware of what their region/AZ split looks like.
- Most admins/SRE teams label or name their infra (and alerts as well) by their associated geographic position, as best practice or as per internal conventions. This is on purpose, so that their teams can determine regionality/az at a glance. We would have to actively override that on resources, especially alerts, which will likely cause customers quite a lot of confusion
There was a problem hiding this comment.
Thanks for the feedback @saswatamcode .
A few clarifications:
-
Map vs. grouping: You're right that for daily SRE triage, a grouped list by region is more practical. The map isn't replacing that — it's a complementary view mode (toggle) for scenarios where spatial context helps: large edge fleets (50–500+ sites), blast radius assessment, and stakeholder/NOC communication. I'll make this framing clearer in the proposal and add "grouped by region" to Alternatives Considered.
-
On overriding labels: To clarify — this doesn't modify any existing resources or alert labels. geolocation is a new, opt-in label on ManagedCluster resources. Clusters without it simply don't appear on the map. Existing conventions are untouched.
-
On on-prem: Agreed, we can't auto-derive for on-prem. The design is entirely opt-in — on-prem customers who want it set the label themselves; those who don't aren't affected.
-
On scope: The core deliverable is the geolocation label convention, which enables grouping/filtering independently of the map. The map panel is Phase 2, gated on upstream Perses availability.
Introduces a geolocation label convention for ManagedCluster resources and a Perses geomap panel for visualizing fleet clusters on a world map. The panel resolves standard location codes (countries, states) to coordinates using a built-in gazetteer. Each cluster dot is colored by health status and sized by a configurable metric, enabling spatial awareness of fleet health at a glance. Upstream Perses issue: perses/perses#4132 Signed-off-by: Shirly Radco <sradco@redhat.com> Co-authored-by: Cursor <cursoragent@cursor.com>
95d242c to
2e980d3
Compare
|
@sradco: all tests passed! Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
Summary
This proposal introduces a geographic map visualization for managed clusters on the hub alerting overview page. Clusters are plotted on a world map as dots — colored by health status and sized by configurable metrics — providing spatial awareness of fleet health.
Key Design Decisions
geolocationlabel on ManagedCluster (country/state codes, e.g.,IE,US.VA)acm_managed_cluster_labelsmetric — no new synthetic metrics.Phased Delivery
geolocationto the label allowlist, document conventions, file upstream Perses issue.Dependencies
Test Plan
Covered in the enhancement document.
Made with Cursor