fix(valkey): seed on the current master, not a reachable replica #687
Open
mreho wants to merge 2 commits into
Walkthrough
The Sentinel entrypoint's initial bootstrap logic in the Valkey Helm chart's configmap is modified to discover the current master by repeatedly calling
Changes
Sentinel Bootstrap Discovery Update
Estimated code review effort: 2 (Simple) | ~10 minutes
Pre-merge checks: 5 passed
Summary
Problem
The Sentinel bootstrap seeds on the node currently reporting `role:master`, but the wait loop only ran that discovery when the static seed (`node-0`) was unreachable. After a failover, `node-0` can be a live replica, so a reachability-only check can seed Sentinel from a read-only node and advertise the wrong master.
Fix
Discover the node reporting `role:master` among the data-node peers first, and fall back to the static seed only if none answers within the bounded budget.
Type Of Change
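The discovery order described under Fix could look roughly like the sketch below. It is not the chart's actual entrypoint code: the names `PEERS`, `STATIC_SEED`, `DISCOVERY_BUDGET`, `DISCOVERY_INTERVAL`, `VALKEY_PORT`, `query_role`, and `discover_master` are all hypothetical, and it assumes `valkey-cli` is on the `PATH` with no auth or TLS flags required.

```shell
#!/bin/sh
# Illustrative sketch only; every variable and function name here is hypothetical.

# query_role HOST: print the "role:..." line from INFO replication for HOST.
# Assumption: plain TCP, no password. Prints nothing if HOST does not answer.
query_role() {
  valkey-cli -h "$1" -p "${VALKEY_PORT:-6379}" info replication 2>/dev/null \
    | tr -d '\r' | grep '^role:'
}

# discover_master: print the first peer that reports role:master.
# Peers are polled for up to DISCOVERY_BUDGET rounds; only when no peer
# claims the master role within that budget is STATIC_SEED printed instead.
discover_master() {
  attempts="${DISCOVERY_BUDGET:-30}"
  while [ "$attempts" -gt 0 ]; do
    for peer in $PEERS; do
      if [ "$(query_role "$peer")" = "role:master" ]; then
        echo "$peer"
        return 0
      fi
    done
    attempts=$((attempts - 1))
    if [ "$attempts" -gt 0 ]; then
      sleep "${DISCOVERY_INTERVAL:-1}"
    fi
  done
  echo "no peer reported role:master; falling back to static seed" >&2
  echo "$STATIC_SEED"
}
```

A caller would capture the result with something like `seed="$(discover_master)"`. The point of this ordering is that a reachable `node-0` is never trusted on reachability alone; it is used only after the role check has failed for every peer across the whole budget.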
PR Governance
- `Resolves #NNN` or `Related to #NNN`)
- `enhancement` and `type:feature` are applied
Checklist
- `main`
- `main`
- `version` in `Chart.yaml` manually
Site Sync (GR-007)
Local Validation
- `make validate-chart CHART=valkey TIMEOUT=1200` passed end-to-end: FULLY VALIDATED (21 layers), including k3d behavioral scenarios for default, cluster, dual-stack, existing-secret, external-secrets, metrics, networkpolicy, replication, sentinel, standalone, tls-cluster, and tls-replication.
- `make site-sync-check CHART=valkey` passed on site#365.
- `make release-check REPO=charts` passed with the expected GR-077 post-merge release confirmation warning.
- `make attribution-check REPO=charts` passed.
Remote Validation
`standards-check` is skipped by fork policy and covered locally.
Issue: #633