Skip to content

🐛 manager: suppress expected leader-election-lost error on graceful shutdown#3536

Open
liketosweep wants to merge 2 commits into
kubernetes-sigs:mainfrom
liketosweep:issue-3535
Open

🐛 manager: suppress expected leader-election-lost error on graceful shutdown#3536
liketosweep wants to merge 2 commits into
kubernetes-sigs:mainfrom
liketosweep:issue-3535

Conversation

@liketosweep

Copy link
Copy Markdown

Fixes #3535

What the Issue Was

On graceful shutdown with LeaderElection enabled, cancelling the
leader election context triggers OnStoppedLeading, which pushes
errLeaderElectionLost onto errChan. The drain goroutine in
engageStopProcedure only filtered context.Canceled, so this
expected error was logged as unexpected:

level=ERROR msg="error received after stop sequence was engaged" err="leader election lost"

When combined with t.Output() in tests, this also causes panics or
data races as the manager writes to the test log after the test exits.

Solution

  • Added a package-level errLeaderElectionLost sentinel for reliable
    errors.Is matching.
  • Used the sentinel in OnStoppedLeading instead of an anonymous
    errors.New.
  • Added the sentinel to the engageStopProcedure drain filter
    alongside context.Canceled.
    Genuine mid-run leader election loss (before stop is engaged) still
    propagates to Start's caller unchanged.

Changes

  • pkg/manager/internal.go: sentinel + filter fix.
  • pkg/manager/manager_test.go: regression test for graceful shutdown.

Testing

$ go test sigs.k8s.io/controller-runtime/pkg/manager -v \
  -run "should not log an error when leader election is lost during graceful shutdown"
--- PASS
PASS

@kubernetes-prow

Copy link
Copy Markdown
Contributor

Welcome @liketosweep!

It looks like this is your first PR to kubernetes-sigs/controller-runtime 🎉. Please refer to our pull request process documentation to help your PR have a smooth ride to approval.

You will be prompted by a bot to use commands during the review process. Do not be afraid to follow the prompts! It is okay to experiment. Here is the bot commands documentation.

You can also check if kubernetes-sigs/controller-runtime has its own contribution guidelines.

You may want to refer to our testing guide if you run into trouble with your tests not passing.

If you are having difficulty getting your pull request seen, please follow the recommended escalation practices. Also, for tips and tricks in the contribution process you may want to read the Kubernetes contributor cheat sheet. We want to make sure your contribution gets all the attention it needs!

Thank you, and welcome to Kubernetes. 😃

@kubernetes-prow kubernetes-prow Bot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Jun 25, 2026
@kubernetes-prow

Copy link
Copy Markdown
Contributor

Hi @liketosweep. Thanks for your PR.

I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work.

Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@kubernetes-prow kubernetes-prow Bot added the needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. label Jun 25, 2026
@kubernetes-prow

Copy link
Copy Markdown
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: liketosweep
Once this PR has been reviewed and has the lgtm label, please assign sbueringer for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@kubernetes-prow kubernetes-prow Bot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Jun 25, 2026
Comment thread pkg/manager/internal.go
Signed-off-by: liketosweep <liketosweep@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

level=ERROR msg="error received after stop sequence was engaged" err="leader election lost"

2 participants