Overlay V2 cleanup by drebelsky · Pull Request #5296 · stellar/stellar-core

drebelsky · 2026-05-27T18:39:34Z

The goal of this PR is to create a more stable baseline for experiments to compare against. In particular, the goal is to reach a steady state with data sizes being capped and data being regularly cleaned up.

Changes

RemoveTxsFromMempool (which is called on ledger close) now also calls evict_expired to remove stale TXs
PendingRequests::process_timeouts now evicts the hashes that were given up on
Unify the two LRUs in InvTracker
Switch from removing random to removing last tx set in TxSetCache and re-order eviction after insertion.

Not changed since we're assuming non-malicious peers + a reasonably bounded number of peers

InvBatcher keeps an entry in the map for every peer that ever existed
SharedState::peer_streams (unbounded, but limited to total number of connected peers)
App::pending_scp_state_requests (unbounded, but as long as nodes aren't falling out of sync, should only have at most one entry per peer (from the message sent on start up))
App::{known_peers, peer_hostnames, configured_peers}: all of these are bounded by the number of configured peers
App::local_addrs: there shouldn't be too many local addresses
The following LRU cache sizes remain unchanged and cleanup remains just LRU eviction. The sizes are small enough that I think we should hit the steady state full usage relatively quickly (although, I'm still examining the sizes for scp_seen and scp_sent_to)
- InvTracker's cache(s)
- SharedState::{scp_seen, tx_seen, scp_sent_to, tx_set_sources}
- TxBuffer

Otherwise unchanged:

I left the channels alone since, hopefully under normal load these shouldn't start backing up.
Claude suggests that there is some unbounded state when accepting peers, but this will be fine for benchmarking
SharedState::pending_txset_requests: if the request isn't responded to, these continue to take up space. For benchmarking, this should only happen if a peer disconnects, in which case they do get cleaned up, and we try to fetch from the next peer. This is probably worth addressing with some reasonable timeouts later.

drebelsky · 2026-05-27T18:45:06Z

For many of the LRU caches without cleanup, it might be worth considering switching to some form of TTLCache (although, the decreased cache locality from lru probably isn't that substantial in our workload).

drebelsky added 4 commits May 27, 2026 11:32

Clean up expired txs from mempool on ledger close

1959989

Abandon hashes that have reached give up threshold

325f709

Unify LRU order in InvTracker

6ec271c

Remove oldest inserted tx set in TxSetCache

461d6ae

drebelsky changed the title ~~V2 clean up data~~ Overlay V2 cleanup May 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Overlay V2 cleanup#5296

Overlay V2 cleanup#5296
drebelsky wants to merge 4 commits into
stellar:overlay-v2-sharedfrom
drebelsky:v2-clean-up-data

drebelsky commented May 27, 2026 •

edited

Loading

Uh oh!

drebelsky commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

drebelsky commented May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

drebelsky commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

drebelsky commented May 27, 2026 •

edited

Loading