TXM: Rpc liveness by GabrielMartinezRodriguez · Pull Request #562 · HappyChainDevs/happychain

GabrielMartinezRodriguez · 2025-03-26T14:41:06Z

Linked Issues

closes https://linear.app/happychain/issue/HAPPY-366/txm-add-rpc-liveness-monitor

Description

Added a liveness monitor to the Transaction Manager

Include all relevant context (but no need to repeat the issue's content).
Draw attention to new, noteworthy & unintuitive elements.

Toggle Checklist

Checklist

Basics

B1. I have applied the proper label & proper branch name (e.g. norswap/build-system-caching).
B2. This PR is not so big that it should be split & addresses only one concern.
B3. The PR targets the lowest branch it can (ideally master).

Reminder: PR review guidelines

Correctness

C1. Builds and passes tests.
C2. The code is properly parameterized & compatible with different environments (e.g. local,
testnet, mainnet, standalone wallet, ...).
C3. I have manually tested my changes & connected features.

< INDICATE BROWSER, DEMO APP & OTHER ENV DETAILS USED FOR TESTING HERE >

< INDICATE TESTED SCENARIOS (USER INTERFACE INTERACTION, CODE FLOWS) HERE >

C4. I have performed a thorough self-review of my code after submitting the PR,
and have updated the code & comments accordingly.

Architecture & Documentation

D1. I made it easy to reason locally about the code, by (1) using proper abstraction boundaries,
(2) commenting these boundaries correctly, (3) adding inline comments for context when needed.
D2. All public-facing APIs & meaningful (non-local) internal APIs are properly documented in code
comments.
D3. If appropriate, the general architecture of the code is documented in a code comment or
in a Markdown document.
D4. An appropriate Changeset has been generated (and committed) for changes that touch npm published packages (currently packages/core and packages/react), see here for more info.

cloudflare-workers-and-pages · 2025-03-26T14:41:07Z

Deploying happychain with Cloudflare Pages

Latest commit:	`c884815`
Status:	✅ Deploy successful!
Preview URL:	https://2608acea.happychain.pages.dev
Branch Preview URL:	https://gabriel-txm-rpc-liveness.happychain.pages.dev

View logs

GabrielMartinezRodriguez · 2025-03-26T14:41:21Z

Randomness monitor service #584
Txm traces #521
fix(txm): returned nonce queue order #578
fix(txm): not process the same block multiple times #577
Fix: Viem sends random undefined blocks #575
fix(txm): heap out of memory #573
Avoid initiating new attempts if the gas conditions persist unchanged #571
Dynamic priority fee #570
TXM: Rpc liveness #562 👈 (View in Graphite)
fix(txm): nonce gap #561
PoC: Add metrics to TXM #503
master

This stack of pull requests is managed by Graphite. Learn more about stacking.

linear · 2025-03-26T14:54:13Z

HAPPY-366 TXM: Add RPC liveness monitor

Goal: avoid creating an unbounded amount of attempts when the RPC is down, which then creates a lot of load on the service or on the RPC. At the same time, we want to retry once the RPC comes back up.

This service can receive pings from other components (e.g. block monitor or tx submitter) to determine if the service is alive.

This is a policy that could be customized by the user.

not-reed · 2025-03-27T16:17:17Z

            return
        }

+        this.txmgr.rpcLivenessMonitor.onSuccess()


onSuccess/onFailure sound like callback listeners to me

onSuccess(args => console.log("success", args))

maybe trackSuccess() or something?

Yes, much better!

aodhgan · 2025-04-01T01:03:00Z

+         * @default 2000 (2 seconds)
+         * @unit milliseconds
+         */
+        livenessPingInterval?: number


i think this was supposed to be used in conjunction with livenessSuccessCount but dont see it anywhere

Yup, good catch. It was renamed to livenessCheckInterval, and I forgot to remove livenessPingInterval

norswap · 2025-04-08T19:08:31Z

+            occurredAt: new Date(),
+            success: true,
+        })
+        this.checkIfDown()


I'm wondering if that doesn't add a lot of overhead? If we're making 1000 RPC calls per second (which is not insane, each tx might require a few of these calls), then we're calling this 1000/s, and there are ~10k events in the event window, so 1000 times per second we're filtering through a list of 10,000 events.

Maybe we could maintain a rolling list of counters? Like for 10s period, 10 counters for success events, 10 counters for error events. And just maintain a single timestamp corresponding to the second-granularity timestamp of the oldest counters?

cc @aodhgan for an opinion here

I think you're right. I refactored the code, and I believe this approach is much better

Nice! If we ever need to optimize this more, we can replace the object by an array, and we could also update the count dynamically instead of recomputing it in ratioOfSuccess but this works nicely, merging this.

…s monitor

GabrielMartinezRodriguez mentioned this pull request Mar 26, 2025

fix(txm): nonce gap #561

Merged

11 tasks

GabrielMartinezRodriguez changed the title ~~feat(txm): rpc liveness~~ TXM: Rpc liveness Mar 26, 2025

GabrielMartinezRodriguez self-assigned this Mar 26, 2025

GabrielMartinezRodriguez added the reviewing-1 Ready for, or undergoing first-line review label Mar 26, 2025

GabrielMartinezRodriguez marked this pull request as ready for review March 26, 2025 14:49

GabrielMartinezRodriguez force-pushed the gabriel/fix-randomnness branch from 102b106 to aa82f76 Compare March 27, 2025 10:24

GabrielMartinezRodriguez force-pushed the gabriel/txm-rpc-liveness branch from 8fb618e to b708472 Compare March 27, 2025 10:24

GabrielMartinezRodriguez force-pushed the gabriel/fix-randomnness branch from aa82f76 to 5b83c84 Compare March 27, 2025 10:32

GabrielMartinezRodriguez force-pushed the gabriel/txm-rpc-liveness branch from b708472 to db14c74 Compare March 27, 2025 10:32

This was referenced Mar 27, 2025

Dynamic priority fee #570

Merged

Avoid initiating new attempts if the gas conditions persist unchanged #571

Merged

GabrielMartinezRodriguez force-pushed the gabriel/fix-randomnness branch from 5b83c84 to dafe8fa Compare March 27, 2025 14:40

GabrielMartinezRodriguez force-pushed the gabriel/txm-rpc-liveness branch from db14c74 to db7aedd Compare March 27, 2025 14:41

GabrielMartinezRodriguez force-pushed the gabriel/fix-randomnness branch from dafe8fa to 1beb802 Compare March 27, 2025 14:51

GabrielMartinezRodriguez force-pushed the gabriel/txm-rpc-liveness branch from db7aedd to cc22eb4 Compare March 27, 2025 14:51

GabrielMartinezRodriguez force-pushed the gabriel/fix-randomnness branch from 1beb802 to 3bfc7d8 Compare March 27, 2025 15:39

GabrielMartinezRodriguez force-pushed the gabriel/txm-rpc-liveness branch from cc22eb4 to 32771ef Compare March 27, 2025 15:39

not-reed reviewed Mar 27, 2025

View reviewed changes

GabrielMartinezRodriguez mentioned this pull request Mar 28, 2025

fix(txm): heap out of memory #573

Merged

11 tasks

GabrielMartinezRodriguez force-pushed the gabriel/fix-randomnness branch from 3bfc7d8 to 5692dc3 Compare March 31, 2025 09:54

GabrielMartinezRodriguez force-pushed the gabriel/txm-rpc-liveness branch from 32771ef to 5801281 Compare March 31, 2025 09:54

This was referenced Mar 31, 2025

Fix: Viem sends random undefined blocks #575

Merged

Txm traces #521

Merged

PoC: Add metrics to TXM #503

Merged

aodhgan reviewed Apr 1, 2025

View reviewed changes

Comment thread packages/txm/lib/TransactionManager.ts

aodhgan reviewed Apr 1, 2025

View reviewed changes

Comment thread packages/txm/lib/telemetry/metrics.ts

norswap reviewed Apr 8, 2025

View reviewed changes

norswap added updating Updating after review and removed reviewing-2 Ready for, or undergoing final review labels Apr 8, 2025

GabrielMartinezRodriguez force-pushed the gabriel/fix-randomnness branch from ce73860 to 0c6f204 Compare April 10, 2025 07:52

GabrielMartinezRodriguez force-pushed the gabriel/txm-rpc-liveness branch 3 times, most recently from a3d1d4d to 0c8f54d Compare April 10, 2025 08:37

GabrielMartinezRodriguez force-pushed the gabriel/fix-randomnness branch from bb3d2ab to d03ec85 Compare April 10, 2025 08:51

GabrielMartinezRodriguez force-pushed the gabriel/txm-rpc-liveness branch 2 times, most recently from d72d783 to 75f46bf Compare April 10, 2025 09:12

Base automatically changed from gabriel/fix-randomnness to master April 10, 2025 09:17

GabrielMartinezRodriguez added 8 commits April 10, 2025 11:19

feat(txm): rpc liveness

a5ca61d

feat(txm): added hooks to notify when rpc liveness changes+

1f8449d

feat(txm): added livenessCheckInterval

7c9da01

chore(txm): pr review

032d0c0

chore(txm): added test for RpcLivenessMonitor

325dcca

chore(txm): increase hook timeout

6e8a7ed

chore(txm): fix tests

b2f9c57

chore(txm): implemented a counter for every second in the rpc livenes…

c884815

…s monitor

GabrielMartinezRodriguez force-pushed the gabriel/txm-rpc-liveness branch from 75f46bf to c884815 Compare April 10, 2025 09:56

GabrielMartinezRodriguez added reviewing-2 Ready for, or undergoing final review and removed updating Updating after review labels Apr 10, 2025

GabrielMartinezRodriguez mentioned this pull request Apr 10, 2025

Txm transactions support arbitrary calldata #596

Merged

11 tasks

norswap approved these changes Apr 10, 2025

View reviewed changes

norswap merged commit d22cb22 into master Apr 10, 2025

norswap deleted the gabriel/txm-rpc-liveness branch April 10, 2025 20:11

This was referenced Apr 21, 2025

feat(txm): transactions with value #647

Merged

Faucet service & Iframe integration #661

Merged

feat: deploy faucet #668

Merged

feat(randomness): drand prune #693

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

TXM: Rpc liveness#562

TXM: Rpc liveness#562
norswap merged 8 commits into
masterfrom
gabriel/txm-rpc-liveness

GabrielMartinezRodriguez commented Mar 26, 2025 •

edited

Loading

Uh oh!

cloudflare-workers-and-pages Bot commented Mar 26, 2025 •

edited

Loading

Uh oh!

GabrielMartinezRodriguez commented Mar 26, 2025 •

edited

Loading

Uh oh!

linear Bot commented Mar 26, 2025

Uh oh!

not-reed Mar 27, 2025

Uh oh!

GabrielMartinezRodriguez Apr 3, 2025

Uh oh!

Uh oh!

aodhgan Apr 1, 2025

Uh oh!

GabrielMartinezRodriguez Apr 3, 2025

Uh oh!

Uh oh!

norswap Apr 8, 2025

Uh oh!

norswap Apr 8, 2025

Uh oh!

GabrielMartinezRodriguez Apr 10, 2025

Uh oh!

norswap Apr 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

GabrielMartinezRodriguez commented Mar 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Linked Issues

Description

Checklist

Basics

Correctness

Architecture & Documentation

Uh oh!

cloudflare-workers-and-pages Bot commented Mar 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Deploying happychain with Cloudflare Pages

Uh oh!

GabrielMartinezRodriguez commented Mar 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

linear Bot commented Mar 26, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

GabrielMartinezRodriguez commented Mar 26, 2025 •

edited

Loading

cloudflare-workers-and-pages Bot commented Mar 26, 2025 •

edited

Loading

GabrielMartinezRodriguez commented Mar 26, 2025 •

edited

Loading