ln: reduce size_of Event from 1856 B -> 720 B, prep for inline storage in offer `*Contents` #3723

phlip9 · 2025-04-09T04:08:23Z

It looks like the Event enum has gotten pretty large @ 1856 B, which forces even small variants like Event::OnionMessagePeerConnected { peer_node_id: PublicKey } to waste a bunch of memory. It also blows up the size of our handler Future(s), since we move Events into them.

Fortunately we can clean up some of the low-hanging fruit pretty easily. Here's two diffs that reduce size_of::<Event>() from ~~1680 B -> 576 B~~ 1856 B -> 720 B.

Box InvoiceContents in Bolt12Invoice and StaticInvoice. This shrinks Event from 1856 B -> 1072 B

InvoiceContents is private, so this shouldn't be a breaking change

Box AnchorDescriptor in BumpTransactionEvent. This shrinks Event from 1072 B -> 720 B.

AnchorDescriptor is technically public and boxing it is a semver breaking change, but the struct's pretty deep in there... Guess I'll leave that to your discretion.

We could go even further to 320 B by boxing PaymentPurpose, but that feels like a much more invasive / semver breaking change.

ldk-reviews-bot · 2025-04-09T04:08:26Z

I've assigned @wpaulino as a reviewer!
I'll wait for their review and will help manage the review process.
Once they submit their review, I'll check if a second reviewer would be helpful.

codecov · 2025-04-09T04:22:11Z

Codecov Report

Attention: Patch coverage is 96.49123% with 2 lines in your changes missing coverage. Please review.

Project coverage is 89.75%. Comparing base (18eb30b) to head (4038f14).

Files with missing lines	Patch %	Lines
lightning/src/offers/invoice.rs	95.00%	0 Missing and 1 partial ⚠️
lightning/src/offers/invoice_request.rs	91.66%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #3723      +/-   ##
==========================================
+ Coverage   89.72%   89.75%   +0.02%     
==========================================
  Files         159      159              
  Lines      128672   128675       +3     
  Branches   128672   128675       +3     
==========================================
+ Hits       115456   115493      +37     
+ Misses      10531    10496      -35     
- Partials     2685     2686       +1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

tnull

To be honest, I'm not sure if its preferable to trade the size against an increased risk of heap fragmentation? Did you consider that, and how did you decide a smaller Event type was worth it?

vincenzopalazzo

I’m not sure I understand the reason for this change. Did you encounter any issues, or is it just for size reduction?

wpaulino

I can see how the large size can be a constraint for environments with smaller stack sizes/limited memory. Events are tracked in LDK within a Vec, so the memory of each one already lives on the heap. The issue comes from moving each event into the event handler, so the 1680B will be put on the stack every time regardless of the event variant size. Perhaps we should modify the event handler to take events by reference instead?

tnull · 2025-04-09T17:08:56Z

I can see how the large size can be a constraint for environments with smaller stack sizes/limited memory.

Sure, not denying it can become an issue in certain circumstances.

Events are tracked in LDK within a Vec, so the memory of each one already lives on the heap.

The issue is not that they live on the heap, the issue of heap fragmentation is that you have a bazillion tiny allocations that make huge parts of memory unusable after some time, as it becomes harder and harder to find 'fitting gaps' for larger allocations.

The issue comes from moving each event into the event handler, so the 1680B will be put on the stack every time regardless of the event variant size. Perhaps we should modify the event handler to take events by reference instead?

I guess although would be a pity to give up on move semantics? FWIW, it might be worth exploring if the event queue could be an Arc<Mutex<Vec<Event>>> to avoid the reallocations of a Vec per invocation?

wpaulino · 2025-04-09T17:48:21Z

The issue is not that they live on the heap, the issue of heap fragmentation is that you have a bazillion tiny allocations that make huge parts of memory unusable after some time, as it becomes harder and harder to find 'fitting gaps' for larger allocations.

Yeah I definitely don't think we should Box here, was just noting that the events already live on the heap before handling them.

I guess although would be a pity to give up on move semantics?

Not sure how else we can address this otherwise. FWIW, we don't have move semantics on wire messages even though we could, possibly for the same reason as we'd want to avoid them here.

FWIW, it might be worth exploring if the event queue could be an Arc<Mutex<Vec>> to avoid the reallocations of a Vec per invocation?

That extra allocation is not ideal, but it would also go away if we gave the EventHandler references.

phlip9 · 2025-04-09T18:56:24Z

To be honest, I'm not sure if its preferable to trade the size against an increased risk of heap fragmentation? Did you consider that, and how did you decide a smaller Event type was worth it?

Right, there is slight heap fragmentation in 1.5 / 25 variants in exchange for reduced heap+stack usage, cache misses, etc for all variants.

I'm not super familiar with the bench setup in LDK, but I tried running channelmanager::bench_sends before and after this PR on an M1 mac:

dev/ldk/bench$ RUSTFLAGS="--cfg=ldk_bench" c bench
   Compiling lightning v0.2.0+git (dev/ldk/lightning)
   Compiling lightning-rapid-gossip-sync v0.2.0+git (dev/ldk/lightning-rapid-gossip-sync)
   Compiling lightning-persister v0.2.0+git (dev/ldk/lightning-persister)
   Compiling lightning-bench v0.0.1 (dev/ldk/bench)
    Finished `bench` profile [optimized + debuginfo] target(s) in 3m 18s
     Running benches/bench.rs (target/release/deps/bench-126259c02b665b62)
bench_sends             time:   [4.3944 ms 4.4082 ms 4.4229 ms]
                        change: [-1.5040% -0.7683% -0.1187%] (p = 0.03 < 0.05)
                        Change within noise threshold.
Found 5 outliers among 100 measurements (5.00%)
  5 (5.00%) high mild

Within the noise threshold, but not nothing. An OnionMessenger process_events bench would probably make this more apparent.

Ultimately the most performant approach IMO would be to just hand a batch of events to the consumer instead of the current "event-by-event" handling. That would complicate the error handling, but would let us persist batches of events that we need to handle asynchronously.

phlip9 · 2025-04-09T19:30:50Z

Sorry, to add on to this, we currently have some layers of async handlers for each Event that pass by value. Like:

async fn event_handler.get_ldk_handler_future(event) // 14624 B (!)
async fn event_handler.handle_inline(event) // 12928 B
async fn event_handler.handle_event(id, event) // 11136 B
async fn event_handler.persist_and_spawn_handler(event) // 1872 B
async fn event::handle_payment_claimable(id, {..}) // ...
...

And each layer gets blown up by + size_of Event. So reducing size_of Event would have a multiplicative effect in reducing the size of our futures.

tnull · 2025-04-10T12:08:56Z

I'm not super familiar with the bench setup in LDK, but I tried running channelmanager::bench_sends before and after this PR on an M1 mac:

Did you adjust the sample size to something significant? Otherwise executing a single bench run is far from represenative either way.

Ultimately the most performant approach IMO would be to just hand a batch of events to the consumer instead of the current "event-by-event" handling. That would complicate the error handling, but would let us persist batches of events that we need to handle asynchronously.

Hmm, well, I think we started discussing this in https://github.com/orgs/lightningdevkit/discussions/2381 / #2491, where we established why it's not trivial to "just enable" concurrent event handling for certain variants at least. But I agree that would be the proper/longer term fix for the issue at hand.

Sorry, to add on to this, we currently have some layers of async handlers for each Event that pass by value.

Without more context I'm not quite understanding what these layers do or why you chose to go that way. But it seems the issue you are trying to solve by this PR is partially self-inflicted by your architecture, IIUC? Just out of curiousity, would you like the alternative solution proposed above, i.e., giving Events by reference rather than by value?

TheBlueMatt · 2025-04-11T01:18:05Z

Within the noise threshold, but not nothing.

FWIW I'm incredibly skeptical that changing a few boxes would result in a change that is more than 100 nanoseconds across the entire send pipeline, which is definitely unmeasureable in the bench_sends.

As for whether to reduce the size of Events at all, indeed, there's a tradeoff between heap fragmentation and object size. Generally in LDK we try to be cautious about heap allocations as much as possible, and in the case of Event, it shows 😅.

Box AnchorDescriptor in BumpTransactionEvent. This shrinks Event from 1072 B -> 576 B.

If it were just this, I'd say go for it! This event is relatively rare (as it only happens on a timer when we have stuff pending on chain), so the impact is capped.

Box InvoiceContents in Bolt12Invoice and StaticInvoice. This shrinks Event from 1680 B -> 1072 B

But this case is a bit trickier. #3730 may reduce allocations in InvoiceContents by a few (as features are actually added for the offers/invoice-request/invoice/blinded path contexts), but really there's already a lot of small allocations in BOLT 12's, which we kinda need to cut down on.

One thing we could do here is take #3730 a few steps further and have similar pre-allocated variable-length things for some of the stuff in BOLT 12 structs - eg. pre-allocate the blinded paths, pre-allocate the issuer, description, and payer note, and pre-allocate in HumanReadableName. Of course in most of those cases we wouldn't be able to do the pre-allocation without any additional memory usage (unlike #3730), but allocating a bit bigger is a reasonable tradeoff if we're then going to actually store the contents on the heap (a string on the heap is going to have at least 6 pointers overhead anyway - 3 pointers for malloc on the heap and 3 pointers for the String itself, plus you can generally multiply by two for fragmentation costs, so pre-allocating 32 or even 64 bytes isn't really crazy).

TheBlueMatt · 2025-05-13T18:24:54Z

Looking at the contents, ISTM if we just defined a pre-allocated string+vec type, we could reduce our allocations to zero in common cases - if we make OfferContents::description, OfferContents::issuer, and InvoiceRequestContentsWithoutPayerSigningPubkey::payer_note pre-allocated strings as well as OfferContents::paths, Metadata::Bytes, and InvoiceFields::payment_paths pre-allocated vecs (and ofc store the OfferContents and InvoiceRequestContents boxed within their usual structs as well) allocations in the BOLT12 *Contents would just be OfferContents::chains (which should always be empty on normal mainnet offers so who cares), and InvoiceFields::fallbacks (which will generally never be set, I imagine).

Given #3730 basically defines a pre-allocated Vec type (it just needs to be generic-ified), it wouldn't be hard to extend it to remove the allocations in the BOLT 12 structs. Thus, ISTM we should move forward with this with the intention of eventually removing all the allocations entirely.

That said, rustfmt is failing, so that needs fixing. Mind going ahead and moving the *Contents containers into Boxes in all the BOLT 12 structs?

Prep for inline storage for `description`, `issuer`, `payer_note` * invoice: box `InvoiceContents` reduces `mem::size_of::<Event>()` : 1856 -> 1072 * invoice_request: box contents * offer: box `OfferContents` * refund: box `RefundContents` * static invoice: box `InvoiceContents`

ldk-reviews-bot requested a review from wpaulino April 9, 2025 04:18

tnull reviewed Apr 9, 2025

View reviewed changes

vincenzopalazzo reviewed Apr 9, 2025

View reviewed changes

wpaulino reviewed Apr 9, 2025

View reviewed changes

phlip9 added 2 commits May 17, 2025 00:03

events: box AnchorDescriptor. reduce size_of Event 1072 -> 720

4038f14

phlip9 force-pushed the phlip9/reduce-events-size branch from ddf2296 to 4038f14 Compare May 17, 2025 07:05

phlip9 changed the title ~~ln: reduce size_of Event from 1680 B -> 576 B~~ ln: reduce size_of Event from 1856 B -> 720 B, prep for inline storage in offer *Contents May 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ln: reduce size_of Event from 1856 B -> 720 B, prep for inline storage in offer `*Contents` #3723

ln: reduce size_of Event from 1856 B -> 720 B, prep for inline storage in offer `*Contents` #3723

phlip9 commented Apr 9, 2025 •

edited

Loading

Uh oh!

ldk-reviews-bot commented Apr 9, 2025 •

edited

Loading

Uh oh!

codecov bot commented Apr 9, 2025 •

edited

Loading

Uh oh!

tnull left a comment •

edited

Loading

Uh oh!

vincenzopalazzo left a comment

Uh oh!

wpaulino left a comment

Uh oh!

tnull commented Apr 9, 2025

Uh oh!

wpaulino commented Apr 9, 2025

Uh oh!

phlip9 commented Apr 9, 2025 •

edited

Loading

Uh oh!

phlip9 commented Apr 9, 2025 •

edited

Loading

Uh oh!

tnull commented Apr 10, 2025

Uh oh!

TheBlueMatt commented Apr 11, 2025

Uh oh!

TheBlueMatt commented May 13, 2025

Uh oh!

Uh oh!

ln: reduce size_of Event from 1856 B -> 720 B, prep for inline storage in offer *Contents #3723

Are you sure you want to change the base?

ln: reduce size_of Event from 1856 B -> 720 B, prep for inline storage in offer *Contents #3723

Conversation

phlip9 commented Apr 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ldk-reviews-bot commented Apr 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Apr 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

tnull left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

vincenzopalazzo left a comment

Choose a reason for hiding this comment

Uh oh!

wpaulino left a comment

Choose a reason for hiding this comment

Uh oh!

tnull commented Apr 9, 2025

Uh oh!

wpaulino commented Apr 9, 2025

Uh oh!

phlip9 commented Apr 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

phlip9 commented Apr 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tnull commented Apr 10, 2025

Uh oh!

TheBlueMatt commented Apr 11, 2025

Uh oh!

TheBlueMatt commented May 13, 2025

Uh oh!

Uh oh!

ln: reduce size_of Event from 1856 B -> 720 B, prep for inline storage in offer `*Contents` #3723

ln: reduce size_of Event from 1856 B -> 720 B, prep for inline storage in offer `*Contents` #3723

phlip9 commented Apr 9, 2025 •

edited

Loading

ldk-reviews-bot commented Apr 9, 2025 •

edited

Loading

codecov bot commented Apr 9, 2025 •

edited

Loading

tnull left a comment •

edited

Loading

phlip9 commented Apr 9, 2025 •

edited

Loading

phlip9 commented Apr 9, 2025 •

edited

Loading