Switch in-memory index to hash set #4665

SirTyson · 2025-03-11T17:06:19Z

Description

Significantly improve in-memory bucket performance.

This PR changes the in-memory bucket index from a vector to an unordered_set. In testing on my laptop, this doubles median lookup speed and, more importantly, removes all of the slow outliers seen with the vector approach.

To keep the memory footprint as small as possible, I've created a slightly hacky set that stores an impl class, which can hold either a key type or a value type. The intent is that the set only ever stores value-type entries, while key-type entries are used only for querying. C++20 adds transparent hashing (heterogeneous lookup) for unordered containers, which accomplishes the same goal in a much cleaner, more statically type-safe way, so we should refactor this once we upgrade.
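For readers unfamiliar with the key-or-value trick described above, here is a minimal sketch of the idea. It is not the code in this PR: `Key`, `Entry`, `SetItem`, and `InMemoryIndexSketch` are hypothetical names chosen for illustration, and the real entry types are considerably richer. The point is only that hashing and equality look at the key alone, so a cheap query-only item can locate a stored value without the set holding a separate copy of each key.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <memory>
#include <string>
#include <unordered_set>
#include <utility>

// Hypothetical stand-ins for the real key and bucket entry types.
struct Key
{
    std::string id;
};

struct Entry
{
    Key key;
    long balance;
};

// A set element that is either a stored value (owns an Entry) or a
// query-only key (borrows a Key for the duration of a lookup).
class SetItem
{
    std::shared_ptr<Entry const> mValue; // non-null for stored items
    Key const* mQueryKey;                // non-null for query items

  public:
    explicit SetItem(std::shared_ptr<Entry const> v)
        : mValue(std::move(v)), mQueryKey(nullptr)
    {
    }

    explicit SetItem(Key const& k) : mValue(nullptr), mQueryKey(&k)
    {
    }

    // The key is derived from the value when one is present, so stored
    // items do not carry a second copy of it.
    Key const&
    key() const
    {
        return mValue ? mValue->key : *mQueryKey;
    }

    std::shared_ptr<Entry const>
    value() const
    {
        return mValue;
    }
};

// Hash and equality consider only the key, never the payload.
struct SetItemHash
{
    std::size_t
    operator()(SetItem const& item) const
    {
        return std::hash<std::string>()(item.key().id);
    }
};

struct SetItemEq
{
    bool
    operator()(SetItem const& a, SetItem const& b) const
    {
        return a.key().id == b.key().id;
    }
};

class InMemoryIndexSketch
{
    std::unordered_set<SetItem, SetItemHash, SetItemEq> mEntries;

  public:
    // Only value-type items are ever inserted.
    void
    insert(Entry e)
    {
        mEntries.insert(SetItem(std::make_shared<Entry>(std::move(e))));
    }

    // Key-type items exist only as temporaries for querying.
    std::shared_ptr<Entry const>
    find(Key const& k) const
    {
        auto it = mEntries.find(SetItem(k));
        if (it == mEntries.end())
        {
            return nullptr;
        }
        return it->value();
    }
};
```

With this sketch, `idx.insert(Entry{Key{"alice"}, 100})` followed by `idx.find(Key{"alice"})` returns the stored entry, and a lookup for an absent key returns a null pointer. Nothing in the type system stops a query-only item from being inserted by mistake, which is the "hacky" part. Under C++20, giving both the hash and the equality functor an `is_transparent` member would let `find` accept a `Key` directly and remove the need for the dual-purpose item.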

Checklist

Reviewed the contributing document
Rebased on top of master (no merge commits)
Ran clang-format v8.0.0 (via make format or the Visual Studio extension)
Compiles
Ran all tests
If change impacts performance, include supporting evidence per the performance document

marta-lokhova

LGTM overall, just a few questions

src/bucket/InMemoryIndex.cpp

marta-lokhova · 2025-03-24T17:20:41Z

@SirTyson ah looks like this PR didn't merge due to some status check failures

SirTyson requested a review from marta-lokhova March 11, 2025 17:06

marta-lokhova requested changes Mar 21, 2025

src/bucket/InMemoryIndex.cpp (2 resolved review threads)

marta-lokhova enabled auto-merge March 22, 2025 00:26

marta-lokhova previously approved these changes Mar 22, 2025

marta-lokhova added this pull request to the merge queue Mar 22, 2025

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Mar 22, 2025

marta-lokhova added this pull request to the merge queue Mar 22, 2025

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Mar 22, 2025

SirTyson added this pull request to the merge queue Mar 24, 2025

SirTyson removed this pull request from the merge queue due to a manual request Mar 24, 2025

Switch in-memory index to hash set

3cd1714

SirTyson dismissed marta-lokhova’s stale review via 3cd1714 March 24, 2025 17:37

SirTyson force-pushed the in-memory-bucket-optimization branch from 11edf7f to 3cd1714 Compare March 24, 2025 17:37

marta-lokhova approved these changes Mar 24, 2025

marta-lokhova enabled auto-merge March 24, 2025 17:56

marta-lokhova added this pull request to the merge queue Mar 24, 2025

Merged via the queue into stellar:master with commit 91ce086 Mar 24, 2025
13 checks passed
