Add set operations to `@immut/hash{map, set}` and `@internal/sparse_array` Summary #2145

Asterless · 2025-05-21T21:37:12Z

Related Issue

Fixes #1830: Efficient set operations for @immut/hash{map, set}

Changes

`@immut/hashmap`

union_with(f: (K, V, V) => V)
Merges two hashmaps, resolving key conflicts with a custom function f.
intersection()
Returns a new hashmap containing keys present in both input maps, with values from the first map.
intersection_with(f: (K, V, V) => V)
Computes intersection, resolving overlapping keys' values with function f.
difference()
Returns entries present in the first map but not in the second.

`@immut/hashset`

intersection()
Returns a new set containing elements common to both input sets.
difference()
Returns elements present in the first set but not in the second.

`@internal/sparse_array`

intersection()
Computes index-wise intersection of two sparse arrays.
difference()
Computes index-wise difference between two sparse arrays.

Motivation

These changes provide a more complete and consistent set of set operations for immutable collections, making it easier to perform common set algebra tasks and improving API parity across collection types.

Tests

Added and updated unit tests for all new and modified methods to ensure correctness and expected behavior.

Checklist

All new and existing tests pass
Code is formatted and documented where appropriate

…ence methods, now they can handle branchs

peter-jerry-ye-code-review · 2025-05-21T21:37:40Z

Inconsistent naming pattern in method declarations

Category
Maintainability
Code Snippet
pub fn[K : Eq + Hash, V] union(self : T[K, V], other : T[K, V]) -> T[K, V]
pub fn[K : Eq + Hash, V] T::union(self : T[K, V], other : T[K, V]) -> T[K, V]
Recommendation
Consistently use T:: prefix for all methods. Update the union function to match other new methods using T:: prefix.
Reasoning
The codebase shows mixed usage of method declaration styles between old and new code. Using consistent T:: prefix improves readability and maintains uniform method declaration pattern across the codebase.

Redundant checks in hashset intersection

Category
Performance
Code Snippet
pub fn[K : Eq + Hash] T::intersection(self : T[K], other : T[K]) -> T[K] {
match (self, other) {
...
(Branch(sa1), Branch(sa2)) => {
let res = sa1.intersection(sa2, fn(m1, m2) { m1.intersection(m2) })
if res.size() == 0 { Empty } else { Branch(res) }
}
Recommendation
Remove redundant size check and directly return Branch(res), as empty sparse arrays are already handled by the Empty pattern match.
Reasoning
The size check is unnecessary since empty results are already handled by the Empty constructor in the match pattern. Removing it simplifies the code and eliminates a redundant operation.

Missing documentation for type constraints in hashmap intersection operation

Category
Correctness
Code Snippet
pub fn[K : Eq + Hash, V] T::intersection_with(
self : T[K, V],
other : T[K, V],
f : (K, V, V) -> V
) -> T[K, V]
Recommendation
Add documentation explaining the type constraints (Eq + Hash) and f function requirements, for example:
///| Combines overlapping entries using function f.
///| K must implement Eq + Hash traits.
///| f receives the key and both values, returns combined value.
Reasoning
Type constraints and function parameters need clear documentation to help users understand requirements and correct usage. This is especially important for generic operations with special constraints.

coveralls · 2025-05-21T21:39:13Z

Pull Request Test Coverage Report for Build 7010

Details

52 of 103 (50.49%) changed or added relevant lines in 3 files are covered.
1 unchanged line in 1 file lost coverage.
Overall coverage decreased (-0.5%) to 92.068%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
immut/internal/sparse_array/sparse_array.mbt	15	16	93.75%
immut/hashset/HAMT.mbt	7	26	26.92%
immut/hashmap/HAMT.mbt	30	61	49.18%

Files with Coverage Reduction	New Missed Lines	%
bench/stats.mbt	1	85.48%

Totals
Change from base Build 7006:	-0.5%
Covered Lines:	8798
Relevant Lines:	9556

💛 - Coveralls

…ence methods, now they can handle branchs

…o feature/20250522_HAMT

Add set operations to @immut/hash{map, set} and @internal/sparse_array Summary and re-fmt

immut/hashmap/HAMT.mbt

…ype prefix

FlyCloudC · 2025-05-30T15:24:40Z

The issues I reported seem to still exist in the merged PR. Here's the test evidence showing the problematic behavior

///|
priv type MyInt Int derive(Eq, Show)

///|
impl Hash for MyInt with hash_combine(_, _) {
  panic()
}

///|
impl Hash for MyInt with hash(self) {
  self._
}

///|
test {
  let m1 = new().add(MyInt(0b1_00001), 1)
  let m2 = m1.add(MyInt(0b11_00001), 1)
  inspect(m1.to_array(), content="[(MyInt(33), 1)]")
  inspect(m2.to_array(), content="[(MyInt(33), 1), (MyInt(97), 1)]")
  inspect(m2.difference(m1).to_array(), content="[]") // should be [(MyInt(97), 1)]
}

Another example without MyInt:

test {
  let m1 = for i = 0, m1 = new(); i < 100; {
    continue i + 1, m1.add(i, i)
  } else {
    m1
  }
  let m2 = m1.add(200, 200)
  inspect(m2.difference(m1), content="@immut/hashmap.of([])") // should be [(200, 200)]
}

Asterless · 2025-05-31T03:31:47Z

Sorry, it seems that I mistakenly associated the issue. Please ignore my operation🙂

FlyCloudC · 2025-05-31T03:37:41Z

Ah, just to be clear - I'm talking about the code review comments I left on this pull request above, not about any independent issue ticket in the repo.

…arse_array` (moonbitlang#2145) * feat: add four new functions to HAMT and their tests * fix(union_with): fix union_with for HAMT,now it can handle branch * feat(sparse_array): add intersection and difference methods * fix(HAMT): fix union_with, intersection, intersection_with and difference methods, now they can handle branchs * feat(hashset): add intersection and difference methods to hashset * commit other files * feat: add four new functions to HAMT and their tests * fix(union_with): fix union_with for HAMT,now it can handle branch * feat(sparse_array): add intersection and difference methods * fix(HAMT): fix union_with, intersection, intersection_with and difference methods, now they can handle branchs * feat(hashset): add intersection and difference methods to hashset * style: change the position of some function declarations * fix: fix formatting of the code * feat: Update the function declarations of hash tables and sparse arrays * refactor:update mbti * refactor: update hashmap and hashset function signatures to include type prefix --------- Co-authored-by: 东灯 <[email protected]>

Asterless added 5 commits May 22, 2025 03:06

feat: add four new functions to HAMT and their tests

a9b3888

fix(union_with): fix union_with for HAMT,now it can handle branch

09b6a99

feat(sparse_array): add intersection and difference methods

1b84483

fix(HAMT): fix union_with, intersection, intersection_with and differ…

0cdca5b

…ence methods, now they can handle branchs

feat(hashset): add intersection and difference methods to hashset

44c0a79

Asterless marked this pull request as ready for review May 21, 2025 21:39

Asterless added 6 commits May 22, 2025 05:45

commit other files

f80c9f2

feat: add four new functions to HAMT and their tests

7f2524c

fix(union_with): fix union_with for HAMT,now it can handle branch

fc71f34

feat(sparse_array): add intersection and difference methods

5adb5ce

fix(HAMT): fix union_with, intersection, intersection_with and differ…

8d06b82

…ence methods, now they can handle branchs

feat(hashset): add intersection and difference methods to hashset

eb6ef82

bobzhang force-pushed the feature/20250522_HAMT branch from 44c0a79 to eb6ef82 Compare May 22, 2025 01:12

Asterless added 2 commits May 22, 2025 10:27

Merge branch 'feature/20250522_HAMT' of github.com:Asterless/core int…

61ca5f0

…o feature/20250522_HAMT

style: change the position of some function declarations

a0d7669

peter-jerry-ye requested a review from Lampese May 22, 2025 02:46

Asterless and others added 2 commits May 22, 2025 11:10

fix: fix formatting of the code

29c51a5

Merge pull request #1 from Asterless/feature/20250522_HAMT

4b46d45

Add set operations to @immut/hash{map, set} and @internal/sparse_array Summary and re-fmt

Asterless mentioned this pull request May 22, 2025

Fix formatting of the code from [pull#2145](https://github.com/moonbitlang/core/pull/2145) #2147

Closed

peter-jerry-ye mentioned this pull request May 22, 2025

fix conflit for 2145 #2148

Closed

Asterless and others added 8 commits May 22, 2025 21:18

Merge branch 'main' of github.com:Asterless/core

256557a

feat: Update the function declarations of hash tables and sparse arrays

dab49f5

Merge branch 'main' into feature/20250522_HAMT

779268b

Merge branch 'main' into feature/20250522_HAMT

3f120f1

Merge branch 'main' into feature/20250522_HAMT

34bce14

Merge branch 'main' into feature/20250522_HAMT

3b99be9

Merge branch 'main' into feature/20250522_HAMT

139cda1

Merge branch 'main' into feature/20250522_HAMT

0f648d5

bobzhang requested a review from Guest0x0 May 25, 2025 01:00

bobzhang reviewed May 25, 2025

View reviewed changes

immut/hashmap/HAMT.mbt Show resolved Hide resolved

Lampese and others added 4 commits May 25, 2025 19:22

Merge branch 'main' into feature/20250522_HAMT

d72a3c0

Merge branch 'main' into feature/20250522_HAMT

155aec3

refactor:update mbti

04bfb69

Merge branch 'main' into feature/20250522_HAMT

7eef793

CAIMEOX approved these changes May 30, 2025

View reviewed changes

refactor: update hashmap and hashset function signatures to include t…

47a7a00

…ype prefix

CAIMEOX merged commit ea0d2ef into moonbitlang:main May 30, 2025
12 checks passed

Asterless deleted the feature/20250522_HAMT branch May 30, 2025 16:52

This was referenced Jun 4, 2025

Refactor immut/{set, map, sparse_array} #2205

Closed

Add map and map_with_key Methods to Map #2210

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add set operations to `@immut/hash{map, set}` and `@internal/sparse_array` Summary #2145

Add set operations to `@immut/hash{map, set}` and `@internal/sparse_array` Summary #2145

Uh oh!

Asterless commented May 21, 2025

Uh oh!

peter-jerry-ye-code-review bot commented May 21, 2025 •

edited

Loading

Uh oh!

coveralls commented May 21, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

FlyCloudC commented May 30, 2025 •

edited

Loading

Uh oh!

Asterless commented May 31, 2025

Uh oh!

FlyCloudC commented May 31, 2025 •

edited

Loading

Uh oh!

Uh oh!

Add set operations to @immut/hash{map, set} and @internal/sparse_array Summary #2145

Add set operations to @immut/hash{map, set} and @internal/sparse_array Summary #2145

Uh oh!

Conversation

Asterless commented May 21, 2025

Related Issue

Changes

@immut/hashmap

@immut/hashset

@internal/sparse_array

Motivation

Tests

Checklist

Uh oh!

peter-jerry-ye-code-review bot commented May 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coveralls commented May 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Test Coverage Report for Build 7010

Details

💛 - Coveralls

Uh oh!

Uh oh!

Uh oh!

FlyCloudC commented May 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Asterless commented May 31, 2025

Uh oh!

FlyCloudC commented May 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Add set operations to `@immut/hash{map, set}` and `@internal/sparse_array` Summary #2145

Add set operations to `@immut/hash{map, set}` and `@internal/sparse_array` Summary #2145

`@immut/hashmap`

`@immut/hashset`

`@internal/sparse_array`

peter-jerry-ye-code-review bot commented May 21, 2025 •

edited

Loading

coveralls commented May 21, 2025 •

edited

Loading

FlyCloudC commented May 30, 2025 •

edited

Loading

FlyCloudC commented May 31, 2025 •

edited

Loading