Using OrdMap::entry performs worse than separate map method calls #167

SimonSapin · 2020-12-21T18:27:16Z

Consider two implementations of the same algorithm:

match map.get(&key) {
    Some(old_value) => map.insert(key, merge(old_value, new_value)),
    None => map.insert(key, new_value),
}

match map.entry(key) {
    Entry::Occupied(occupied) => occupied.insert(merge(occupied.get(), new_value)),
    Entry::Empty(empty) => empty.insert(new_value),
};

With std’s HashMap, using entry is more efficient since the returned Entry carries extra information to speed up its methods. The code above effectively only does one "map lookup", whereas the get+insert variant does two.

With OrdMap however, using entry is measurably slower in a real-world Mercurial benchmark. The returned Entry does carry extra information, OccupiedEntry::get just calls OrdMap::get, and OccupiedEntry::insert call OrdMap::get then OrdMap::get_mut. So in the occupied case, the entry-based code does four map lookups instead of one. (Three if get_mut is optimized away as its result is unused.)

im-rs/src/ord/map.rs

Lines 1551 to 1558 in 3f4e01a

    
           pub struct OccupiedEntry<'a, K, V> 
        
           where 
        
               K: Ord + Clone, 
        
               V: Clone, 
        
           { 
        
               map: &'a mut OrdMap<K, V>, 
        
               key: K, 
        
           }

im-rs/src/ord/map.rs

Lines 1608 to 1615 in 3f4e01a

    
           pub struct VacantEntry<'a, K, V> 
        
           where 
        
               K: Ord + Clone, 
        
               V: Clone, 
        
           { 
        
               map: &'a mut OrdMap<K, V>, 
        
               key: K, 
        
           }

Better efficiency through fewer lookups is what motivates the existence of the Entry API in std, but on OrdMap it has the opposite effect.

Could this be optimized, by having OccupiedEntry and VacantEntry each contain a PoolRef<Node> and whatever else is needed to make their methods very cheap? If not, should the entry be considered for removal since instead of bringing efficiency benefits it is actively worse?

The text was updated successfully, but these errors were encountered:

arthurprs · 2021-02-05T09:29:01Z

A word of caution since you mentioned real-world, there are some cripling bugs in OrdMap once the tree gains a 3rd level. I've been trying to keep it fixed here #154

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using OrdMap::entry performs worse than separate map method calls #167

Using OrdMap::entry performs worse than separate map method calls #167

SimonSapin commented Dec 21, 2020

arthurprs commented Feb 5, 2021 •

edited

Loading

Using OrdMap::entry performs worse than separate map method calls #167

Using OrdMap::entry performs worse than separate map method calls #167

Comments

SimonSapin commented Dec 21, 2020

arthurprs commented Feb 5, 2021 • edited Loading

arthurprs commented Feb 5, 2021 •

edited

Loading