Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Change spatial traversal to fix perf regression from #897 #976

Merged
merged 2 commits into from
Dec 7, 2023

Conversation

aprokop
Copy link
Contributor

@aprokop aprokop commented Dec 7, 2023

Fix perf regression from #897

On Saturn with the standard HACC 37M problem (eps=0.042), the traversal time goes down (compared to master (fbd2584)):

minpts = 2 (query+cluster timer):
FDBSCAN: 0.147 -> 0.114
FDBSCAN-DenseBox: 0.154 -> 0.126

minpts = 10
FDBSCAN
neigh: 0.058 -> 0.057
query: 0.137 -> 0.115

FDBSCAN-DenseBox
neigh: 0.057 -> 0.057
query: 0.161 -> 0.129

This patch should restore most of the lost performance due to APIv2 refactoring:
apiv2_timeline

@aprokop aprokop added the performance Something is slower than it should be label Dec 7, 2023
Copy link
Contributor

@dalg24 dalg24 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice. Just less branching effectively?

@aprokop
Copy link
Contributor Author

aprokop commented Dec 7, 2023

I've run the bvh benchmark on Saturn for Serial and CUDA, and surprisingly, it showed no difference in times. Maybe that's why we did not catch the regression.
master_fbd25845.json
pr976_e61159fb.json

@dalg24
Copy link
Contributor

dalg24 commented Dec 7, 2023

What is "Saturn" again?

@aprokop
Copy link
Contributor Author

aprokop commented Dec 7, 2023

Nice. Just less branching effectively?

It's a bit of mystery for me. Possibly. I'm pretty sure I played around with different traversals in #897 but could not find a good one. It could be that Power9 was regressing or something.

@aprokop
Copy link
Contributor Author

aprokop commented Dec 7, 2023

What is "Saturn" again?

One of the University of Oregon OACISS center systems.

@aprokop aprokop mentioned this pull request Dec 7, 2023
@dalg24
Copy link
Contributor

dalg24 commented Dec 7, 2023

HIP build failure was "No space left on device"
I am ok to merge as is.

@aprokop
Copy link
Contributor Author

aprokop commented Dec 7, 2023

All pass (HIP did not run).

@aprokop aprokop merged commit 851e915 into arborx:master Dec 7, 2023
1 check passed
@aprokop aprokop deleted the fixing_perf branch December 7, 2023 04:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
performance Something is slower than it should be
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants