Use shared memory positions in per molecule small kernels #61
base: main
Greptile Overview
| Filename | Score | Overview |
|---|---|---|
| src/minimizer/bfgs_minimize_permol_kernels.cu | 5/5 | Updated to use local position buffers (shared memory) instead of global arrays, removing unnecessary global memory copies during line search |
| src/forcefields/dist_geom_kernels.cu | 5/5 | Refactored to pass pre-offset molecule coordinates to device functions for better shared memory utilization |
| src/forcefields/dist_geom_kernels_device.cuh | 5/5 | Changed function signatures to accept pre-offset molecule coordinates instead of global arrays with internal offsetting |
| src/forcefields/mmff_kernels.cu | 5/5 | Updated kernel calls to pass molecule-local coordinate pointers to device functions |
| src/forcefields/mmff_kernels_device.cuh | 5/5 | Modified device function signatures to receive pre-offset coordinate pointers for shared memory optimization |
| src/CMakeLists.txt | 5/5 | Added bfgs as PUBLIC dependency to etkdg library for proper linking |
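The signature change described for the `*_device.cuh` files can be illustrated with a small host-side C++ sketch. This is not the project's code: the function names, the `atomStarts` offset table, the `double` coordinate type, and the sum-of-squares "energy" are all assumptions chosen for illustration. In the real kernels the functions would be `__device__` functions and the molecule-local pointer could refer to shared memory.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical "before" shape: the function takes the batch-wide coordinate
// array and locates its molecule internally through an atom offset table.
inline double sumSquaredGlobal(const double* allPos, const int* atomStarts, int molIdx) {
  const int first = atomStarts[molIdx];
  const int numAtoms = atomStarts[molIdx + 1] - first;
  double total = 0.0;
  for (int i = 0; i < 3 * numAtoms; ++i) {
    const double x = allPos[3 * first + i];
    total += x * x;
  }
  return total;
}

// Hypothetical "after" shape: the caller applies the offset once and passes a
// molecule-local pointer. The function no longer cares whether that pointer
// refers to the global array or to a local (e.g. shared memory) buffer.
inline double sumSquaredLocal(const double* molPos, int numAtoms) {
  assert(molPos != nullptr && numAtoms >= 0);
  double total = 0.0;
  for (int i = 0; i < 3 * numAtoms; ++i) {
    total += molPos[i] * molPos[i];
  }
  return total;
}

// Caller-side offsetting: copy one molecule's coordinates into a local buffer
// (standing in for shared memory) and hand the buffer to the local variant.
inline double sumSquaredViaLocalBuffer(const std::vector<double>& allPos,
                                       const std::vector<int>& atomStarts, int molIdx) {
  const int first = atomStarts[molIdx];
  const int numAtoms = atomStarts[molIdx + 1] - first;
  const std::vector<double> localPos(allPos.begin() + 3 * first,
                                     allPos.begin() + 3 * (first + numAtoms));
  return sumSquaredLocal(localPos.data(), numAtoms);
}
```

Because the offset is applied by the caller, the same device function can be fed either `allPos + 3 * first` or a local buffer, which is what lets the per-molecule kernels switch buffers without touching the energy code.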
Additional Comments (1)
- src/minimizer/bfgs_minimize_permol_kernels.cu, lines 617-621: style: unnecessary copy to global memory during line search. Energy functions now read from `scratchPos`, so this write has no effect until the position is accepted (line 661).
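The point of this comment can be sketched in host-side C++ as follows. Everything except the name `scratchPos` is an assumption: the function names, the sum-of-squares energy, and the plain "energy decreased" acceptance test are simplifications, not the kernel's actual line search. The sketch only shows that trial positions can stay in a scratch buffer, with a single write to the global array once a step is accepted.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Stand-in energy (sum of squares); the real kernels evaluate force-field terms.
inline double energy(const std::vector<double>& pos) {
  double total = 0.0;
  for (double x : pos) total += x * x;
  return total;
}

// Simplified backtracking line search. Trial positions live only in scratchPos;
// globalPos is written once, when a trial step is accepted.
// Returns the number of writes made to globalPos (0 or 1).
inline int lineSearch(std::vector<double>& globalPos, const std::vector<double>& dir,
                      double step, int maxIters) {
  assert(globalPos.size() == dir.size());
  const std::vector<double> start = globalPos;   // local working copy
  std::vector<double> scratchPos(start.size());  // trial positions
  const double startEnergy = energy(start);
  int globalWrites = 0;
  for (int iter = 0; iter < maxIters; ++iter) {
    for (std::size_t i = 0; i < start.size(); ++i) {
      scratchPos[i] = start[i] + step * dir[i];
    }
    // The energy is evaluated on the scratch buffer, so no global write is
    // needed for rejected trial steps.
    if (energy(scratchPos) < startEnergy) {
      globalPos = scratchPos;  // the only write to global memory
      ++globalWrites;
      break;
    }
    step *= 0.5;  // shrink the step and try again
  }
  return globalWrites;
}
```

With this structure, a rejected trial step costs no global memory traffic for positions.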
6 files reviewed, 1 comment
6 files reviewed, no comments
stslxg-nv left a comment:
LGTM
We'd migrated gradients and other shared buffers but were still using global arrays for positions.