Add RVV optimizations for SIMD-based calculations in ScalarQuantizer.cpp #4503

vsvnakers · 2025-08-04T16:54:53Z

We have added an RVV implementation for RISC-V to ScalarQuantizer.cpp, with the existing NEON code as the benchmark for performance testing. We tested the scalar CPU and RVV versions of decode_8_components on a RISC-V board, and the RVV implementation was significantly faster (about 1.56x in the run shown below: 2450.067 ms scalar vs. 1571.824 ms RVV).

One of the RVV tests, covering decode_8_components:

Test Principle:

This program compares the execution time of Scalar decoding and RVV Vectorized decoding to verify the performance improvements brought by RVV optimization.

Scalar Decoding: Each byte is processed individually and decoded sequentially.
RVV Vectorized Decoding: Uses RVV vector instructions to process 8 bytes at once, utilizing parallel processing to accelerate the decoding.
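The two decoding paths described above can be sketched as follows. This is an illustration under stated assumptions, not the code from this PR: the decode formula `(c + 0.5f) / 255.0f`, the function names, and the choice of LMUL are all hypothetical, and the RVV branch assumes a toolchain that implements the v1.0 C intrinsics naming (`__riscv_` prefix). On any other target the vector entry point simply falls back to the scalar loop.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

#if defined(__riscv_v_intrinsic)
#include <riscv_vector.h>
#endif

// Hypothetical 8-bit codec: maps one byte to a float in (0, 1).
inline float decode_scalar(uint8_t c) {
    return (c + 0.5f) / 255.0f;
}

// Scalar path: one byte per iteration.
void decode_all_scalar(const uint8_t* code, float* out, size_t n) {
    assert(n == 0 || (code != nullptr && out != nullptr));
    for (size_t i = 0; i < n; i++) {
        out[i] = decode_scalar(code[i]);
    }
}

// Vector path: strip-mined RVV loop where available, scalar otherwise.
void decode_all_vector(const uint8_t* code, float* out, size_t n) {
#if defined(__riscv_v_intrinsic)
    for (size_t i = 0; i < n;) {
        // vl = number of elements handled this pass (8 when VLEN is 128).
        size_t vl = __riscv_vsetvl_e32m2(n - i);
        vuint8mf2_t bytes = __riscv_vle8_v_u8mf2(code + i, vl);
        // Widen u8 -> u32, then convert to float.
        vuint32m2_t wide = __riscv_vzext_vf4_u32m2(bytes, vl);
        vfloat32m2_t f = __riscv_vfcvt_f_xu_v_f32m2(wide, vl);
        f = __riscv_vfadd_vf_f32m2(f, 0.5f, vl);
        f = __riscv_vfdiv_vf_f32m2(f, 255.0f, vl);
        __riscv_vse32_v_f32m2(out + i, f, vl);
        i += vl;
    }
#else
    decode_all_scalar(code, out, n);
#endif
}
```

The vector branch divides rather than multiplying by a reciprocal so that both paths perform the same floating-point operations and can be compared element by element.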

Test Procedure:

Initial Data: Create a random byte array named code.
Scalar Decoding: Decode each byte sequentially and measure the time.
RVV Decoding: Decode 8 bytes at once using RVV vector instructions and measure the time.
Timing: Record the time for Scalar decoding and RVV decoding.
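A minimal harness for the procedure above might look like the sketch below. The names (`make_random_codes`, `run_benchmark`, `BenchResult`) are hypothetical and this is not the test program used for the numbers in this PR. It assumes the decoder under test has the signature `void(const uint8_t*, float*, size_t)`, and it treats the dummy sum as the sum of all decoded values, which also keeps the compiler from discarding the decode work.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <random>
#include <vector>

struct BenchResult {
    double elapsed_ms; // wall-clock time for all repeats
    double dummy_sum;  // sum of every decoded value
};

// Step 1: random input bytes. A fixed seed gives both paths the same data.
std::vector<uint8_t> make_random_codes(size_t n, uint32_t seed = 123) {
    std::mt19937 rng(seed);
    std::uniform_int_distribution<int> dist(0, 255);
    std::vector<uint8_t> code(n);
    for (auto& c : code) {
        c = static_cast<uint8_t>(dist(rng));
    }
    return code;
}

// Steps 2-4: run decode_fn over the whole array `repeats` times and time it.
// The summation is inside the timed region, so it is charged to both paths.
template <typename DecodeFn>
BenchResult run_benchmark(
        DecodeFn decode_fn,
        const std::vector<uint8_t>& code,
        int repeats) {
    assert(repeats > 0);
    std::vector<float> out(code.size());
    double sum = 0.0;
    auto t0 = std::chrono::steady_clock::now();
    for (int r = 0; r < repeats; r++) {
        decode_fn(code.data(), out.data(), code.size());
        for (float v : out) {
            sum += v;
        }
    }
    auto t1 = std::chrono::steady_clock::now();
    double ms = std::chrono::duration<double, std::milli>(t1 - t0).count();
    return {ms, sum};
}
```

Calling `run_benchmark` once with the scalar decoder and once with the vector decoder on the same array yields two times to compare and two dummy sums that should match.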

Output:

Dummy sum: Used to ensure consistency in calculations.
RVV execution time: Time taken for RVV decoding.
Scalar execution time: Time taken for scalar decoding.

[root@EulixOS ~]# ./decode_8
Dummy sum = 9476415.0000
RVV time    = 1571.824 ms
Scalar time = 2450.067 ms

meta-cla · 2025-08-04T16:54:59Z

Hi @vsvnakers!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at [email protected]. Thanks!

Signed-off-by: lyd1992 <[email protected]> Signed-off-by: vsvnakers <[email protected]>

vsvnakers · 2025-08-15T06:41:33Z

I hope you’re all doing well! I wanted to kindly follow up on my PR (#4503), which has been open for about a week.
This PR adds an RVV optimization for ScalarQuantizer, and I’d greatly appreciate it if one of the maintainers (@mnorris11, @Enet4, @alexanderguzhva) could take a look when you have a chance.

If there are any issues, missing details, or areas that could be improved, please feel free to let me know — I’m more than happy to make adjustments.
Thank you very much for your time and support in helping move this forward! 🙏

alexanderguzhva · 2025-08-15T13:18:27Z

@vsvnakers my comments are the following

please remove unneeded debugging commented out code, such as // printf(...);
please remove unneeded meaningful commented out code, such as // vfloat32m1x2_t res;
please remove debugging code with active printf(...); lines, unless it is critically important debugging info (which needs to be wrapped in an if (verbose) {} construct).
please keep existing code style, so there's no need to change if (cond) { do_it(); } into if (cond) do_it(); for one-liners. The rules for formatting the code can be found in included .clang-format file(s)
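As an illustration of the last two points, a small sketch is given here. The function `count_nonzero` and its `verbose` parameter are hypothetical and do not come from the PR; the only things shown are diagnostic output guarded by `if (verbose) {}` and braces kept on one-line bodies.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdio>

// Hypothetical routine, used only to show the two conventions.
size_t count_nonzero(const uint8_t* code, size_t n, bool verbose = false) {
    assert(n == 0 || code != nullptr);
    size_t count = 0;
    for (size_t i = 0; i < n; i++) {
        // Braces are kept even though the body is a single statement.
        if (code[i] != 0) {
            count++;
        }
    }
    // Diagnostic output is emitted only when the caller asks for it.
    if (verbose) {
        std::printf("count_nonzero: %zu of %zu bytes\n", count, n);
    }
    return count;
}
```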

overall, the change looks good

mnorris11 · 2025-08-15T13:47:20Z

Thanks @alexanderguzhva for taking a look

Sorry @vsvnakers, it fell through the cracks during a change in the on-call rotation for reviewing issues.

for context, we are working on a large refactor of the library to allow easier maintenance of SIMD code.

@subhadeepkaran do you want to see how compatible this one is?

vsvnakers · 2025-08-19T08:35:50Z

Hi @alexanderguzhva @mnorris11, thanks a lot for the detailed feedback 🙏
I’ve addressed the comments (removed the unnecessary debug/commented code and kept the existing code style as suggested).
When you have a moment, could you please take another look? I’d really appreciate it!

alexanderguzhva · 2025-08-19T11:59:15Z

@vsvnakers the code looks good

mnorris11 · 2025-08-19T15:14:44Z

@vsvnakers , thanks for this contribution! After the large SIMD refactor, we can start reviewing it. ETA this month. @subhadeepkaran to check

mdouze · 2025-08-29T08:16:53Z

Hi! AFAICS, the diff is still mainly formatting comments. Please use the clang formatter to make it readable. Thanks!

vsvnakers · 2025-09-04T07:53:32Z

Hi @mdouze, I reformatted the file using clang-format as recommended.
Could you please take another look when you have time? Thanks a lot!

mdouze · 2025-09-07T20:30:06Z

@vsvnakers, thanks a lot, the PR is much more readable now.
As @mnorris11 says, we are busy reshaping all the SIMD code, mainly to avoid the many ifdefs related to different SIMD flavors. This will impact any SIMD PRs, so we'd rather have a PR against that new API. You can see PR #4557 about how this is going to be implemented.
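For readers unfamiliar with the general idea, one common way to avoid per-flavor ifdefs is to select an implementation through a template parameter and a single dispatch point. The sketch below is only an illustration of that pattern: `SimdLevel`, `Decoder8bit` and `decode_dispatch` are hypothetical names, the `Wide8` variant is plain C++ standing in for a real vector ISA, and none of this is claimed to match the API in PR #4557.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical set of SIMD flavors.
enum class SimdLevel { None, Wide8 };

// Primary template is only declared; each flavor gets a specialization.
template <SimdLevel L>
struct Decoder8bit;

template <>
struct Decoder8bit<SimdLevel::None> {
    static void decode(const uint8_t* code, float* out, size_t n) {
        for (size_t i = 0; i < n; i++) {
            out[i] = (code[i] + 0.5f) / 255.0f;
        }
    }
};

template <>
struct Decoder8bit<SimdLevel::Wide8> {
    // Stand-in for a vector ISA: handles 8 bytes per outer iteration.
    static void decode(const uint8_t* code, float* out, size_t n) {
        size_t i = 0;
        for (; i + 8 <= n; i += 8) {
            for (size_t j = 0; j < 8; j++) {
                out[i + j] = (code[i + j] + 0.5f) / 255.0f;
            }
        }
        // Tail: fewer than 8 bytes left.
        for (; i < n; i++) {
            out[i] = (code[i] + 0.5f) / 255.0f;
        }
    }
};

// Single dispatch point: callers contain no flavor-specific ifdefs.
inline void decode_dispatch(
        SimdLevel level,
        const uint8_t* code,
        float* out,
        size_t n) {
    assert(n == 0 || (code != nullptr && out != nullptr));
    switch (level) {
        case SimdLevel::Wide8:
            Decoder8bit<SimdLevel::Wide8>::decode(code, out, n);
            return;
        case SimdLevel::None:
            break;
    }
    Decoder8bit<SimdLevel::None>::decode(code, out, n);
}
```

With this layout, adding a new flavor means adding one enum value, one specialization, and one case in the dispatcher, while the calling code stays unchanged.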

vsvnakers force-pushed the rvv-support branch from 318ba22 to 1a1c57a on August 5, 2025 02:39

meta-cla bot added the CLA Signed label Aug 5, 2025

Add RVV optimizations for SIMD-based calculations in ScalarQuantizer.cpp

9d8fb46

Signed-off-by: lyd1992 <[email protected]> Signed-off-by: vsvnakers <[email protected]>

vsvnakers force-pushed the rvv-support branch from 1a1c57a to 9d8fb46 on August 7, 2025 08:02

fix: cleanup debug code and follow project code style

b8b3c22

Signed-off-by: lyd1992 <[email protected]> Signed-off-by: vsvnakers <[email protected]>

vsvnakers force-pushed the rvv-support branch from 850ccd8 to b8b3c22 on August 19, 2025 08:24

vsvnakers added 2 commits September 1, 2025 16:57

using clang-format -i ScalarQuantizer.cpp

c01ba12

Signed-off-by: lyd1992 <[email protected]> Signed-off-by: vsvnakers <[email protected]>

Normalize line endings to LF

8485e20

Signed-off-by: lyd1992 <[email protected]> Signed-off-by: vsvnakers <[email protected]>
