Bunch of fixes and changes #58

lukamac · 2025-04-10T14:29:56Z

I have been working on adding support for an updated Neureka gvsoc model. I was also writing a script that fakes quantization and wanted to use the Generic target to test it.
This is a collection of small fixes and changes I needed along the way. I am opening the pull request now because I am abandoning the effort on the Generic platform and going straight to the Siracusa platform.

Best reviewed commit by commit.

Added

Check for the CMAKE environment variable
Tensor name mangling
Identity operation removal
_unpack_const helper function in NodeParser to allow node attributes that are either direct Constant tensors or direct numpy values
load_file_to_local in dory_mem as a way to load values directly into a local memory (not RAM). Needed for copying values from flash to wmem for Neureka v2.
add_gvsoc_v2_emulation macro: renamed to reflect that it is the same macro as in pulp-sdk but uses a different (standalone) gvsoc, and given a target argument to allow reuse. This might clash with whatever has been developed in parallel on the topic of standalone gvsoc.
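
To illustrate the identity operation removal listed above, here is a minimal sketch of such a pass. It is not the code from this PR: the `Node` class and `remove_identity` function are hypothetical stand-ins for the real graph representation, and the choice to keep an Identity that produces a graph output is an assumption made to preserve output names.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """Hypothetical stand-in for a graph node (not the Deeploy/ONNX type)."""
    op: str
    inputs: list
    outputs: list


def remove_identity(nodes, graph_outputs=()):
    """Drop Identity nodes and rewire consumers to the Identity's input."""
    alias = {}  # Identity output name -> Identity input name
    kept = []
    for node in nodes:
        # Assumption: an Identity producing a graph output is kept,
        # so that the output tensor name does not change.
        if node.op == "Identity" and node.outputs[0] not in graph_outputs:
            alias[node.outputs[0]] = node.inputs[0]
        else:
            kept.append(node)

    def resolve(name):
        # Follow chains of removed Identity nodes back to the real producer.
        while name in alias:
            name = alias[name]
        return name

    for node in kept:
        node.inputs = [resolve(name) for name in node.inputs]
    return kept


graph = [
    Node("Conv", ["x", "w"], ["a"]),
    Node("Identity", ["a"], ["b"]),
    Node("Identity", ["b"], ["c"]),
    Node("Relu", ["c"], ["y"]),
]
pruned = remove_identity(graph, graph_outputs=("y",))
```

After the pass, the Relu node reads directly from `a` and both Identity nodes are gone.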

Changed

duplicateConstants now also duplicates Constant nodes
Check the float output define in the DeeployTest Generic platform
kernel_shape is now inferred from the weight shape if not present, as per the ONNX spec
USE_NEUREKA moved into TargetLibraries, where it is closer to pulp-nnx
Hex dumping logic for PULP platforms, in preparation for Neureka v2, where weights need to be saved to flash and moved to wmem at runtime

Fixed

RequantShift when log2d is 0
Missing math.h headers
clang on macOS doesn't support the -Wl,--gc-sections flag, so it was moved into each target, and the host target now checks the host system
-ffast-math caused numerical errors on Generic, so it was moved into each target and removed from Generic, since I see that one as the debug target
Gather kernel on the Generic target
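
For context on the RequantShift item, here is a minimal sketch of why a shift of 0 needs special handling. It assumes the common formulation where a rounding term of `1 << (log2d - 1)` is added before the right shift; the function name, argument names, and int8 clipping range are illustrative and not taken from this PR. With `log2d == 0` the rounding term would require a shift by -1, which is undefined behaviour in C and raises `ValueError` in Python.

```python
def requant_shift(x, mul, add, log2d, lo=-128, hi=127):
    """Illustrative requantization: (x * mul + add) / 2**log2d, rounded and clipped."""
    acc = x * mul + add
    if log2d > 0:
        # Round half up. Skipped for log2d == 0, where no bits are
        # shifted out and 1 << (log2d - 1) would be a negative shift.
        acc += 1 << (log2d - 1)
    acc >>= log2d
    return max(lo, min(hi, acc))
```

For example, `requant_shift(5, 3, 1, 2)` computes `(16 + 2) >> 2`, which is 4, while `requant_shift(5, 3, 1, 0)` returns the accumulator unchanged, 16.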

PR Merge Checklist

The PR is rebased on the latest devel commit and pointing to devel.
Your PR is reviewed and approved.
All checks are passing.
The CHANGELOG.md file has been updated.
If the docker was modified, change back its link after review.

Those flags are not actually common. -Wl,--gc-sections does not exist in Apple's LLVM build and should be replaced by -Wl,-dead_strip, according to this [Stack Overflow post](https://stackoverflow.com/a/24799865). The -ffast-math flag caused some numerical errors, probably due to aggressive optimizations, so I disabled it for the Generic target, since that one is meant more for debugging.
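
The host-system check described above can be sketched as follows. In the repository this decision lives in the CMake files; the Python function below is a hypothetical illustration of the same logic, and its name is not part of the PR.

```python
import platform


def dead_code_link_flag(system=None):
    """Return the linker flag that strips unused sections on the given host."""
    # platform.system() reports "Darwin" on macOS.
    system = system or platform.system()
    if system == "Darwin":
        return "-Wl,-dead_strip"   # Apple's linker
    return "-Wl,--gc-sections"     # GNU ld and compatible linkers
```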


lukamac requested review from Victor-Jung and Xeratec as code owners April 10, 2025 14:29

lukamac added 12 commits May 6, 2025 13:00

Check whether CMAKE is set as an environment variable

8574db8

Fix RequantShift merge rounding when shift is 0

16edb12

Fix RequantShift kernel when log2d is 0

a47eace

Add missing math.h header to generic kernels

91915eb

Change _duplicateConstants to also duplicate Constant nodes

0c2c312

Check for float output in generic platform

5cd70a4

Add tensor name mangling

2dec23d

Add identity operation removal

3835176

Fix Gather

862d5b6

Layernorm nitty rewrite

e2acd53

Slightly more legible outer loop

Add _unpack_const helper function and slight RequantShift parsing rewrite

7e24551

lukamac force-pushed the bag-of-fix branch from 372a65f to 8671039 Compare May 6, 2025 11:24

lukamac added 3 commits May 6, 2025 13:35

Change convolution's kernel_shape attribute to be optional

9488cea

The ONNX spec defines kernel_shape as optional because it can be inferred from the weight's shape. This commit changes the parser to allow for that.
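
A minimal sketch of that inference, assuming the ONNX Conv weight layout of (M, C/group, k1, ..., kn), where the kernel dimensions are everything after the first two axes. The function name and the plain-dict attribute container are illustrative and not the parser code from this commit.

```python
def infer_kernel_shape(attrs, weight_shape):
    """Return kernel_shape from the node attributes, or infer it from the weight."""
    if "kernel_shape" in attrs:
        return list(attrs["kernel_shape"])
    # Weight layout is (M, C/group, k1, ..., kn): skip the two channel axes.
    return list(weight_shape[2:])
```

For a 2D convolution with a weight of shape (16, 3, 3, 3) and no explicit attribute, this yields `[3, 3]`.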

Add load_file_to_local to dory_mem

f2dc964

Formatting

e005760

lukamac force-pushed the bag-of-fix branch from 8671039 to 71b7a1d Compare May 6, 2025 11:37

lukamac added 5 commits May 6, 2025 13:45

Fix compiler warning for implicit conversion from int to float

b13d0c6

Add a little bit more info to mapping error

fdd9291

Add gvsoc target to simulation.cmake and move USE_NEUREKA

853089b

Hex dump all global buffers with extName

2c8e1c6

Rewrite hex dumping logic in preparation for neureka v2

6a7fd3d

lukamac force-pushed the bag-of-fix branch 3 times, most recently from 6a9546a to 1889383 Compare May 7, 2025 12:47

Change Neureka conv parsers to inherit from Conv2d parser

0c2cbac

lukamac force-pushed the bag-of-fix branch from 1889383 to 0c2cbac Compare May 7, 2025 12:55
