
Finite Lifetime for IO Tensors #51


Open: wants to merge 18 commits into devel from pr/finite-lifetime-io
Conversation

Victor-Jung
Member

@Victor-Jung Victor-Jung commented Mar 17, 2025

This PR is based on the currently open PR #44; please don't review it before PR #44 is merged.

Added

  • Two attributes, is_input and is_output, to VariableBuffer, indicating that a VariableBuffer is an IO of the network. IO buffers have to be treated differently from normal buffers: while they live in the global scope, inputs need to be alive at the beginning of the computation (lifetime[0] = 0), and outputs have to be alive at the end of the computation (lifetime[-1] = inf).
  • One comprehensive test of the memory map generated by the tiler.
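The IO lifetime convention described above can be sketched as follows. This is a minimal illustration with hypothetical names (VariableBuffer here is a stand-in, and io_lifetime is not an actual Deeploy function); it only shows how the is_input/is_output flags clamp a buffer's lifetime interval.

```python
import math

class VariableBuffer:
    """Minimal stand-in for a tensor buffer; names are illustrative."""

    def __init__(self, name, is_input=False, is_output=False):
        self.name = name
        self.is_input = is_input    # must be alive from the start of the computation
        self.is_output = is_output  # must stay alive until the end of the computation

def io_lifetime(buffer, first_use, last_use):
    """Clamp a buffer's (start, end) lifetime according to its IO role."""
    start = 0 if buffer.is_input else first_use
    end = math.inf if buffer.is_output else last_use
    return (start, end)

print(io_lifetime(VariableBuffer("x", is_input=True), 3, 7))   # (0, 7)
print(io_lifetime(VariableBuffer("y", is_output=True), 3, 7))  # (3, inf)
print(io_lifetime(VariableBuffer("tmp"), 3, 7))                # (3, 7)
```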

Changed

  • The _calculateLifetimes method of the MemoryScheduler now computes proper lifetimes for VariableBuffers from the global scope.
  • Memory arena buffers are now added at the beginning of the OrderedDict representing the global context. This is necessary because other global buffers can depend on these arenas for their definition.
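The ordering constraint on the global context can be illustrated with a plain OrderedDict. The buffer names below are made up for illustration; the point is only that a later-registered arena can be moved to the front so that buffers depending on it come after it in definition order.

```python
from collections import OrderedDict

# A global context in which ordinary buffers were registered first.
global_ctxt = OrderedDict()
global_ctxt["weights_0"] = "ConstantBuffer"
global_ctxt["output_0"] = "VariableBuffer"

# The arena buffer must come first in the emitted global context,
# because later buffer definitions may reference it (e.g. pointers
# into the arena).
global_ctxt["L2_arena"] = "ArenaBuffer"
global_ctxt.move_to_end("L2_arena", last=False)

print(list(global_ctxt))  # ['L2_arena', 'weights_0', 'output_0']
```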

Fixed

  • Aligned the memory allocation test so that it fails properly; its previous behavior was broken by the optimization of the memory footprint.
  • Fixed generateBufferAllocationCode for the PULP Deployer. Previously, the output was loaded into L3 at the beginning of the computation. This is unnecessary, breaks when the outputs don't have an infinite lifetime, and is prone to error. Outputs are therefore no longer loaded into L3.
  • The Softmax_fp32 kernel for Snitch has a memory leak when used with 8 cores. Added a comment and constrained its execution to a single core.
  • In _calculateLifetimes: the lifetime of aliased buffers was not correctly computed.
  • Some type hints were written as forward references (represented by strings); this is unnecessary and they have been adapted to proper type hints.
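The aliased-buffer fix mentioned above amounts to merging each alias's usage interval into that of its target, instead of giving the alias an independent lifetime. A minimal sketch (hypothetical names and data layout, assuming lifetimes are (start, end) tuples; not the actual _calculateLifetimes code):

```python
def merge_lifetimes(a, b):
    """Union of two (start, end) intervals: an aliased pair must be
    alive whenever either name is in use."""
    return (min(a[0], b[0]), max(a[1], b[1]))

lifetimes = {"buf": (2, 5)}
alias_of = {"buf_view": "buf"}    # buf_view aliases the storage of buf
alias_usage = {"buf_view": (4, 9)}

# Fold each alias's interval into its target buffer's lifetime.
for alias, target in alias_of.items():
    lifetimes[target] = merge_lifetimes(lifetimes[target], alias_usage[alias])

print(lifetimes["buf"])  # (2, 9)
```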

PR Merge Checklist

  1. The PR is rebased on the latest devel commit and pointing to devel.
  2. Your PR is reviewed and approved.
  3. All checks are passing.
  4. The CHANGELOG.md file has been updated.
  5. If the Docker image was modified, change its link back after review.

@Victor-Jung Victor-Jung requested a review from Xeratec as a code owner March 17, 2025 13:22
@Victor-Jung Victor-Jung self-assigned this Mar 17, 2025
@Victor-Jung Victor-Jung force-pushed the pr/finite-lifetime-io branch from 0084047 to dd1370c Compare March 18, 2025 10:32
@Victor-Jung Victor-Jung marked this pull request as draft March 18, 2025 13:50
@Victor-Jung Victor-Jung marked this pull request as ready for review March 19, 2025 08:01
@Xeratec Xeratec changed the title DRAFT: Finite Lifetime for IO Tensors Finite Lifetime for IO Tensors Mar 19, 2025
Member

@Xeratec Xeratec left a comment

LGTM and I already know how important it is ;) I also like the testMemoryMapCorrectness you added.

No major comments, just some small questions.

Comment on lines -50 to +57
- uint32_t compute_num = snrt_cluster_compute_core_num();
+ uint32_t compute_num = 1; //snrt_cluster_compute_core_num();
  int32_t ldI = compute_num * ${input_samples};
  int32_t batch_offset = ${seq_len} * ${input_samples};

- ${kernelName}(${data_in}, ${data_out}, ldI, batch_offset, batch_size, ${seq_len}, ${input_samples});
+ // JUNGVI: This implementation is broken and has memory leak.
+ if (snrt_hartid() == 0){
+     ${kernelName}(${data_in}, ${data_out}, ldI, batch_offset, batch_size, ${seq_len}, ${input_samples});
+ }

Should this be part of this PR? How hard is it to fix it or did Run already fix it?

Comment on lines 305 to +306
  # SCHEREMO: ConstantBuffers are assigned and allocated at compile time, Global Var Buffers are assigned at init time
- if isinstance(ctxt.lookup(tensorMemoryConstraint.tensorName), ConstantBuffer) or ctxt.is_global(
-         tensorMemoryConstraint.tensorName):
+ if isinstance(ctxt.lookup(tensorMemoryConstraint.tensorName), ConstantBuffer):

Please adapt the comment according to your changes.

Comment on lines +140 to +141
+ if hasattr(ctxt.lookup(buffer.name), "_alias"):
+     continue

Could it be helpful to mark aliased buffers with a special color or some other visual cue?

Comment on lines -378 to +383
- def setupModel(self, ctxt: NetworkContext, schedule: Schedule, layerBinding: OrderedDict[str, ONNXLayer],
+ def setupModel(self, ctxt: NetworkContext, schedule: Schedule, layerBinding: 'OrderedDict[str, ONNXLayer]',

Why this change? The ONNXLayer is imported above.
