Pull latest ROCm6.2 changes #79

Closed
wants to merge 45 commits into from

Conversation

ppalaniappan-amd
No description provided.

jiabaxie and others added 30 commits January 18, 2024 16:30
Change-Id: I96f009b417869f86a71a7683f51089f4b0461893
Change-Id: Ief99aac3775b0a0c6372215e5a0a9ecbbdca4ca5
Change-Id: Ia6b9a428da7b54819fb3be928a33cdffcd04ecdf
Signed-off-by: Lang Yu <[email protected]>
…ler build

Change-Id: I89199ccc4bdf79940dc6693e07c03115c52bf613
Calls to hsa_amd_memory_lock_to_pool() and hsa_amd_memory_unlock()
should be balanced.

Change-Id: I8b1549861bff752aabbb6399d717d1e346079a38
Signed-off-by: Lang Yu <[email protected]>
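
One common way to keep a lock/unlock pair balanced is a scope guard. The sketch below is only an illustration of that idea, not the change made in this commit: `ScopedLock` is a hypothetical helper, and the two callables are stand-ins where a real caller would wrap hsa_amd_memory_lock_to_pool() and hsa_amd_memory_unlock().

```cpp
#include <cassert>
#include <functional>
#include <utility>

// Hypothetical scope guard (not part of the HSA or HIP API).
// `lock` returns true on success; `unlock` runs exactly once, and only if
// the lock succeeded, when the guard goes out of scope.
class ScopedLock {
 public:
  ScopedLock(const std::function<bool()>& lock, std::function<void()> unlock)
      : unlock_(std::move(unlock)), locked_(lock()) {}

  ~ScopedLock() {
    if (locked_) unlock_();
  }

  // Copying would cause a double unlock, so it is disabled.
  ScopedLock(const ScopedLock&) = delete;
  ScopedLock& operator=(const ScopedLock&) = delete;

  bool locked() const { return locked_; }

 private:
  std::function<void()> unlock_;
  bool locked_;
};
```

With a guard like this, every early return or error path releases the lock, so the number of unlock calls always matches the number of successful lock calls.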
Use the AMD_COMGR_ACTION_COMPILE_SOURCE_TO_RELOCATABLE action
to compile source to a relocatable. Currently we have source->bc,
link->bc and bc->reloc. This new action replaces the
three steps with one.

Change-Id: I6ba551b8d04c7e06f41c4324026e4dcd2db1970f
Change-Id: Ibde2ae5b8d240f3986bcd168facc513a319c0f17
…erator.

Change-Id: I8349e99c03422c268bbb60a8c143bd492d9cec09
Change-Id: Ie07cf82025b65e4f95d4ce3cf46a6b8081f29b6a
This reverts commit 551cdcd.

Reason for revert: breaks windows-psdb

Change-Id: I8b2fcaf0d0be09a9364e386127cabafb68a67e0d
…essFlagsProtRead

Change-Id: I8e5c460ac310ad32849359720a40fc5131d4f32c
… RT path with windows

Certain build systems such as NMake seem to interpret backslashes as escapes.

Change-Id: Ifc04d9427148e83084efb1af94a1ce85cb2a6c4e
…s hipSuccess

Change-Id: Ie322a1a0165111d2ca129e68f49b54fe93aa088e
Add logic to prevent significant growth of reserved memory in the mempool.

Change-Id: If034433c26e904e3037200a593ef338896d7f16d
Change-Id: I0619a9f66ae3d64a0f3fc36384f46adb302e725d
…recompiled header file hipRTC

The precompiled header files have hard-coded paths in comments. Using the disable-linemarker option (-P) skips the generation of those comments.

Change-Id: Ifb134052996c343f5405e954784b4b2c286c36b1
The generic event synchronize can be called directly from the mempool implementation.

Change-Id: Id66ab9cdab486390df4dd405a4ac291eb0bb568f
Use the AMD_COMGR_ACTION_COMPILE_SOURCE_TO_RELOCATABLE action
to compile source to a relocatable. Currently we have source->bc,
link->bc and bc->reloc. This new action replaces the
three steps with one.

Change-Id: I8089cbef681e079702fefc2d2085a23bc3578d02
hipDestroyExternalSemaphore_fn was missing initialization

Change-Id: Ieab019afd0a0c62a2aa534fb262436edda3499db
Change-Id: I0651f6a836d2f063caf651520de96d7675428771
Change-Id: Ic7812e21618f48c737ce2c51b22a153099cc5d66
… node params with graph performance optimizations.

hipGraphExecKernelNodeSetParams can also update the kernel function,
so the size required for kernel parameters may differ from what was allocated during graph instantiation.
To handle this, create a new 128KB kernel pool and allocate kernel args from the pool.
If the pool is full, create another 128KB pool. Release the kernel pools when the graph exec object is destroyed.

Change-Id: I9567946d63400c79cbfd4c5439c654c92557ceae
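
The pooling scheme described in this commit message can be sketched as a simple bump allocator over fixed-size chunks. This is a minimal illustration under stated assumptions, not the hipamd implementation: `KernelArgPools`, `Allocate`, and the 16-byte default alignment are hypothetical, and host memory stands in for whatever memory the runtime actually uses for kernel arguments.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <vector>

// Pool size taken from the commit message: 128KB per pool.
constexpr std::size_t kPoolSize = 128 * 1024;

// Hypothetical owner of the kernel-argument pools for one graph exec object.
class KernelArgPools {
 public:
  // Returns storage for `size` bytes at an offset that is a multiple of
  // `alignment` (must be non-zero) from the start of a pool. Returns nullptr
  // if the request is empty or can never fit in a single pool.
  void* Allocate(std::size_t size, std::size_t alignment = 16) {
    if (size == 0 || size > kPoolSize) return nullptr;
    std::size_t offset = AlignUp(offset_, alignment);
    if (pools_.empty() || offset + size > kPoolSize) {
      // Current pool is missing or full: start a fresh 128KB pool.
      pools_.push_back(std::unique_ptr<unsigned char[]>(new unsigned char[kPoolSize]));
      offset = 0;
    }
    offset_ = offset + size;
    return pools_.back().get() + offset;
  }

  std::size_t pool_count() const { return pools_.size(); }

  // No explicit destructor: all pools are released when this object is
  // destroyed, mirroring "release kernel pools when graph exec is destroyed".

 private:
  static std::size_t AlignUp(std::size_t value, std::size_t alignment) {
    return (value + alignment - 1) / alignment * alignment;
  }

  std::vector<std::unique_ptr<unsigned char[]>> pools_;
  std::size_t offset_ = 0;  // next free byte in the most recent pool
};
```

Allocations are never freed individually in this sketch. That keeps updates cheap at the cost of holding on to superseded argument buffers until the owner is destroyed.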
Link to amd_comgr_2.dll

Change-Id: I5d21af0cc91f2082099273f75ebe1c3279b6365d
Change-Id: Ib0b3d737cd5d297ac2d3c1ba549902b96e29c18c
…entation Fault

Change-Id: I0c0c7a2ea7c35237332ddd7c2461e4d9ff8e2f1c
Change-Id: Ie2efd233c0bcf8ad0e06223ee854fe4bd1060443
Debugger support requires a FreezeExecutable() call inside the loader interface.

Change-Id: Iff01840a5980500d8d51da8d73c3f24b2960c44a
Change-Id: I7290345cfc60cd878fb39a06b03105441793c27b
Change-Id: Ib6b8bc59a889ad2e4f440e847d9f2e7e8fb2cda2
…rue.

Change-Id: I3119bd8f99c079280bde1ba32b3d26b474899e7d
saleelk and others added 15 commits January 26, 2024 12:24
- The correlation_id held uninitialized junk values, which we inserted into
the dispatch AQL packet whenever we had a valid timestamp, even when no
profiler was attached.
- Also make sure we don't write the reserved2 field in the AQL packet at all
if no profiler is attached.

Change-Id: Icdb7493198c1bb5e2d786a97e027288660854cd7
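
The two points in this commit message can be illustrated with a small sketch. It assumes hypothetical `DispatchPacket` and `Timestamp` types that model only the fields mentioned above. The real runtime structures and the way the profiler state is queried are not shown on this page.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the dispatch AQL packet; only reserved2 is modeled.
struct DispatchPacket {
  std::uint64_t reserved2 = 0;
};

// Hypothetical timestamp record. The default member initializer ensures
// correlation_id never holds an indeterminate value.
struct Timestamp {
  std::uint64_t correlation_id = 0;
};

// Writes reserved2 only when a profiler is attached and a timestamp exists.
// Returns true if the field was written, false if the packet was left alone.
bool StampCorrelationId(DispatchPacket& packet, const Timestamp* ts,
                        bool profiler_attached) {
  if (!profiler_attached || ts == nullptr) return false;
  packet.reserved2 = ts->correlation_id;
  return true;
}
```

Checking the profiler state first means a valid timestamp alone is no longer enough to touch the packet.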
…orrectly

If a system has LLVM installed, `find_package` could choose that
installation even if we set `HIP_LLVM_ROOT`. `LLVM_ROOT` is ignored because
the relevant CMake policy is set to `OLD` by default.

Change-Id: I18fa0453afe170c229e92d6ddc386b43eb0c44f6
…etAttribute

Change-Id: I17b6448bf49aed49315a0b4c1426dcbe011735d9
Use GLVND version of GL

Change-Id: I1d6b0ec8672a232d017ad27a01803556992e8a49
HIP now has bfloat16 types via amd_hip_bf16.h in hipamd and hip_bf16.h in
the hip folder.

Change-Id: Ic4d227936c5013e4b5d577725b8f55bebdc15499
…iable

Change-Id: I9bf72b9c1a56980352109bd4d42b54ecb2d1b8f9
Display the queue base pointer in the log. This can be correlated with AQL
packets.

Change-Id: I544f9b6db6ae01c85e57e4b3f0b3fffefcd7c2ed
Change-Id: Icfcd1ecf2f2956ba015820ba250f06b438169a6a
ROCm6.0 hipamd runtime changes to support Kernel Fusion
minor bug in yaml file during graph recording