-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pull latest ROCm6.2 changes #79
Closed
Closed
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Change-Id: I96f009b417869f86a71a7683f51089f4b0461893
Change-Id: Ief99aac3775b0a0c6372215e5a0a9ecbbdca4ca5
Change-Id: Ia6b9a428da7b54819fb3be928a33cdffcd04ecdf Signed-off-by: Lang Yu <[email protected]>
…ler build Change-Id: I89199ccc4bdf79940dc6693e07c03115c52bf613
hsa_amd_memory_lock_to_pool() and hsa_amd_memory_unlock() should be called balanced. Change-Id: I8b1549861bff752aabbb6399d717d1e346079a38 Signed-off-by: Lang Yu <[email protected]>
use AMD_COMGR_ACTION_COMPILE_SOURCE_TO_RELOCATABLE action to compile source to realoc. Currently we have source->bc, link->bc and bc->realoc. This new action replaces the three steps with one. Change-Id: I6ba551b8d04c7e06f41c4324026e4dcd2db1970f
Change-Id: Ibde2ae5b8d240f3986bcd168facc513a319c0f17
…erator. Change-Id: I8349e99c03422c268bbb60a8c143bd492d9cec09
Change-Id: Ie07cf82025b65e4f95d4ce3cf46a6b8081f29b6a
This reverts commit 551cdcd. Reason for revert: breaks windows-psdb Change-Id: I8b2fcaf0d0be09a9364e386127cabafb68a67e0d
…essFlagsProtRead Change-Id: I8e5c460ac310ad32849359720a40fc5131d4f32c
… RT path with windows Certain build systems such as NMake seem to interpret backslashes as escapes. Change-Id: Ifc04d9427148e83084efb1af94a1ce85cb2a6c4e
…s hipSuccess Change-Id: Ie322a1a0165111d2ca129e68f49b54fe93aa088e
Add logic to prevent significant grow of reserved memory in mempool. Change-Id: If034433c26e904e3037200a593ef338896d7f16d
Change-Id: I0619a9f66ae3d64a0f3fc36384f46adb302e725d
…recompiled header file hipRTC The precompiled header files have hard coded paths in comments. Using the disable linemarker option(-P) will skip the generation of comments Change-Id: Ifb134052996c343f5405e954784b4b2c286c36b1
Generic event synchronize can be called from mempool implementation directly. Change-Id: Id66ab9cdab486390df4dd405a4ac291eb0bb568f
use AMD_COMGR_ACTION_COMPILE_SOURCE_TO_RELOCATABLE action to compile source to realoc. Currently we have source->bc, link->bc and bc->realoc. This new action replaces the three steps with one. Change-Id: I8089cbef681e079702fefc2d2085a23bc3578d02
hipDestroyExternalSemaphore_fn was missing initialization Change-Id: Ieab019afd0a0c62a2aa534fb262436edda3499db
Change-Id: I0651f6a836d2f063caf651520de96d7675428771
Change-Id: Ic7812e21618f48c737ce2c51b22a153099cc5d66
… node params with graph performance optimizations. During hipGraphExecKernelNodeSetParams kernel function can also be updated. Hence size required for kernel parameters differs from what is allocated during graphInstantiation. So, create new 128KB kernel pool and allocate kernel args from the pool. If the pool is full create new 128KB pool. Release kernel pools when graph exec object is destroyed. Change-Id: I9567946d63400c79cbfd4c5439c654c92557ceae
Link to amd_comgr_2.dll Change-Id: I5d21af0cc91f2082099273f75ebe1c3279b6365d
Change-Id: Ib0b3d737cd5d297ac2d3c1ba549902b96e29c18c
…entation Fault Change-Id: I0c0c7a2ea7c35237332ddd7c2461e4d9ff8e2f1c
Change-Id: Ie2efd233c0bcf8ad0e06223ee854fe4bd1060443
Debugger support requires FreezeExecutable() call inside the loader interface. Change-Id: Iff01840a5980500d8d51da8d73c3f24b2960c44a
Change-Id: I7290345cfc60cd878fb39a06b03105441793c27b
Change-Id: Ib6b8bc59a889ad2e4f440e847d9f2e7e8fb2cda2
…rue. Change-Id: I3119bd8f99c079280bde1ba32b3d26b474899e7d
- The correlation_id had random junk values which we were inserting in the dispatch AQL packet even when no profiler was attached but if we had a valid timestamp. - Also make sure we dont even write the reserved2 field in the AQL packet if no profiler attached. Change-Id: Icdb7493198c1bb5e2d786a97e027288660854cd7
…orrectly If a system has LLVM installed, `find_package` could choose that one even if we set `HIP_LLVM_ROOT`. `LLVM_ROOT` is ignored because of this CMake policy is set to `OLD` by default. Change-Id: I18fa0453afe170c229e92d6ddc386b43eb0c44f6
…etAttribute Change-Id: I17b6448bf49aed49315a0b4c1426dcbe011735d9
Use GLVND versin of GL Change-Id: I1d6b0ec8672a232d017ad27a01803556992e8a49
HIP now has bfloat16 types via amd_hip_bf16.h in hipamd, hip_bf16.h in hip folder. Change-Id: Ic4d227936c5013e4b5d577725b8f55bebdc15499
…iable Change-Id: I9bf72b9c1a56980352109bd4d42b54ecb2d1b8f9
Display queue base pointer in the log. This can be co-related with AQL packets Change-Id: I544f9b6db6ae01c85e57e4b3f0b3fffefcd7c2ed
Change-Id: Icfcd1ecf2f2956ba015820ba250f06b438169a6a
ROCm6.0 hipamd runtime changes to support Kernel Fusion
minor bug in yaml file during graph recording
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.