Update to LLVM 12 #147

omern1 · 2021-05-27T14:13:14Z

Trying to update Repo to LLVM 12. Still working through the build faliures, etc.

…y. NFCI. Use const reference to avoid std::string copy - accordingly to the style guide we shouldn't be using auto anyway. Fixes MSVC analyzer warning.

…ector() shuffle patterns. We allow insert_subvector lowering of all legal types, so don't always cast to the vXi64/vXf64 shuffle types - this is only necessary for X86ISD::SHUF128/X86ISD::VPERM2X128 patterns later.

Remove a new line in CompilerInvocation, to now follow the style when clang-format is applied.

The included test case triggered a sign assertion on the result in `Success()`. This was caused by the APSInt created for a bitcast having its signedness bit inverted. The second APSInt constructor argument is `isUnsigned`, so invert the result of `isSignedIntegerType`. Differential Revision: https://reviews.llvm.org/D95135

We allow extract_subvector lowering of all legal types, so pre-bitcast the source type to try and reduce bitcast pollution.

This change also changes getReductionCost to return InstructionCost, and it simplifies two expressions by removing a redundant 'isValid' check.

Update the preprocessor regression tests to use the new driver if the new driver is built (FLANG_BUILD_NEW_DRIVER=On), otherwise the tests will still run using f18. Summary of changes: - Introduce %flang to the regression tests, which points to the new driver if it is built or otherwise points to f18 - Update all tests in flang/test/Preprocessing/ to use %flang Differential Revision: https://reviews.llvm.org/D94805

This transformation anchors on a padding op whose result is only used as an input to a Linalg op and pulls it out of a given number of loops. The result is a packing of padded tailes of ops that is amortized just before the outermost loop from which the pad operation is hoisted. Differential revision: https://reviews.llvm.org/D95243

This reverts commit 14947cd because it broke clang-cmake-armv7-quick.

@src

We can sink extends after min/max if they match and would not change the sign-interpreted compare. The only combo that doesn't work is zext+smin/smax because the zexts could change a negative number into positive: https://alive2.llvm.org/ce/z/D6sz6J Sext+umax/umin works: define i32 @src(i8 %x, i8 %y) { %0: %sx = sext i8 %x to i32 %sy = sext i8 %y to i32 %m = umax i32 %sx, %sy ret i32 %m } => define i32 @tgt(i8 %x, i8 %y) { %0: %m = umax i8 %x, %y %r = sext i8 %m to i32 ret i32 %r } Transformation seems to be correct!

D62056 makes the output color if clang auto-detects a tty, but if it does not, there is no way to force it to use colors anyway. This patch adjusts the command-lines given to ClangTool which will force color on or off if --use-color is specified.

…r matching in lit tests - continued" This reverts commit 520b5ec.

This reverts commit 06f8a49.

…inal root size. NFCI. We're relying on the source inputs for shuffle combining having already been widened to the root size (otherwise the offset logic falls over) - we're going to be supporting different sized shuffle inputs soon, so we need to explicitly make the minimum widened width the original root size.

This revision addresses a remaining comment that was overlooked in https://reviews.llvm.org/D95243: the pad hoisting transformation is made to additionally bail out on side effecting ops other than LoopLikeOps.

When working with invalid code, we would try to dereference a nullptr while deducing template arguments in some dependend code operating on a lambda with invalid return type. Differential Revision: https://reviews.llvm.org/D95145

…perands. This revision starts evolving the APIs to manipulate ops with offsets, sizes and operands towards a ValueOrAttr abstraction that is already used in folding under the name OpFoldResult. The objective, in the future, is to allow such manipulations all the way to the level of ODS to avoid all the genuflexions involved in distinguishing between values and attributes for generic constant foldings. Once this evolution is accepted, the next step will be a mechanical OpFoldResult -> ValueOrAttr. Differential Revision: https://reviews.llvm.org/D95310

This patch adds plumbing to handle scalarized values directly in VPTransformState. Reviewed By: gilr Differential Revision: https://reviews.llvm.org/D92282

This fix will make us not crash, but ideally we would handle this case better. Differential Revision: https://reviews.llvm.org/D94919

This has been specifically requested: clangd/vscode-clangd#114 and various issues can be addressed with this as a workaround, e.g.: clangd/clangd#662 Differential Revision: https://reviews.llvm.org/D95349

…nance checking Checking the llvm.experimental.noalias.scope.decl dominance can be worstcase O(N^2). Limit the dominance check to N=32. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D95335

…egal 256-bit vector types Remove bitcasts to/from v4x64 types through vperm2f128/vperm2i128 ops to help improve shuffle combining and demanded vector elts folding.

This was enabled in https://reviews.llvm.org/D95335 but it breaks the stage2 fuchsia build (See http://lab.llvm.org:8011/#/builders/98/builds/4105/steps/9/logs/stdio)

This patch removes a couple of left-overs and a typo from RegisterInfos_arm64_sve.h and RegisterInfoPOSIX_arm64.h.

This patch defines AUXV_AT_HWCAP2 for accessing Aux extensions.

TestPlatformProcessConnect is randomly failing on LLDB Arm/AArch64 buildbot. I am disabling it temporarily untill problem is fixed.

This patch improves the availability for variables stored in the coroutine frame by emitting an alloca to hold the pointer to the frame object and rewriting dbg.declare intrinsics to point inside the frame object using salvaged DIExpressions. Finally, a new alloca is created in the funclet to hold the FramePtr pointer to ensure that it is available throughout the entire function at -O0. This path also effectively reverts D90772. The testcase updates highlight nicely how every removed CHECK for a dbg.value is preceded by a new CHECK for a dbg.declare. Thanks to JunMa, Yifeng, and Bruno for their thoughtful reviews! Differential Revision: https://reviews.llvm.org/D93497 rdar://71866936

Just use the existing `Known.sextInReg` implementation. - Update KnownBitsTest.cpp. - Update combine-redundant-and.mir for a more concrete example. Differential Revision: https://reviews.llvm.org/D95484

…e, NFC Add a new `raw_pwrite_ostream` variant, `buffer_unique_ostream`, which is like `buffer_ostream` but with unique ownership of the stream it's wrapping. Use this in CompilerInstance to simplify the ownership of non-seeking output streams, avoiding logic sprawled around to deal with them specially. This also simplifies future work to encapsulate output files in a different class. Differential Revision: https://reviews.llvm.org/D93260

Fix layering between `CompilerInstance::createDefaultOutputFile` and the two versions of `createOutputFile`. - Add missing configuration flags to `createDefaultOutputFile` so that GeneratePCHAction and GenerateModuleFromModuleMapAction can use it. They previously promised that temporary files were turned on; now `createDefaultOutputFile` handles that logic. - Lift the logic handling `InFile` and `Extension` to `createDefaultOutputFile`, since it's only the callers of that function that are using it. - Rename the deeper of the two `createOutputFile`s to `createOutputFileImpl` and make it private to `CompilerInstance` (to prove that no one else is using it). - Sink the logic for adding to `CompilerInstance::OutputFiles` down to `createOutputFileImpl`, allowing two "optional" (but always used) `std::string*` out parameters to be removed. - Instead of passing a `std::error_code` out parameter into `createOutputFileImpl`, have it return `Expected<>`. - As a drive-by, inline `CompilerInstance::addOutputFile` into its only caller, `createOutputFileImpl`. Clean layering makes it easier for a future commit to extract `createOutputFileImpl` out of `CompilerInstance`. Differential Revision: https://reviews.llvm.org/D93248

Differential Revision: https://reviews.llvm.org/D95486

Slightly changes the output in error code, but no behavior change in normal use. This is for preparation for using these two functions elsewhere.

The unwinder used by the crash handler on versions of Android prior to API 29 did not correctly handle binaries built with rosegment, which is enabled by default for LLD. Android only supports LLD, so it's not an issue that this flag is not accepted by other linkers. Reviewed By: srhines Differential Revision: https://reviews.llvm.org/D95166

…hout prebuilts

[libomptarget][cuda] Handle missing _v2 symbols gracefully Follow on from D95367. Dlsym the _v2 symbols if present, otherwise use the unsuffixed version. Builds a hashtable for the check, can revise for zero heap allocations later if necessary. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D95415

This patch sets the def-allocator-var ICV based on the environment variables provided in OMP_ALLOCATOR. Previously, only allowed value for OMP_ALLOCATOR was a predefined memory allocator. OpenMP 5.1 specification allows predefined memory allocator, predefined mem space, or predefined mem space with traits in OMP_ALLOCATOR. If an allocator can not be created using the provided environment variables, the def-allocator-var is set to omp_default_mem_alloc. Differential Revision: https://reviews.llvm.org/D94985

…d <. Split out of D93512.

This change implements support for applying profile instrumentation only to selected files or functions. The implementation uses the sanitizer special case list format to select which files and functions to instrument, and relies on the new noprofile IR attribute to exclude functions from instrumentation. Differential Revision: https://reviews.llvm.org/D94820

Remove common instructions from rv64 tests since they are now covered by the rv64 run lines in the rv32 tests. Add rv32-only* tests for a few cases that aren't common between r32 and rv64. Addresses review feedback from D95150. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D95272

With D94745, we no longer use CUDA SDK to compile `deviceRTLs`. Therefore, many CMake code in the project is useless. This patch cleans up unnecessary code and also drops the requirement to build NVPTX `deviceRTLs`. CUDA detection is still being used however to determine whether we need to involve the tests. Auto detection of compute capability is enabled by default and can be disabled by setting CMake variable `LIBOMPTARGET_NVPTX_AUTODETECT_COMPUTE_CAPABILITY=OFF`. If auto detection is enabled, and CUDA is also valid, it will only build the bitcode library for the detected version; otherwise, all variants supported will be generated. One drawback of this patch is, we now generate 96 variants of bitcode library, and totally 1485 files to be built with a clean build on a non-CUDA system. `LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITIES=""` can be used to disable building NVPTX `deviceRTLs`. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D95466

A follow up patch will add a few success cases here; rename it to `output-paths.c` instead of `output-failures.c`.

Use early returns in `CompilerInstance::clearOutputFiles` to clarify the logic, and rename `ec` to `EC` as a drive-by. No functionality change.

Co-authored-by: Ying Yi <[email protected]>

omern1 · 2021-05-28T14:01:09Z

Welp, I've gotten rid of all errors now (atleast on Linux)

RKSimon and others added 30 commits January 25, 2021 11:35

[TableGen] RuleMatcher::defineComplexSubOperand avoid std::string cop…

9641bd0

…y. NFCI. Use const reference to avoid std::string copy - accordingly to the style guide we shouldn't be using auto anyway. Fixes MSVC analyzer warning.

[flang][driver] Remove newline in CompilerInvocation

8e3adda

Remove a new line in CompilerInvocation, to now follow the style when clang-format is applied.

[X86][AVX] LowerTRUNCATE - avoid bitcasts around extract_subvectors.

1b780cf

We allow extract_subvector lowering of all legal types, so pre-bitcast the source type to try and reduce bitcast pollution.

[SLPVectorizer] NFC: Migrate getVectorCallCosts to use InstructionCost.

171d124

This change also changes getReductionCost to return InstructionCost, and it simplifies two expressions by removing a redundant 'isValid' check.

Revert "[clang] Fix signedness in vector bitcast evaluation"

b16fb1f

This reverts commit 14947cd because it broke clang-cmake-armv7-quick.

[InstCombine] add tests for min/max intrinsics with extended values; NFC

07b60d0

Revert "[SystemZ][z/OS] Fix No such file or directory expression erro…

84851a2

…r matching in lit tests - continued" This reverts commit 520b5ec.

Revert "[SystemZ][z/OS] Fix No such file or directory expression error"

978444d

This reverts commit 06f8a49.

[mlir][Linalg] Address missed review item

68eee55

This revision addresses a remaining comment that was overlooked in https://reviews.llvm.org/D95243: the pad hoisting transformation is made to additionally bail out on side effecting ops other than LoopLikeOps.

[clang] Fix a nullptr dereference bug on invalid code

d462aa5

When working with invalid code, we would try to dereference a nullptr while deducing template arguments in some dependend code operating on a lambda with invalid return type. Differential Revision: https://reviews.llvm.org/D95145

[mlir][Linalg] Fix incorrect erase order

52e2552

[NFC] Fix title comment typo and provide description for LLJIT example.

7163aa9

[VPlan] Handle scalarized values in VPTransformState.

3201274

This patch adds plumbing to handle scalarized values directly in VPTransformState. Reviewed By: gilr Differential Revision: https://reviews.llvm.org/D92282

[Doc][NFC] Fix Kaleidoscope links, typos and add blog posts for MCJIT

3546b37

[clangd] Fix a crash when indexing invalid ObjC method declaration

0005438

This fix will make us not crash, but ideally we would handle this case better. Differential Revision: https://reviews.llvm.org/D94919

[clangd] Allow diagnostics to be suppressed with configuration

7e506b3

This has been specifically requested: clangd/vscode-clangd#114 and various issues can be addressed with this as a workaround, e.g.: clangd/clangd#662 Differential Revision: https://reviews.llvm.org/D95349

[Verifier] enable and limit llvm.experimental.noalias.scope.decl domi…

6e530a3

…nance checking Checking the llvm.experimental.noalias.scope.decl dominance can be worstcase O(N^2). Limit the dominance check to N=32. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D95335

[X86][AVX] Generalize vperm2f128/vperm2i128 patterns to support all l…

13f2aee

…egal 256-bit vector types Remove bitcasts to/from v4x64 types through vperm2f128/vperm2i128 ops to help improve shuffle combining and demanded vector elts folding.

[Verifier] disable llvm.experimental.noalias.scope.decl dominance check.

3b5d36e

This was enabled in https://reviews.llvm.org/D95335 but it breaks the stage2 fuchsia build (See http://lab.llvm.org:8011/#/builders/98/builds/4105/steps/9/logs/stdio)

[LLDB] Remove leftovers and typos from RegisterInfos_arm64_sve.h

b45020c

This patch removes a couple of left-overs and a typo from RegisterInfos_arm64_sve.h and RegisterInfoPOSIX_arm64.h.

[LLDB] Define AUXV_AT_HWCAP2 in AuxVector.h

2fd4d92

This patch defines AUXV_AT_HWCAP2 for accessing Aux extensions.

[LLDB] Skip TestPlatformProcessConnect on arm/aarch64 buildbot

e9a3fac

TestPlatformProcessConnect is randomly failing on LLDB Arm/AArch64 buildbot. I am disabling it temporarily untill problem is fixed.

adrian-prantl and others added 20 commits January 26, 2021 15:01

[GlobalISel] Implement computeKnownBits for G_SEXT_INREG

f36007e

Just use the existing `Known.sextInReg` implementation. - Update KnownBitsTest.cpp. - Update combine-redundant-and.mir for a more concrete example. Differential Revision: https://reviews.llvm.org/D95484

[llc] Add reportError helper and canonicalize error messages

4d28f0a

[libomptarget][NFC] Avoid gcc 5/6 issue with lambda captures.

3caa2d3

Differential Revision: https://reviews.llvm.org/D95486

llvm-lib: Pull error printing code out of two functions

f3c9687

Slightly changes the output in error code, but no behavior change in normal use. This is for preparation for using these two functions elsewhere.

[gn build] restore build command removed in 9595a7f for platforms wit…

4dcb5c4

…hout prebuilts

[gn build] fix get.py change

65e2fa5

[libc++] Give MoveOnly all six comparison operators, not just == an…

fc31920

…d <. Split out of D93512.

[gn build] Port bb9eb19

1458987

Rename clang/test/Frontend/output-{failures,paths}.c, NFC

e4871c1

A follow up patch will add a few success cases here; rename it to `output-paths.c` instead of `output-failures.c`.

Frontend: Use early returns in CompilerInstance::clearOutputFiles, NFC

8e464dd

Use early returns in `CompilerInstance::clearOutputFiles` to clarify the logic, and rename `ec` to `EC` as a drive-by. No functionality change.

Merge commit '8e464dd76befbc4a39a1d21968a3d5f543e29312' into llvm-12

50da5b9

omern1 requested review from paulhuggett and MaggieYingYi May 27, 2021 14:13

omern1 self-assigned this May 27, 2021

omern1 and others added 5 commits May 27, 2021 15:00

Fix 2 build errors

2c0c599

Fix missing functions error

3290797

Co-authored-by: Ying Yi <[email protected]>

Make get_fd() public and fix repeating case

5924e80

Fix compiler warning

faad351

Fix weird 'no member function' errors

b6082b0

omern1 removed their assignment Jun 3, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update to LLVM 12 #147

Update to LLVM 12 #147

omern1 commented May 27, 2021 •

edited

Loading

omern1 commented May 28, 2021

Update to LLVM 12 #147

Are you sure you want to change the base?

Update to LLVM 12 #147

Conversation

omern1 commented May 27, 2021 • edited Loading

omern1 commented May 28, 2021

omern1 commented May 27, 2021 •

edited

Loading