llvm-project

Author	SHA1	Message	Date
Simon Pilgrim	328754474a	[DAG] SimplifySetCC - clang-format add/xor/sub with constant handling. NFC.	2022-04-04 13:30:17 +01:00
Simon Pilgrim	9db1eb13b6	[Thumb2] Regenerate thumb2-teq2 tests	2022-04-04 12:48:20 +01:00
David Green	2abaa027d9	[AArch64] Teach the costmodel about widening muls A vector mul(sext, sext) or mul(zext, zext) will be code generated as a single smull or umull instruction. This most notably effects v2i64 multiplies, which are otherwise not legal and need to be expanded. The oneuse check has also been slightly changed, as it is already checked from the use of isWideningInstruction in getCastInstrCost. Differential Revision: https://reviews.llvm.org/D123006	2022-04-04 12:45:04 +01:00
Simon Pilgrim	ec93435ba0	[Thumb2] Regenerate thumb2-teq tests	2022-04-04 12:24:35 +01:00
Simon Pilgrim	d4cdaa24fd	[MIPS] Regenerate countleading tests with common check prefixes	2022-04-04 12:19:57 +01:00
David Green	2e2f38a1ac	[AArch64] Add widening arithmetic cost tests. NFC	2022-04-04 12:19:45 +01:00
Nikita Popov	3c9f3f76f1	[ConstantFold] Fold zero-index GEPs with opaque pointers With opaque pointers, we can eliminate zero-index GEPs even if they have multiple indices, as this no longer impacts the result type of the GEP. This optimization is already done for instructions in InstSimplify, but we were missing the corresponding constant expression handling. The constexpr transform is a bit more powerful, because it can produce a vector splat constant and also handles undef values -- it is an extension of an existing single-index transform.	2022-04-04 13:04:27 +02:00
Nikita Popov	d092df42f3	[InstSimplify] Add tests for zero-offset opaque ptr constexpr GEP (NFC)	2022-04-04 13:04:26 +02:00
Simon Pilgrim	ad59bd0be9	[X86] Regenerate peep tests checks	2022-04-04 12:02:33 +01:00
Muhammad Omair Javaid	a96638e50e	Revert "[NFCI] Regenerate PhaseOrdering test checks" This reverts commit e91fe08999d5f5d7e7777837c529bac692d06c1b. Breaks following buildbots: https://lab.llvm.org/buildbot/#/builders/171	2022-04-04 15:30:57 +05:00
Jeremy Morse	059d1f84d2	[DebugInfo] Correctly recognize bitfields when emitting dwarf Use the "isBitfield" flag for debug types to determine whether something is a bitfield, rather than trying to guess from it's layout. Fixes https://bugs.llvm.org/show_bug.cgi?id=44601 Patch by: mahkoh Differential Revision: https://reviews.llvm.org/D96334	2022-04-04 11:14:13 +01:00
Simon Pilgrim	623d4b5787	[X86] Support optional NOT stages in the AND(SRL(X,Y),1) -> SETCC(BT(X,Y)) fold Extension to D122891, peek through NOT() ops, adjusting the condcode as we go.	2022-04-04 10:51:26 +01:00
Simon Pilgrim	842175676c	[X86] Add additional test cases for NOT(AND(SRL(X,Y),1))/AND(SRL(NOT(X(,Y),1) -> SETCC(BT(X,Y)) As suggested in post review on D122891	2022-04-04 10:29:33 +01:00
Florian Hahn	1817c526e1	[VPlan] Update VPInterleavedAccessInfo to use getVectorLoopRegion. Update VPInterleavedAccessInfo to use the generic getVectorLoopRegion helper instead of relying on the entry block being the top-most vector loop region.	2022-04-04 10:26:39 +01:00
Martin Sebor	5ccfd5f6d4	[SimplifyLibCalls] Optimize memchr() with known char+str and unknown length If both the character and string are known, but the length potentially isn't, we can optimize the memchr() call to a select of either the known position of the character or null. Split off from https://reviews.llvm.org/D122836.	2022-04-04 11:01:33 +02:00
Martin Sebor	5197d2791f	[SimplifyLibCalls] Move handling of constant char earlier (NFC) Handle the simple constant char case before the bitmask optimization. This will allow extending the code to handle a non-constant size argument in a followup change. Split out from https://reviews.llvm.org/D122836.	2022-04-04 11:01:33 +02:00
Martin Sebor	d18991debf	[SimplifyLibCalls] Fold memchr() with size 1 If the memchr() size is 1, then we can convert the call into a single-byte comparison. This works even if both the string and the character are unknown. Split off from https://reviews.llvm.org/D122836.	2022-04-04 10:41:20 +02:00
Martin Sebor	0f08875744	[InstCombine] Add additional memchr test (NFC) And fix some test names / comments.	2022-04-04 10:41:20 +02:00
Florian Hahn	8cd1892725	[VPlan] Remember previous loop and reset vector loop. At the moment this is NFC, but will be needed once nested loops are also modeled as regions. Preparation for D123005.	2022-04-04 09:27:15 +01:00
Nikita Popov	a5c3b5748c	[MemCpyOpt] Work around PR54682 As discussed on https://github.com/llvm/llvm-project/issues/54682, MemorySSA currently has a bug when computing the clobber of calls that access loop-varying locations. I think a "proper" fix for this on the MemorySSA side might be non-trivial, but we can easily work around this in MemCpyOpt: Currently, MemCpyOpt uses a location-less getClobberingMemoryAccess() call to find a clobber on either the src or dest location, and then refines it for the src and dest clobber. This was intended as an optimization, as the location-less API is cached, while the location-affected APIs are not. However, I don't think this really makes a difference in practice, because I don't think anything will use the cached clobbers on those calls later anyway. On CTMark, this patch seems to be very mildly positive actually. So I think this is a reasonable way to avoid the problem for now, though MemorySSA should also get a fix. Differential Revision: https://reviews.llvm.org/D122911	2022-04-04 10:19:51 +02:00
Nikita Popov	c0cc98251a	[Float2Int] Make sure dependent ranges are calculated first (PR54669) The range calculation in walkForwards() assumes that the ranges of the operands have already been calculated. With the used visit order, this is not necessarily the case when there are multiple roots. (There is nothing guaranteeing that instructions are visited in topological order.) Fix this by queuing instructions for reprocessing if the operand ranges haven't been calculated yet. Fixes https://github.com/llvm/llvm-project/issues/54669. Differential Revision: https://reviews.llvm.org/D122817	2022-04-04 10:18:39 +02:00
Min-Yih Hsu	fccdc5618d	[M68k] Adopt VarLenCodeEmitter for shift / rotate instructions This patch is covered by existing MC tests.	2022-04-03 22:52:32 -07:00
Min-Yih Hsu	22201f499d	[M68k][test] Remove redundant CHECK-LABEL directive The associated test had a redundant CHECK-LABEL directive that might fail the test since the inception, but this issue was "burried" by a missing colon, which was addressed in fb65aaf0be09936e657d339f3dc8e62666a41956. Thus, the test finally failed after the said commit. This patch remove that CHECK-LABEL directive.	2022-04-03 22:51:03 -07:00
Yuanfang Chen	948f3deca9	Reland "[lit] Use sharding for GoogleTest format" This relands commit a87ba5c86d5d72defdbcdb278baad6515ec99463. Adjust llvm/utils/lit/tests/googletest-timeout.py for new test output.	2022-04-03 22:35:45 -07:00
Argyrios Kyrtzidis	5877df735d	[Support/BLAKE3] CMake: Remove the workaround that checks for "CC=ccache /path/to/clang" The LLVM builders that were doing that have been updated to use "-DLLVM_CCACHE_BUILD=ON" instead.	2022-04-03 21:02:02 -07:00
Augie Fackler	603ae73146	AttributorAttributes: guard against TLI being nullptr I didn't dig into this very much because it appears to be totally valid (especially once these properties can come from attributes instead of only from hard-coded library functions) for TLI to not be defined, and nothing broke when I added this check, including with all my other patches applied. Differential Revision: https://reviews.llvm.org/D122917	2022-04-03 23:19:23 -04:00
Augie Fackler	e90bce8f91	CallBase: fix getFnAttr so it also checks the function Prior to this change, CallBase::hasFnAttr checked the called function to see if it had an attribute if it wasn't set on the CallBase, but getFnAttr didn't do the same delegation, which led to very confusing behavior. This patch fixes the issue by making CallBase::getFnAttr also check the function under the same circumstances. Test changes look (to me) like they're cleaning up redundant attributes which no longer get specified both on the callee and call. We also clean up the one ad-hoc implementation of this getter over in InlineCost.cpp. Differential Revision: https://reviews.llvm.org/D122821	2022-04-03 23:19:23 -04:00
Philip Reames	88de27e3fd	[LV] Handle non-integral types when considering interleave widening legality In general, anywhere we might need to insert a blind bitcast, we need to make sure the types are losslessly convertible. This fixes pr54634.	2022-04-03 20:16:20 -07:00
Philip Reames	7c51669c21	[memcpyopt] Restructure store(load src, dest) form of callslotopt for compile time The search for the clobbering call is fairly expensive if uses are not optimized at construction. Defer the clobber walk to the point in the implementation we need it; there are a bunch of bailouts before that point. (e.g. If the source pointer is not an alloca, we can't do callslotopt.) On a test case which involves a bunch of copies from argument pointers, this switches memcpyopt from > 1/2 second to < 10ms.	2022-04-03 20:16:20 -07:00
Yuanfang Chen	c0f90c84b1	Revert "[lit] Use sharding for GoogleTest format" This reverts commit a87ba5c86d5d72defdbcdb278baad6515ec99463. Breaks bots: https://lab.llvm.org/buildbot/#/builders/196/builds/10454	2022-04-03 20:04:55 -07:00
Yuanfang Chen	a87ba5c86d	[lit] Use sharding for GoogleTest format This helps lit unit test performance by a lot, especially on windows. The performance gain comes from launching one gtest executable for many subtests instead of one (this is the current situation). The shards are executed by the test runner and the results are stored in the json format supported by the GoogleTest. Later in the test reporting stage, all test results in the json file are retrieved to continue the test results summary etc. On my Win10 desktop, before this patch: `check-clang-unit`: 177s, `check-llvm-unit`: 38s; after this patch: `check-clang-unit`: 37s, `check-llvm-unit`: 11s. On my Linux machine, before this patch: `check-clang-unit`: 46s, `check-llvm-unit`: 8s; after this patch: `check-clang-unit`: 7s, `check-llvm-unit`: 4s. Reviewed By: yln, rnk Differential Revision: https://reviews.llvm.org/D122251	2022-04-03 19:47:02 -07:00
Xiang1 Zhang	f830392be7	Correct spelling error in TLS-Load-Hoist	2022-04-04 08:27:54 +08:00
Dávid Bolvanský	872f7000fc	Revert "[NFCI] Regenerate SROA/LoopVectorize test checks" This reverts commit 14e3450fb57305aa9ff3e9e60687b458e43835c9.	2022-04-04 01:15:30 +02:00
Dávid Bolvanský	14e3450fb5	[NFCI] Regenerate SROA test checks	2022-04-04 00:55:54 +02:00
Dávid Bolvanský	e91fe08999	[NFCI] Regenerate PhaseOrdering test checks	2022-04-04 00:28:57 +02:00
Dávid Bolvanský	260679b000	[NFCI] Regenerate LoopIdiomRecognize test checks	2022-04-04 00:21:26 +02:00
David Green	3c88ff44c5	[AArch64] Remove unsued WideningBaseCost. NFC The WideningBaseCost is always 0. This removes it to clean up the code.	2022-04-03 22:16:39 +01:00
Dávid Bolvanský	a113a582b1	[NFCI] Regenerate LoopVectorize test checks	2022-04-03 21:56:24 +02:00
Kazu Hirata	d3684c3359	[IR] Remove unused forward declarations (NFC)	2022-04-03 12:54:54 -07:00
Dávid Bolvanský	11b41910dd	[NFCI] Regenerate instsimplify test checks	2022-04-03 20:55:15 +02:00
Kazu Hirata	e5121be910	Revert "Apply clang-tidy fixes for readability-redundant-declaration in Debug.cpp (NFC)" This reverts commit 0fe01a9346658c0955b68b123f2b470b018114b1. The commit caused build failures like: llvm/lib/Support/Debug.cpp:65:3: error: ‘setCurrentDebugTypes’ was not declared in this scope; did you mean ‘setCurrentDebugType’?	2022-04-03 08:14:11 -07:00
LLVM GN Syncbot	6020830e88	[gn build] Port e476df5629ee	2022-04-03 15:09:33 +00:00
Kazu Hirata	1fe01a9346	Apply clang-tidy fixes for readability-redundant-declaration in Debug.cpp (NFC)	2022-04-03 08:04:12 -07:00
Kazu Hirata	c45d369ced	Apply clang-tidy fixes for readability-redundant-member-init in YAMLParser.cpp (NFC)	2022-04-03 08:04:11 -07:00
Hirochika Matsumoto	f138a9964b	Reapply "[InstSimplify][NFC] Add baseline tests for folds of icmp with ctpop" This change was previously reverted because I forgot rerunning update_test_checks.py and tests were not actually baseline. Extracted from: https://reviews.llvm.org/D122757	2022-04-03 22:07:04 +09:00
Dávid Bolvanský	fb65aaf0be	[NFCI] Fixed missing colon in CHECK directives - part 2	2022-04-03 14:42:59 +02:00
Dávid Bolvanský	f02a0a69af	[NFCI] Fixed missing colon in CHECK directives	2022-04-03 11:52:38 +02:00
Simon Pilgrim	fbfd78f7aa	[X86] lowerShuffleAsRepeatedMaskAndLanePermute - allow v16i32 sub-lane permutes for v64i8 shuffles Without VBMI, we are better off permuting v16i32 sub-lanes, even though its a variable shuffle, if it allows us to then shuffle v64i8 inlane repeated masks (PSHUFB etc.) Fixes #54658	2022-04-03 10:05:10 +01:00
Alexander Shaposhnikov	6cf10b7e6e	[InstCombine] Fold srem(X, PowerOf2) == C into (X & Mask) == C for positive C This diff extends InstCombinerImpl::foldICmpSRemConstant to handle the cases srem(X, PowerOf2) == C and srem(X, PowerOf2) != C for positive C. This addresses the issue https://github.com/llvm/llvm-project/issues/54650 Differential revision: https://reviews.llvm.org/D122942 Test plan: make check-all	2022-04-03 03:57:05 +00:00
Alexander Shaposhnikov	911cfcd7f5	[InstCombine][NFC] Add baseline tests for folds of srem(X, PowerOf2) == C Extracted from: https://reviews.llvm.org/D122942 Test plan: make check-all	2022-04-03 03:26:47 +00:00

1 2 3 4 5 ...

230971 Commits