llvm-project

Author	SHA1	Message	Date
Eli Friedman	295e346da4	[EarlyIfConversion] Don't if-convert unconditional branches. A block ending in an unconditional branch can have two successors if one is a landing pad. In practice, I think this only has an effect on Windows because landing pads are never empty for Itanium unwinding. (Alternatively, I could add a check to AArch64InstrInfo::canInsertSelect, but this seems more obvious.) Differential Revision: https://reviews.llvm.org/D56468 llvm-svn: 351142	2019-01-15 00:19:46 +00:00
Nikita Popov	5885eec35a	Revert "[CodeGen][X86] Expand USUBSAT to UMAX+SUB, also for vectors" This reverts commit r351125. I missed test changes in an SLPVectorizer test, due to the cost model changes. Reverting for now. llvm-svn: 351129	2019-01-14 22:18:39 +00:00
Nikita Popov	8e9a8432a8	[CodeGen][X86] Expand USUBSAT to UMAX+SUB, also for vectors Related to https://bugs.llvm.org/show_bug.cgi?id=40123. Rather than scalarizing, expand a vector USUBSAT into UMAX+SUB, which produces much better code for X86. Differential Revision: https://reviews.llvm.org/D56636 llvm-svn: 351125	2019-01-14 21:43:30 +00:00
Adrian Prantl	fa2e35838c	Reapply r345008 "Split MachinePipeliner code into header and cpp files" Split MachinePipeliner code into header and cpp files to allow inheritance from SwingSchedulerDAG. This reapplies https://reviews.llvm.org/D56084 after moving the implementation of the dump functions into the .cpp files. This fixes a linker error when building with Clang modules enables and local submodule visibility disabled. Original patch by Lama Saba <lama.saba@intel.com>! llvm-svn: 351077	2019-01-14 17:24:11 +00:00
Nirav Dave	3badfe74a2	Reland "Refactor GetRegistersForValue. NFCI." Remove over-strictification class membership check. llvm-svn: 351074	2019-01-14 17:09:45 +00:00
Simon Pilgrim	a1bd4a6ba4	[DAGCombiner] Add (sub_sat x, x) -> 0 combine llvm-svn: 351073	2019-01-14 15:43:34 +00:00
Simon Pilgrim	fa1f518748	[DAGCombiner] Enable sub saturation constant folding llvm-svn: 351072	2019-01-14 15:28:53 +00:00
Simon Pilgrim	7fc6882374	[DAGCombiner] Add add/sub saturation undef handling Match ConstantFolding.cpp: (add_sat x, undef) -> -1 (sub_sat x, undef) -> 0 llvm-svn: 351070	2019-01-14 14:16:24 +00:00
Simon Pilgrim	cfa5f06dde	[DAGCombiner] Enable add saturation constant folding llvm-svn: 351060	2019-01-14 12:34:31 +00:00
Simon Pilgrim	67610926fc	[DAGCombiner] Add add saturation constant folding tests. Exposes an issue with sadd_sat for computeOverflowKind, so I've disabled it for now. llvm-svn: 351057	2019-01-14 12:12:42 +00:00
Simon Pilgrim	3d42815cd8	[SelectionDAG] Add type sanity assertions for add/sub saturation node creation. llvm-svn: 351055	2019-01-14 11:56:59 +00:00
Francis Visoiu Mistrih	b7cef81fd3	Replace "no-frame-pointer-" function attributes with "frame-pointer" Part of the effort to refactoring frame pointer code generation. We used to use two function attributes "no-frame-pointer-elim" and "no-frame-pointer-elim-non-leaf" to represent three kinds of frame pointer usage: (all) frames use frame pointer, (non-leaf) frames use frame pointer, (none) frame use frame pointer. This CL makes the idea explicit by using only one enum function attribute "frame-pointer" Option "-frame-pointer=" replaces "-disable-fp-elim" for tools such as llc. "no-frame-pointer-elim" and "no-frame-pointer-elim-non-leaf" are still supported for easy migration to "frame-pointer". tests are mostly updated with // replace command line args ‘-disable-fp-elim=false’ with ‘-frame-pointer=none’ grep -iIrnl '\-disable-fp-elim=false' \| xargs sed -i '' -e "s/-disable-fp-elim=false/-frame-pointer=none/g" // replace command line args ‘-disable-fp-elim’ with ‘-frame-pointer=all’ grep -iIrnl '\-disable-fp-elim' * \| xargs sed -i '' -e "s/-disable-fp-elim/-frame-pointer=all/g" Patch by Yuanfang Chen (tabloid.adroit)! Differential Revision: https://reviews.llvm.org/D56351 llvm-svn: 351049	2019-01-14 10:55:55 +00:00
Simon Pilgrim	56ba1db933	[DAGCombiner] If add_sat(x,y) can't overflow -> add(x,y) NOTE: We need more powerful signed overflow detection in computeOverflowKind llvm-svn: 351026	2019-01-13 22:08:26 +00:00
Simon Pilgrim	888fa8680c	Fix unused variable warning. NFCI. llvm-svn: 351025	2019-01-13 21:53:12 +00:00
Simon Pilgrim	897d4c6fe9	[DAGCombiner] Some very basic add/sub saturation combines. Handle combines with zero and constant canonicalization for adds. llvm-svn: 351024	2019-01-13 21:50:24 +00:00
Craig Topper	4978de36e4	[LegalizeDAG] Remove 'NeedInvert' code from expansion of BR_CC. Replace with an assert. I accidentally triggered this code while doing some experiments and it doesn't look lke it could possibly work. It calls 'getNOT' on a node that should be a CondCode. I think to do this right we would need to swap the branch target and the fallthrough target. But that's not easy to do. Or we could create an explicit SetCC and feed that into a new BR_CC? llvm-svn: 351022	2019-01-13 19:33:30 +00:00
Nikita Popov	0400e50445	[X86] Rename overly verbose method; NFC As suggested on D56636. llvm-svn: 351021	2019-01-13 16:41:26 +00:00
Benjamin Kramer	b17d2136ea	Give helper classes/functions local linkage. NFC. llvm-svn: 351016	2019-01-12 18:36:22 +00:00
Sanjay Patel	625d5aef62	[DAGCombiner] fold insert_subvector of insert_subvector This pattern: t33: v8i32 = insert_subvector undef:v8i32, t35, Constant:i64<0> t21: v16i32 = insert_subvector undef:v16i32, t33, Constant:i64<0> ...shows up in PR33758: https://bugs.llvm.org/show_bug.cgi?id=33758 ...although this patch doesn't make any difference to the final result on that yet. In the affected tests here, it looks like it just makes RA wiggle. But we might as well squash this to prevent it interfering with other pattern-matching. Differential Revision: https://reviews.llvm.org/D56604 llvm-svn: 351008	2019-01-12 15:12:28 +00:00
Simon Pilgrim	0d92c4debc	Use getShiftAmountTy for shift amounts. llvm-svn: 351005	2019-01-12 12:00:43 +00:00
Simon Pilgrim	ca0de0363b	[X86][AARCH64] Improve ISD::ABS support This patch takes some of the code from D49837 to allow us to enable ISD::ABS support for all SSE vector types. Differential Revision: https://reviews.llvm.org/D56544 llvm-svn: 350998	2019-01-12 09:59:32 +00:00
Pirama Arumuga Nainar	cc07dabdaa	[Legalizer] Use correct ValueType of SELECT_CC node during Float promotion Summary: When legalizing the result of a SELECT_CC node by promoting the floating-point type, use the promoted-to type rather than the original type. Fix PR40273. Reviewers: efriedma, majnemer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D56566 llvm-svn: 350951	2019-01-11 18:46:02 +00:00
Martin Storsjo	114ad37c1d	Revert "[SelectionDAGBuilder] Refactor GetRegistersForValue. NFCI." This reverts commit r350841, as it actually had functional changes and broke compilation. See PR40290. llvm-svn: 350921	2019-01-11 07:31:17 +00:00
Gerolf Hoflehner	cb7d968f73	[MachineCombiner][NFC] Prevent dereferencing past-the-end object in an MRI container llvm-svn: 350896	2019-01-10 21:53:13 +00:00
Sanjay Patel	9b368f39a9	[DAGCombiner] simplify code; NFC llvm-svn: 350844	2019-01-10 16:47:42 +00:00
Nirav Dave	cd18977add	[SelectionDAGBuilder] Refactor GetRegistersForValue. NFCI. llvm-svn: 350841	2019-01-10 16:25:47 +00:00
Nirav Dave	4817c0e46c	[SelectionDAGBuilder] Fix formatting. NFC. llvm-svn: 350839	2019-01-10 16:22:19 +00:00
Nirav Dave	57f2c14860	[SelectionDAGBuilder] Refactor visitInlineAsm. NFC. llvm-svn: 350837	2019-01-10 16:18:18 +00:00
James Y Knight	62df5eed16	[opaque pointer types] Remove some calls to generic Type subtype accessors. That is, remove many of the calls to Type::getNumContainedTypes(), Type::subtypes(), and Type::getContainedType(N). I'm not intending to remove these accessors -- they are useful/necessary in some cases. However, removing the pointee type from pointers would potentially break some uses, and reducing the number of calls makes it easier to audit. llvm-svn: 350835	2019-01-10 16:07:20 +00:00
Francis Visoiu Mistrih	ac6454a7f6	[CodeGen] Ignore return sext/zext attributes of unused results for tail calls If the caller's return type does not have a zeroext attribute but the callee does a tail call zeroext, we won't consider the tail call during CodeGenPrepare because the attributes don't match. However, if the result of the tail call has no uses, it makes sense to drop the sext/zext attributes. Differential Revision: https://reviews.llvm.org/D56486 llvm-svn: 350753	2019-01-09 19:46:15 +00:00
David Stenberg	33b192d72b	[DebugInfo] Omit location list entries with empty ranges Summary: This fixes PR39710. In that case we emitted a location list looking like this: .Ldebug_loc0: .quad .Lfunc_begin0-.Lfunc_begin0 .quad .Lfunc_begin0-.Lfunc_begin0 .short 1 # Loc expr size .byte 85 # DW_OP_reg5 .quad .Lfunc_begin0-.Lfunc_begin0 .quad .Lfunc_end0-.Lfunc_begin0 .short 1 # Loc expr size .byte 85 # super-register DW_OP_reg5 .quad 0 .quad 0 As seen, the first entry's beginning and ending addresses evalute to 0, which meant that the entry inadvertently became an "end of list" entry, resulting in the location list ending sooner than expected. To fix this, omit all entries with empty ranges. Location list entries with empty ranges do not have any effect, as specified by DWARF, so we might as well drop them: "A location list entry (but not a base address selection or end of list entry) whose beginning and ending addresses are equal has no effect because the size of the range covered by such an entry is zero." Reviewers: davide, aprantl, dblaikie Reviewed By: aprantl Subscribers: javed.absar, JDevlieghere, llvm-commits Tags: #debug-info Differential Revision: https://reviews.llvm.org/D55919 llvm-svn: 350698	2019-01-09 09:58:59 +00:00
Matt Arsenault	3dddb163dd	GlobalISel: Implement fewerElements for implicit_def llvm-svn: 350697	2019-01-09 07:51:52 +00:00
Matt Arsenault	befee402ff	GlobalISel: Implement widenScalar for implicit_def llvm-svn: 350695	2019-01-09 07:34:14 +00:00
Hiroshi Inoue	dad8c6a1c9	[NFC] fix trivial typos in comments llvm-svn: 350690	2019-01-09 05:11:10 +00:00
Stanislav Mekhanoshin	ed0d6c60af	Remove check for single use in ShrinkDemandedConstant This removes check for single use from general ShrinkDemandedConstant to the BE because of the AArch64 regression after D56289/rL350475. After several hours of experiments I did not come up with a testcase failing on any other targets if check is not performed. Moreover, direct call to ShrinkDemandedConstant is not really needed and superceed by SimplifyDemandedBits. Differential Revision: https://reviews.llvm.org/D56406 llvm-svn: 350684	2019-01-09 02:24:22 +00:00
Matt Arsenault	0ad1b71fe3	RegisterCoalescer: Assume CR_Replace for SubRangeJoin Currently it's possible for following check on V.WriteLanes (which is not really meaningful during SubRangeJoin) to pass for one half of the pair, and then fall through to to one of the impossible or unresolved states. This then fails as inconsistent on the other half. During the main range join, the check between V.WriteLanes and OtherV.ValidLanes must have passed, meaning this should be a CR_Replace. Fixes most of the testcases in bugs 39542 and 39602 llvm-svn: 350678	2019-01-08 23:22:18 +00:00
Matt Arsenault	2c807410fd	RegisterCoalescer: Defer clearing implicit_def lanes We can't go back and recover the lanes if it turns out the implicit_def really can't be erased. Assume all lanes are valid if an unresolved conflict is encountered. There aren't any tests where this seems to matter either way, but this seems like a safer option. Fixes bug 39602 llvm-svn: 350676	2019-01-08 23:10:47 +00:00
Adrian Prantl	8a753a2e5a	Revert "Revert "Revert "Resubmit rL345008 "Split MachinePipeliner code into header and cpp files"""" This reverts commit D56084. llvm-svn: 350654	2019-01-08 21:05:10 +00:00
Paul Robinson	7402fd9a35	Rename DIFlagFixedEnum to DIFlagEnumClass. NFC llvm-svn: 350641	2019-01-08 17:52:29 +00:00
Florian Hahn	c1ece1b41b	[MachineVerifier] Include offending register in allocatable live-in error msg. This patch adds a convenience report() method for physical registers and uses it to print the offending register with the 'MBB has allocatable live-in' error. Reviewers: MatzeB, rtereshin, dsanders Reviewed By: dsanders Differential Revision: https://reviews.llvm.org/D55946 llvm-svn: 350630	2019-01-08 15:16:23 +00:00
Petr Pavlu	bf4fdecc51	[GlobalISel] Fix choice of instruction selector for AArch64 at -O0 with -global-isel=0 Commit rL347861 introduced an unintentional change in the behaviour when compiling for AArch64 at -O0 with -global-isel=0. Previously, explicitly disabling GlobalISel resulted in using FastISel but an updated condition in the commit changed it to using SelectionDAG. The patch fixes this condition and slightly better organizes the code that chooses the instruction selector. Fixes PR40131. Differential Revision: https://reviews.llvm.org/D56266 llvm-svn: 350626	2019-01-08 14:19:06 +00:00
Lama Saba	32f08399eb	Revert "Revert "Resubmit rL345008 "Split MachinePipeliner code into header and cpp files""" This reverts commit rL350497 reported remaining issues seem to be unrelated to modules or this change. more info: https://reviews.llvm.org/D56084 llvm-svn: 350621	2019-01-08 13:30:36 +00:00
Benjamin Kramer	a480523ce9	[GlobalISel] Fix unused variable warning in Release builds. llvm-svn: 350618	2019-01-08 12:54:26 +00:00
Matt Arsenault	376f2ef2f0	Fix typos llvm-svn: 350597	2019-01-08 01:25:47 +00:00
Matt Arsenault	adc40baa29	RegBankSelect: Fix copy insertion point for terminators If a copy was needed to handle the condition of brcond, it was being inserted before the defining instruction. Add tests for iterator edge cases. I find the existing code here suspect for the case where it's looking for terminators that modify the register. It's going to insert a copy in the middle of the terminators, which isn't allowed (it might be necessary to have a COPY_terminator if anybody actually needs this). Also legalize brcond for AMDGPU. llvm-svn: 350595	2019-01-08 01:22:47 +00:00
Wei Mi	2645fd0ece	[RegisterCoalescer] dst register's live interval needs to be updated when merging a src register in ToBeUpdated set. This is to fix PR40061 related with https://reviews.llvm.org/rL339035. In https://reviews.llvm.org/rL339035, live interval of source pseudo register in rematerialized copy may be saved in ToBeUpdated set and its update may be postponed. In PR40061, %t2 = %t1 is rematerialized and %t1 is added into toBeUpdated set to postpone its live interval update. After the rematerialization, the live interval of %t1 is larger than necessary. Then %t1 is merged into %t3 and %t1 gets removed. After the merge, %t3 contains live interval larger than necessary. Because %t3 is not in toBeUpdated set, its live interval is not updated after register coalescing and it will break some assumption in regalloc. The patch requires the live interval of destination register in a merge to be updated if the source register is in ToBeUpdated. Differential revision: https://reviews.llvm.org/D55867 llvm-svn: 350586	2019-01-08 00:26:11 +00:00
Craig Topper	826f44b550	[TargetLowering][AMDGPU] Remove the SimplifyDemandedBits function that takes a User and OpIdx. Stop using it in AMDGPU target for simplifyI24. As we saw in D56057 when we tried to use this function on X86, it's unsafe. It allows the operand node to have multiple users, but doesn't prevent recursing past the first node when it does have multiple users. This can cause other simplifications earlier in the graph without regard to what bits are needed by the other users of the first node. Ideally all we should do to the first node if it has multiple uses is bypass it when its not needed by the user we started from. Doing any other transformation that SimplifyDemandedBits can do like turning ZEXT/SEXT into AEXT would result in an increase in instructions. Fortunately, we already have a function that can do just that, GetDemandedBits. It will only make transformations that involve bypassing a node. This patch changes AMDGPU's simplifyI24, to use a combination of GetDemandedBits to handle the multiple use simplifications. And then uses the regular SimplifyDemandedBits on each operand to handle simplifications allowed when the operand only has a single use. Unfortunately, GetDemandedBits simplifies constants more aggressively than SimplifyDemandedBits. This caused the -7 constant in the changed test to be simplified to remove the upper bits. I had to modify computeKnownBits to account for this by ignoring the upper 8 bits of the input. Differential Revision: https://reviews.llvm.org/D56087 llvm-svn: 350560	2019-01-07 19:30:43 +00:00
Lama Saba	f385c21f79	Revert "Resubmit rL345008 "Split MachinePipeliner code into header and cpp files"" This reverts commit rL350493 issues related to modules still appear in http://green.lab.llvm.org/green/job/lldb-cmake llvm-svn: 350497	2019-01-06 16:39:14 +00:00
Lama Saba	ea9d555b83	Resubmit rL345008 "Split MachinePipeliner code into header and cpp files" Resubmitted in rL345290 and reverted in rL350345 due to failures in http://green.lab.llvm.org/green/job/lldb-cmake/ Resubmitting after a workaround to lldb-cmake failure was committed in rL350346, more info in https://reviews.llvm.org/D56084 llvm-svn: 350493	2019-01-06 15:45:40 +00:00
Craig Topper	57fc891c1b	[LegalizeVectorOps] Add FSHL/FSHR to the list of vector operations that should be handled. The FSHL/FSHR nodes are handled in the expand function, but they need to also be listed in the code that queries for the operation action too. llvm-svn: 350490	2019-01-06 07:06:35 +00:00

... 3 4 5 6 7 ...

25772 Commits