llvm-project

Author	SHA1	Message	Date
Alexey Bataev	dbd00c3b5d	[SLP][NFC]Add a test for gather node with mixed load/non-load scalars.	2023-11-10 08:40:58 -08:00
Ramkumar Ramachandra	2302e4c327	Reland "VectorUtils: mark xrint as trivially vectorizable" (#71416 ) With the recent change 98c90a13 (ISel: introduce vector ISD::LRINT, ISD::LLRINT; custom RISCV lowering), it is now possible for SLPVectorizer, LoopVectorize, and Scalarizer to operate on llvm.lrint and llvm.llrint, with vector codegen for the RISC-V target. Make a trivial change to VectorUtils, and update the corresponding tests. A couple of important fixes have been landed since the original patch was landed and reverted, and it is now safe to re-land the patch: 5e1d81a (LegalizeIntegerTypes: implement PromoteIntRes for xrint) and fd887a3 (LegalizeVectorTypes: fix bug in widening of vec result in xrint). See also #71399, which proves that lrint and llrint will indeed produce vector codegen on RISC-V. Fixes #55208.	2023-11-06 18:49:49 +00:00
Alexey Bataev	ac254fc055	[SLP]Improve tryToGatherExtractElements by using per-register analysis. Currently tryToGatherExtractElements function analyzes the whole vector, regrdless number of actual registers, used in this vector. It may prevent some optimizations, because per-register analysis may allow to simplify the final code by reusing more already emitted vectors and better shuffles. Differential Revision: https://reviews.llvm.org/D148855	2023-11-06 07:29:27 -08:00
Hans Wennborg	046c57e705	Revert "[SLP]Improve tryToGatherExtractElements by using per-register analysis." This causes asserts: llvm-project/llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp:10082: Value llvm::slpvectorizer::BoUpSLP::ShuffleInstructionBuilder::adjustExtracts( const TreeEntry , MutableArrayRef<int>, unsigned int, bool &): Assertion `Part == 0 && "Expected firs part."' failed. See comment on the code review. > Currently tryToGatherExtractElements function analyzes the whole vector, > regrdless number of actual registers, used in this vector. It may > prevent some optimizations, because per-register analysis may allow to > simplify the final code by reusing more already emitted vectors and > better shuffles. > > Differential Revision: https://reviews.llvm.org/D148855 This reverts commit 9dfdbd788707edc8c39eb2bff16004aba1f3586b.	2023-11-06 13:56:42 +01:00
Alexey Bataev	9dfdbd7887	[SLP]Improve tryToGatherExtractElements by using per-register analysis. Currently tryToGatherExtractElements function analyzes the whole vector, regrdless number of actual registers, used in this vector. It may prevent some optimizations, because per-register analysis may allow to simplify the final code by reusing more already emitted vectors and better shuffles. Differential Revision: https://reviews.llvm.org/D148855	2023-11-03 10:43:58 -07:00
Nikita Popov	e4a4122eb6	[IR] Remove zext and sext constant expressions (#71040 ) Remove support for zext and sext constant expressions. All places creating them have been removed beforehand, so this just removes the APIs and uses of these constant expressions in tests. There is some additional cleanup that can be done on top of this, e.g. we can remove the ZExtInst vs ZExtOperator footgun. This is part of https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179.	2023-11-03 10:46:07 +01:00
Martin Storsjö	66152f4eed	Revert "[SLP]Improve tryToGatherExtractElements by using per-register analysis." This reverts commit 3e6d7c6d983dd5896e3a03857584654eb1360fda. That commit caused miscompilation of ffmpeg's libavcodec/vp9dsp_8bpp.o on aarch64; the file still compiles correctly, but no longer produces the right result - see https://reviews.llvm.org/D148855#4655968 for details.	2023-11-03 00:08:17 +02:00
Alexey Bataev	495ed8d8c8	[SLP]Fix PR70507: freeze poisonous insts to avoid poison propagation. If the reduction instruction is not bool logical op, but reduced within bool logical op reduction list, need to freeze to avoid poison propagation.	2023-11-02 10:37:38 -07:00
Alexey Bataev	033d2b71d2	[SLP][NFC]Add a test to show poison propagation in mixed (non)bool logical ops reduction, NFC.	2023-11-02 09:58:13 -07:00
Alexey Bataev	3e6d7c6d98	[SLP]Improve tryToGatherExtractElements by using per-register analysis. Currently tryToGatherExtractElements function analyzes the whole vector, regrdless number of actual registers, used in this vector. It may prevent some optimizations, because per-register analysis may allow to simplify the final code by reusing more already emitted vectors and better shuffles. Differential Revision: https://reviews.llvm.org/D148855	2023-11-01 10:42:35 -07:00
Alexey Bataev	6e8d957a22	Revert "[SLP]Improve tryToGatherExtractElements by using per-register analysis." This reverts commit 0a34aaedd8ec2dc2375076976c1327fdbfd7877f to fix fails reported in https://lab.llvm.org/buildbot/#/builders/265/builds/40	2023-11-01 08:52:31 -07:00
Alexey Bataev	c28b7eb496	[SLP]Fix handling of -slp-vectorize-hor-store for values with many uses.	2023-11-01 08:41:54 -07:00
Alexey Bataev	c449a64c3e	[SLP][NFC]Add the test shoing issue with -slp-vectorize-hor-store option, NFC.	2023-11-01 08:31:18 -07:00
Alexey Bataev	0a34aaedd8	[SLP]Improve tryToGatherExtractElements by using per-register analysis. Currently tryToGatherExtractElements function analyzes the whole vector, regrdless number of actual registers, used in this vector. It may prevent some optimizations, because per-register analysis may allow to simplify the final code by reusing more already emitted vectors and better shuffles. Differential Revision: https://reviews.llvm.org/D148855	2023-11-01 07:44:49 -07:00
Ramkumar Ramachandra	ac7c816dc2	Revert "VectorUtils: mark lrint, llrint as trivially vectorizable (#69945 )" This reverts commit 5bfd89bda7c2d5ff167c7bcea0c8d69b0b498f08. It was causing build failures on ffmpeg on i686.	2023-11-01 09:57:22 +00:00
Ramkumar Ramachandra	5bfd89bda7	VectorUtils: mark lrint, llrint as trivially vectorizable (#69945 ) With the recent change 98c90a13 (ISel: introduce vector ISD::LRINT, ISD::LLRINT; custom RISCV lowering), it is now possible for SLPVectorizer, LoopVectorize, and Scalarizer to operate on llvm.lrint and llvm.llrint, with vector codegen for the RISC-V target. Make a trivial change to VectorUtils, and update the corresponding tests.	2023-10-31 21:29:15 +00:00
Alexey Bataev	4c997e1536	[SLP]Fix PR70507: emit freeeze whenever required for bool logical ops in the middle of reduction ops. Need to emit freeze instruction not only in the case, where the root is bool logical op, but also if we reduce several scalars, but unable to say precisely, if the root is bool logical op.	2023-10-31 12:23:12 -07:00
Alexey Bataev	0e8cbb6ac8	[SLP][NFC]Add a test with poisonous reduction, seeding bool logical op. NFC.	2023-10-31 12:10:10 -07:00
Alexey Bataev	9da19e4340	[SLP]Fix PR70507: correctly handle bool logical ops in reductions. If the very first reduction operation is not bool logical op, but some others are, still need to emit the boo logic op for all the extra reduction operations to avoid incorrect poison propagation.	2023-10-30 14:09:08 -07:00
Alexey Bataev	71bf052ec9	[SLP][NFC]Add a test for bool logic ops reduction, NFC.	2023-10-30 13:38:57 -07:00
Philip Reames	3f2ed812f0	[InstCombine] Infer nneg on zext when forming from non-negative sext (#70706 ) Builds on #67982 which recently introduced the nneg flag on a zext instruction. InstCombine is one of our largest canonicalizers of zext from non-negative sext instructions, so set the flag there.	2023-10-30 12:09:43 -07:00
Philip Reames	89564f0b69	Regenerate a set of auto-update tests [nfc] To reduce the spurious test delta in an upcoming change.	2023-10-30 11:36:43 -07:00
Alexey Bataev	af15c46777	[SLP]Do not crash if number of vector registers does not feet the vector type. Need to check, if the number of vector registers, returned by TTI, is not greater than total number of mask element and not zero, before trying to perform any operations. TTI still may return non-valid number of registers.	2023-10-30 07:30:52 -07:00
Antonio Frighetto	138e6c1c86	[AArch64][TTI] Improve `LegalVF` when gather loads are scalarized After determining the cost of loads that could not be coalesced into `VectorizedLoads` in SLP, computing the cost of a gather-vectorized load is carried out. Favour a potentially high valid cost when the type of a group of loads, whose type is a vector of size dependent upon `VF`, may be legalized into a scalar value. Fixes: https://github.com/llvm/llvm-project/issues/68953.	2023-10-27 20:22:54 +02:00
Alex Richardson	e39f6c1844	[opt] Infer DataLayout from triple if not specified There are many tests that specify a target triple/CPU flags but no DataLayout which can lead to IR being generated that has unusual behaviour. This commit attempts to use the default DataLayout based on the relevant flags if there is no explicit override on the command line or in the IR file. One thing that is not currently possible to differentiate from a missing datalayout `target datalayout = ""` in the IR file since the current APIs don't allow detecting this case. If it is considered useful to support this case (instead of passing "-data-layout=" on the command line), I can change IR parsers to track whether they have seen such a directive and change the callback type. Differential Revision: https://reviews.llvm.org/D141060	2023-10-26 12:07:37 -07:00
Alexey Bataev	196d154ab7	[SLP]Improve isGatherShuffledEntry by trying per-register shuffle. Currently when building gather/buildvector node, we try to build nodes shuffles without taking into account separate vector registers. We can improve final codegen and the whole vectorization process by including this info into the analysis and the vector code emission, allows to emit better vectorized code. Differential Revision: https://reviews.llvm.org/D149742	2023-10-26 08:51:37 -07:00
Alexey Bataev	c65ec9d919	Revert "[SLP]Improve isGatherShuffledEntry by trying per-register shuffle." This reverts commit 560bad013ebcb8d2c2c1722e35270b9a70ab40ce to fix a bug reported in https://lab.llvm.org/buildbot/#/builders/5/builds/37763.	2023-10-26 08:36:50 -07:00
Simon Pilgrim	585da2651f	[SLP][X86] Regenerate hadd/hsub tests with full set of check-prefixes Prep for D148855	2023-10-26 14:39:46 +01:00
Alexey Bataev	560bad013e	[SLP]Improve isGatherShuffledEntry by trying per-register shuffle. Currently when building gather/buildvector node, we try to build nodes shuffles without taking into account separate vector registers. We can improve final codegen and the whole vectorization process by including this info into the analysis and the vector code emission, allows to emit better vectorized code. Differential Revision: https://reviews.llvm.org/D149742	2023-10-26 05:57:03 -07:00
Ramkumar Ramachandra	aa30018e66	SLP/RISCV: add negative test for llrint, increase coverage (#69940 ) To follow-up on a06be8a (SLP/RISCV: add negative test for lrint), add a negative test for llvm.llrint as well, and increase the coverage to cover vectors of length 2, 4, and 8, and the i32 variant of lrint, in preparation to get SLPVectorizer to vectorize both lrint and llrint. This is now possible with the recent change 98c90a1 (ISel: introduce vector ISD::LRINT, ISD::LLRINT; custom RISCV lowering).	2023-10-25 17:26:39 +01:00
Valery Dmitriev	3324776d9c	[SLP] Improve gather tree nodes matching when users are PHIs. (#70111 ) This is re-commit of #69392 and also fixes issue #69670 which was uncovered with the prior commit. For delayed gather emission it may be incorrect to use stab instruction as insertion point if it is a PHI operand. For that case insertion point is adjusted to be at the end of block, ensuring that prior dependecy vector code is emitted earlier.	2023-10-24 16:39:36 -07:00
Valery Dmitriev	117041dac9	[NFC][SLP] Add test case for issue #69670 . (#70088 ) Test exposes issue with delayed gather emission, which may lead to generating an instruction which does not dominate all users.	2023-10-24 12:36:38 -07:00
Alexey Bataev	d79051f894	[SLP]Fix PR70004: Do not change insert point for reduction gather nodes. No need to change the insert point for reduction gather node, we can use the ReductionRoot as insert point instead to avoid possible crashes.	2023-10-24 09:19:59 -07:00
Alexey Bataev	8d307f59ee	[SLP]Fix PR69246: do not treat resizing maskas identity. If the mask is resizing and the mask size is greater than than the length of the vector, being reused from extractelement instructions, the mask for undefs cannot be treated as identity, must be treated as a broadcast.	2023-10-24 08:14:13 -07:00
Alexey Bataev	254558ac53	[SLP]Fix PR69976: Check for multi-node uses during node building. Need to check if there is already a node created for the multi-node instruction before ending up with creating a new node for such instructions.	2023-10-24 07:01:46 -07:00
Douglas Yung	734b016b66	Revert "[SLP] Improve gather tree nodes matching when users are PHIs. (#69392 )" This reverts commit c80b50349648dcf7fcbf4ae69c62b3d34bee0c70. This change causes a fatal error in the backend and is filed as issue #69670.	2023-10-20 10:59:07 -07:00
Alexey Bataev	553616a213	[SLP][NFC]Add avx2 test run, NFC.	2023-10-19 09:12:37 -07:00
Valery Dmitriev	c80b503496	[SLP] Improve gather tree nodes matching when users are PHIs. (#69392 )	2023-10-18 09:05:11 -07:00
Valery Dmitriev	3caccb22ab	[NFC][SLP] Test case exposing gather nodes matching deficiency affecting cost. (#69382 )	2023-10-17 14:58:10 -07:00
Alexey Bataev	66775f8ccd	[SLP]Fix PR69196: Instruction does not dominate all uses During emission of the postponed gathers, need to insert them before user instruction to avoid use before definition crash.	2023-10-17 10:43:59 -07:00
Alexey Bataev	119b0f3895	Revert "[SLP]Fix PR69196: Instruction does not dominate all uses" This reverts commit 8e2b2c4181506efc5b9321c203dd107bbd63392b to fix a crash reported in https://lab.llvm.org/buildbot/#/builders/230/builds/19993.	2023-10-16 13:29:17 -07:00
Alexey Bataev	8e2b2c4181	[SLP]Fix PR69196: Instruction does not dominate all uses During emission of the postponed gathers, need to insert them before user instruction to avoid use before definition crash.	2023-10-16 12:57:18 -07:00
Alexey Bataev	e22818d5c9	[IR]Add NumSrcElts param to is..Mask static function in ShuffleVectorInst. Need to add NumSrcElts param to is..Mask functions in ShuffleVectorInstruction class for better mask analysis. Mask.size() not always matches the sizes of the permuted vector(s). Allows to better estimate the cost in SLP and fix uses of the functions in other cases. Differential Revision: https://reviews.llvm.org/D158449	2023-10-05 06:17:07 -07:00
Alexey Bataev	2c49311dea	[SLP][NFC]Add insertsubvector test with small source vector, NFC.	2023-10-05 06:03:58 -07:00
Arthur Eubanks	07389535a7	Revert "[IR]Add NumSrcElts param to is..Mask static function in ShuffleVectorInst." This reverts commit b186f1f68be11630355afb0c08b80374a6d31782. Causes crashes, see https://reviews.llvm.org/D158449.	2023-10-04 14:37:16 -07:00
Alex Richardson	e86d6a43f0	Regenerate test checks for tests affected by D141060	2023-10-04 10:51:35 -07:00
Alexey Bataev	b186f1f68b	[IR]Add NumSrcElts param to is..Mask static function in ShuffleVectorInst. Need to add NumSrcElts param to is..Mask functions in ShuffleVectorInstruction class for better mask analysis. Mask.size() not always matches the sizes of the permuted vector(s). Allows to better estimate the cost in SLP and fix uses of the functions in other cases. Differential Revision: https://reviews.llvm.org/D158449	2023-10-04 07:53:30 -07:00
Alexey Bataev	ff48e83f18	[SLP][NFC]Add a test for reused extracts corner case, NFC.	2023-10-04 06:28:49 -07:00
Alexey Bataev	1129dec778	Revert "[IR]Add NumSrcElts param to is..Mask static function in ShuffleVectorInst." This reverts commit 6f43d28f3452b3ef598bc12b761cfc2dbd0f34c9 to fix a crash reported in https://reviews.llvm.org/D158449.	2023-10-03 13:02:16 -07:00
Alexey Bataev	6f43d28f34	[IR]Add NumSrcElts param to is..Mask static function in ShuffleVectorInst. Need to add NumSrcElts param to is..Mask functions in ShuffleVectorInstruction class for better mask analysis. Mask.size() not always matches the sizes of the permuted vector(s). Allows to better estimate the cost in SLP and fix uses of the functions in other cases. Differential Revision: https://reviews.llvm.org/D158449	2023-10-03 10:26:11 -07:00

1 2 3 4 5 ...

1529 Commits