llvm-project

Author	SHA1	Message	Date
Jonas Devlieghere	d77556549a	[dwarfdump] Fix bogus incompatible tag warning Because DW_TAG_LLVM_ptrauth_type is not marked as a type, dwarfdump currently emits a spurious error: DIE has DW_AT_type with incompatible tag DW_TAG_LLVM_ptrauth_type. This patch fixes that and adds a test.	2022-10-26 16:39:40 -07:00
Steven Wu	76ebaf263b	[LTO] Fix lto_module_create_in_codegen_context return value on error According to the documentation, lto_module_create_in_codegen_context should return NULL on error, but currently it accidentally returns error_code. Since this is a bug fix and it seems to be a one-off bug that only affects this API, there is no need to bump API version. rdar://101505192 Reviewed By: pete Differential Revision: https://reviews.llvm.org/D136769	2022-10-26 15:13:22 -07:00
David Blaikie	637a594f5a	[ADT] Add deduction guide for llvm::Optional Added to address some uses of implicit CTAD added in clang recently.	2022-10-26 21:45:57 +00:00
Craig Topper	e94dc58dff	[RISCV] Inline scalar ceil/floor/trunc/rint/round/roundeven. This avoids the call overhead as well as the save/restore of fflags and the snan handling in the libm function. The save/restore of fflags and snan handling are needed to be correct for -ftrapping-math. I think we can ignore them in the default environment. The inline sequence will generate an invalid exception for nan and an inexact exception if fractional bits are discarded. I've used a custom inserter to explicitly create the control flow around the float->int->float conversion. We can probably avoid the final fsgnj after the conversion for no signed zeros FMF, but I'll leave that for future work. Note the comparison constant is slightly different than glibc uses. They use 1<<53 for double, I'm using 1<<52. I believe either is valid. Numbers >= 1<<52 can't have any fractional bits. It's ok to do the float->int->float conversion on numbers between 1<<53 and 1<<52 since they will all fit in 64 bits. We only have a problem if the double can't fit in i64. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D136508	2022-10-26 14:36:49 -07:00
Sanjay Patel	7d3a37a4b4	[InstCombine] add tests for demanded bits of sub; NFC	2022-10-26 17:23:33 -04:00
Craig Topper	0a03240fb4	[RISCV] Add tests for fixed vector sshl_sat/ushl_sat. NFC	2022-10-26 14:15:47 -07:00
Félix Cloutier	bca75abc01	Revert "[NFC] Make format() more amenable to format attributes" This reverts commit fb1e90ef07fec0d64a05c0b6d41117a5ea3e8344.	2022-10-26 12:53:14 -07:00
Nico Weber	457fe21d8b	[gn build] port cb0eb9d8dd5 (lldb test libc++ refs)	2022-10-26 15:30:10 -04:00
Félix Cloutier	fb1e90ef07	[NFC] Make format() more amenable to format attributes This change modifies the implementation of the format() function so that vendor forks committed to building with compilers that support __attribute__((format)) on non-variadic functions can check the format() function with it. Reviewed By: ahatanak Differential Revision: https://reviews.llvm.org/D132413 rdar://84571523	2022-10-26 12:10:42 -07:00
James Y Knight	26fdad031c	[MIPS] Fix useDeprecatedPositionallyEncodedOperands errors. This is a follow-on to https://reviews.llvm.org/D134073. The number of MIPS16 changes here is a bit surprising. Many of the fields with mismatched names were NOT previously choosing the correct argument positionally, but instead doing something completely wrong (e.g. it would encode a register where an immediate was expected). But, machine-code generation for MIPS16 has never actually functioned. It's also fully untested; thus, the MIPS16 changes, despite changing behavior, break (and fix) zero tests. This change does not fix MIPS16 output, but it ought to be at least incrementally less broken. Outside MIPS16, I believe the only functional change is to the 'ginvi' instruction: it was previously encoding garbage into a field which was specified to be '00'. Fortunately, it was covered by tests -- and the tests were testing the incorrect behavior. So, fixed. Differential Revision: https://reviews.llvm.org/D134220	2022-10-26 14:06:08 -04:00
James Y Knight	23394cd810	[Sparc] Fix useDeprecatedPositionallyEncodedOperands errors. This is a follow-on to https://reviews.llvm.org/D134073. It renames a few fields to have consistent names, as well as renaming operands to match the field names. Behavior is unchanged by this cleanup. (The only generated code change is for the disassembler for LDSTUB/LDSTUBA, but in both old and new versions, it fails to add enough operands, and thus triggers a runtime abort. I will address that bug in a future commit.) Differential Revision: https://reviews.llvm.org/D134201	2022-10-26 14:06:07 -04:00
James Y Knight	5713c2959c	Update "Writing a Backend" doc to use named operand matching. This brings it in line with recommended practice after the introduction of sub-operand naming in a538d1f13a13, and the deprecation of positional argument matching in 5351878ba196.	2022-10-26 14:06:07 -04:00
Sanjay Patel	1bd856fbe5	[InstCombine] add tests for demanded bits of sub; NFC	2022-10-26 14:04:46 -04:00
Sanjay Patel	54eeadcf44	[SDAG] avoid vector extract/insert around binop scalar-to-vector (scalar binop (extractelt V, Idx), C) --> shuffle (vector binop V, C'), {Idx, -1, -1...} We generally try to avoid ad-hoc vectorization in SDAG, but the motivating case from issue #39482 escapes our normal vectorization folds in IR. It seems like it should always be a win to transform this pattern in cases where we have the same vector type for input and output and the target supports the vector operation. That avoids transfers from vector to scalar and back. In the x86 shift examples, we create the scalar-to-vector node during legalization. I'm not sure if there's a more general way to create the pattern for testing. (If so, I could add tests for other targets.) Differential Revision: https://reviews.llvm.org/D136713	2022-10-26 14:04:46 -04:00
Matt Arsenault	f85ce1b236	ConstantFold: Reduce code duplication for checking commuted compare	2022-10-26 10:59:58 -07:00
Jamie Schmeiser	b115ba0050	[NFC] Introduce range based singleton searches for loop queries. Summary: Several loop queries look for a singleton by finding all instances and then returning whether there is 1 instance or not. This can be improved by stopping the search after 2 have been found. Introduce generic range based singleton searches that stop after finding a second value and use them for these loop queries. There is no intended functional change other than improved compile-time efficiency. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: Meinersbur (Michael Kruse) Differential Revision: https://reviews.llvm.org/D136261	2022-10-26 13:50:11 -04:00
Johannes Rudolf Doerfert	41a278f56a	[OpenMP][FIX] Do not add custom state machine eagerly in LTO runs If we run LTO optimization we might end up introducing a custom state machine and later transforming the region into SPMD. This is a problem. While a follow up will introduce a check for the SPMD conversion, this already prevents the eager custom state machine generation. Only if the kernel init function is defined, rather than declared, we will emit a custom state machine. SPMD-zation can happen eagerly though. Tests are adjusted via a weak definition. The LTO test was added to verify this works as expected. Differential Revision: https://reviews.llvm.org/D136740	2022-10-26 10:40:11 -07:00
Alex Brachet	443e2a10f6	Reland "[PGO] Make emitted symbols hidden" This was reverted because it was breaking when targeting Darwin, which tried to export these symbols which are now hidden. It should be safe to just stop attempting to export these symbols in the clang driver, though Apple folks will need to change their TAPI allow list described in the commit where these symbols were originally exported, `f538018562`. Then reverted again because it broke tests on macOS; they should be fixed now. Bug: https://github.com/llvm/llvm-project/issues/58265 Differential Revision: https://reviews.llvm.org/D135340	2022-10-26 17:13:05 +00:00
Piyou Chen	7d7940fd77	[RISCV] add svinval extension 1. Add the svinval extension support 2. Add the svinval Predicates for its instructions Note: the svinval instructions are defined in https://reviews.llvm.org/D117654 Reviewed By: reames Differential Revision: https://reviews.llvm.org/D136571	2022-10-26 09:45:30 -07:00
Craig Topper	a61b74889f	[RISCV] Use vslide1down for i64 insertelt on RV32. Instead of using vslide1up, use vslide1down and build the other direction. This avoids the overlap constraint early clobber of vslide1up. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D136735	2022-10-26 09:43:12 -07:00
Yashwant Singh	14fb4040e2	[AMDGPU][test] precommitting tests for D136663 More tests for si-peephole-sdwa pass	2022-10-26 22:08:28 +05:30
Momchil Velikov	5ea8951b88	[FuncSpec] Add a testcase for the treatment of constant and unused arguments Increase test coverage - check that functions are not specialised on constant or unused arguments. Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D136184	2022-10-26 17:25:18 +01:00
Michael Maitland	64d5aedd06	[TableGen] Add log bang operator This patch adds base 2 logarithm that returns integer result. I initially wanted to name it `!log2`, but numbers are not permitted in the name. The documentation makes sure to clarify that it is base 2 since it is not explicit in the operator name. Differential Revision: https://reviews.llvm.org/D134068	2022-10-26 09:16:32 -07:00
Sanjay Patel	3aec021118	[SDAG] add helper for opcodes that are not speculatable This is not quite NFC because one of the users should now avoid the DIVREM opcodes too, but I'm not sure how to test that. I used the same name as an analysis function in IR in case we want to expand this to include other operations. Another potential use is proposed in D136713.	2022-10-26 11:20:14 -04:00
Sanjay Patel	ef9dfcd6cd	[x86] add tests for extract + insert of vector shift amount; NFC	2022-10-26 11:20:14 -04:00
Johannes Doerfert	8ce5dee74b	[Docs][NFC] Update my office hour information	2022-10-26 08:14:54 -07:00
Anton Sidorenko	7bc7f2da76	[UpdateTestChecks] Sync flags in update_mir_test_checks.py with MIFlags Some instructions are not matched by update_mir_test_checks.py because MIFlags and regex in the script are not synchronized. Differential Revision: https://reviews.llvm.org/D136170	2022-10-26 17:07:46 +03:00
LLVM GN Syncbot	2e5bf4da99	[gn build] Port bb72d0dde29e	2022-10-26 13:25:32 +00:00
LLVM GN Syncbot	85e4d5583e	[gn build] Port 93ce23adb548	2022-10-26 13:20:22 +00:00
Momchil Velikov	9901583968	Revert "[FuncSpec] Fix specialisation based on literals" This reverts commit a8b0f580170089fcd555ade5565ceff0ec60f609 because of "reverse-iteration" buildbot failure.	2022-10-26 13:54:12 +01:00
Momchil Velikov	2c8a4c6e62	Revert "[FuncSpec][NFC] Refactor finding specialisation opportunities" This reverts commit a8853924bd3c50deebfbf993c037257ccf9805f4 due to dependency on a8b0f5801700	2022-10-26 13:54:12 +01:00
Anton Sidorenko	16fb9150be	[UpdateTestChecks] Precommit test for D136170	2022-10-26 15:49:39 +03:00
Guillaume Chatelet	1a726cfa83	Take memset_inline into account in analyzeLoadFromClobberingMemInst This appeared in https://reviews.llvm.org/D126903#3884061 Differential Revision: https://reviews.llvm.org/D136752	2022-10-26 09:50:13 +00:00
Momchil Velikov	a8853924bd	[FuncSpec][NFC] Refactor finding specialisation opportunities This patch reorders the traversal of function call sites and function formal parameters to: * do various argument feasibility checks (`isArgumentInteresting` ) only once per argument, i.e. doing N-args checks instead of N-calls x N-args checks. * do hash table lookups only once per call site, i.e. N-calls lookups/inserts instead of N-call x N-args lookups/inserts. Reviewed By: ChuanqiXu, labrinea Differential Revision: https://reviews.llvm.org/D135968	2022-10-26 10:18:35 +01:00
Momchil Velikov	606d25e545	[FuncSpec] Compute specialisation gain even when forcing specialisation When rewriting the call sites to call the new specialised functions, a single call site can be matched by two different specialisations - a "less specialised" version of the function and a "more specialised" version of the function, e.g. for a function void f(int x, int y) the call like `f(1, 2)` could be matched by either void f.1(int x /* int y == 2 */); or void f.2(/* int x == 1, int y == 2 */); The `FunctionSpecialisation` pass tries to match specialisation in the order of decreasing gain, so "more specialised" functions are preferred to "less specialised" functions. This breaks, however, when using the flag `-force-function-specialization`, in which case the cost/benefit analysis is not performed and all the specialisations are equally preferable. This patch makes the pass calculate specialisation gain and order the specialisations accordingly even when `-force-function-specialization` is used, under the assumption that this flag has purely debugging purpose and it is reasonable to ignore the extra computing effort it incurs. Reviewed By: ChuanqiXu, labrinea Differential Revision: https://reviews.llvm.org/D136180	2022-10-26 10:08:03 +01:00
Momchil Velikov	a8b0f58017	[FuncSpec] Fix specialisation based on literals The `FunctionSpecialization` pass has support for specialising functions that are called with literal arguments. This functionality is disabled by default and is enabled with the option `-function-specialization-for-literal-constant`. There are a few issues with the implementation, though: * even with the default, the pass will still specialise based on floating-point literals * even when it's enabled, the pass will specialise only for the `i1` type (or `i2` if all of the possible 4 values occur, or `i3` if all of the possible 8 values occur, etc) The reason for this is an incorrect check of the lattice value of the function formal parameter. The lattice value is `overdefined` when the constant range of the possible arguments is the full set, and this is the reason for the specialisation to trigger. However, if the set of the possible arguments is not the full set, that must not prevent the specialisation. This patch changes the pass to NOT consider a formal parameter when specialising a function if the lattice value for that parameter is: * unknown or undef * a constant * a constant range with a single element on the basis that specialisation is pointless for those cases. It also changes the criteria for picking up an actual argument to specialise if the argument: * is an LLVM IR constant * has `constant` lattice value * has `constantrange` lattice value with a single element. Reviewed By: ChuanqiXu Differential Revision: https://reviews.llvm.org/D135893	2022-10-26 09:55:33 +01:00
Haohai Wen	21f23a37c6	[SelectionDAG] Clamp stack alignment for memset, memmove memcpy has clamped dst stack alignment to NaturalStackAlignment if hasStackRealignment is false. We should also clamp stack alignment for memset and memmove. If we don't clamp, SelectionDAG may first do tail call optimization, which requires no stack realignment. Then memmove or memset in the same function may be lowered to load/store with larger alignment, leading PEI to emit stack realignment code, which is not correct. Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D136456	2022-10-26 16:45:31 +08:00
Pierre van Houtryve	63390dccd8	[GlobalISel] Add Predicates to GICombineRule Small QoL change to allow Predicates to be used in GICombineRule. Currently only one combine in the AMDGPU backend makes use of it. The implementation is pretty simple to get started but of course we can expand this later on and optimize predicate checking better if needed. Reviewed By: dsanders Differential Revision: https://reviews.llvm.org/D136681	2022-10-26 07:13:40 +00:00
Pierre van Houtryve	c1b2920c6e	[AMDGPU] Autogenerate llvm.amdgcn.fcmp.ll Prep commit for adding GISel run lines to that test. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D136591	2022-10-26 07:00:34 +00:00
wlei	bf97c1e066	[NFC] fix a wrong name change during rebase	2022-10-25 21:16:29 -07:00
wlei	91cc53d5a4	[llvm-profgen] Do not cache the frame location stack during computing inlined context size In `computeInlinedContextSizeForRange`, the offset of range is only used one time, so there is no need to cache the frame location stack. Measured on one internal service binary, this can save 2GB of memory usage and slightly reduce run time (it avoids one hash search). Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D128859	2022-10-25 21:08:36 -07:00
chenglin.bi	9403a8bc37	[GlobalISel][AArch64] Fix miscompile caused by wrong G_ZEXT selection in GISel The miscompile case's G_ZEXT has a G_FREEZE source. Similar to D127154, this patch removed isDef32, relying on the AArch64MIPeephole optimizer to remove redundant SUBREG_TO_REG nodes also in GISel. Fix #58431 Reviewed By: paquette Differential Revision: https://reviews.llvm.org/D136433	2022-10-26 09:54:13 +08:00
Lang Hames	b26f45e5a4	[ORC] Skip non-SHF_ALLOC sections in DebugObjectManagerPlugin. We don't need to provide a load-address for non-alloc sections. Skipping them allows us to avoid some complications, like handling duplicate .group sections.	2022-10-25 18:40:38 -07:00
Guozhi Wei	d24c93cc41	[X86] Enable reassociation for ADD instructions ADD is an associative and commutative operation, so we can do reassociation for it. Differential Revision: https://reviews.llvm.org/D136396	2022-10-26 00:46:13 +00:00
Matt Arsenault	8acddef90d	SimplifyLibCalls: Add missing testcase for sincospi Part of issue 58604. Test should have been part of 50fe87a5c8597eb72e6055356fa7dad364756ff7	2022-10-25 17:06:08 -07:00
Matt Arsenault	a91c17498a	GlobalISel: Fix copy paste error Pretty sure this was harmless since the tablegen calling convention definitions do not use pointers. Part of issue 58604	2022-10-25 17:06:00 -07:00
Douglas Yung	fc40c73921	Revert "Update supported features in the generic CPU configuration" This reverts commit 11afbf396e10e1b1e91a5991e2aec1916e29a910. There are 10 tests still failing after follow-up fix b5d0bf9b9853, this should get the following bots back to green: - https://lab.llvm.org/buildbot/#/builders/183/builds/8194 - https://lab.llvm.org/buildbot/#/builders/186/builds/9491 - https://lab.llvm.org/buildbot/#/builders/214/builds/3908 - https://lab.llvm.org/buildbot/#/builders/93/builds/11740 - https://lab.llvm.org/buildbot/#/builders/231/builds/4200 - https://lab.llvm.org/buildbot/#/builders/121/builds/24519 - https://lab.llvm.org/buildbot/#/builders/230/builds/4466 - https://lab.llvm.org/buildbot/#/builders/94/builds/11639 - https://lab.llvm.org/buildbot/#/builders/45/builds/9325 - https://lab.llvm.org/buildbot/#/builders/124/builds/5219 - https://lab.llvm.org/buildbot/#/builders/67/builds/8623 - https://lab.llvm.org/buildbot/#/builders/123/builds/13836 - https://lab.llvm.org/buildbot/#/builders/109/builds/49355 - https://lab.llvm.org/buildbot/#/builders/58/builds/27751 - https://lab.llvm.org/buildbot/#/builders/117/builds/9922 - https://lab.llvm.org/buildbot/#/builders/16/builds/37012 - https://lab.llvm.org/buildbot/#/builders/104/builds/9490 - https://lab.llvm.org/buildbot/#/builders/42/builds/7725 - https://lab.llvm.org/buildbot/#/builders/196/builds/20077 - https://lab.llvm.org/buildbot/#/builders/3/builds/15217 - https://lab.llvm.org/buildbot/#/builders/6/builds/15251 - https://lab.llvm.org/buildbot/#/builders/9/builds/15247 - https://lab.llvm.org/buildbot/#/builders/36/builds/26487 - https://lab.llvm.org/buildbot/#/builders/54/builds/2474 - https://lab.llvm.org/buildbot/#/builders/74/builds/14536 - https://lab.llvm.org/buildbot/#/builders/5/builds/28555	2022-10-25 16:34:08 -07:00
Momchil Velikov	1a525dec7f	[FuncSpec] Fix missed opportunities for function specialisation When collecting the possible constant arguments to specialise a function the compiler will abandon the search on the first argument that is for some reason unsuitable as a specialisation constant. Thus, depending on the traversal order of the functions and call sites, the compiler can end up with a different set of possible constants, hence with different set of specialisations. With this patch, the compiler will skip unsuitable constants, but nevertheless will continue searching for more. Reviewed By: ChuanqiXu Differential Revision: https://reviews.llvm.org/D135867	2022-10-25 23:19:48 +01:00
Philip Reames	269bc684e7	[LV][RISCV] Disable vectorization of epilogue loops Epilogue loop vectorization is a feature in the vectorizer intended to avoid running fully scalar code when the vector length of the main loop turns out to be either longer than the trip count of the actual loop, or with a huge remainder. In practice, this feature appears to not have been well tuned. I honestly don't think it should be on by default at all, but it definitely shouldn't be on for RISCV. Note that other targets have also disabled it, but they've done so via disabling interleaving - which is, well, completely unrelated - and we don't want to do that for RISCV. In the near term, many examples I'm seeing have terrible codegen for epilogue vectorization. We are greatly increasing code size for little value at reasonable VLEN values for small types. In the long term, the cases that epilogue vectorization is intended to handle are likely better handled via tail folding on RISCV. As an aside, I also don't really trust the correctness of epilogue vectorization. The code structure is such that otherwise straightforward changes sometimes break only epilogue vectorization. The reuse of an existing vplan without careful validation opens significant room for nasty bugs. Given how rarely the code is exercised, that is not a good combination. As such, this patch introduces a TTI hook, and completely disables epilogue vectorization on RISCV. Differential Revision: https://reviews.llvm.org/D136695	2022-10-25 14:28:02 -07:00
Arthur Eubanks	ef37504879	[Instrumentation] Remove legacy passes Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D136615	2022-10-25 13:11:07 -07:00
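
The threshold argument in commit e94dc58dff (doubles with magnitude >= 1<<52 have no fractional bits, so only smaller values need the float->int->float round trip) can be illustrated with a small Python sketch. This is not the RISC-V instruction sequence from the patch; the function name `inline_floor`, the constant name, and the floor adjustment step are assumptions made for illustration only.

```python
import math

# Assumption for illustration: the 1<<52 comparison constant from the commit
# message, expressed as a double.
NO_FRACTION_THRESHOLD = float(1 << 52)


def inline_floor(x: float) -> float:
    """Floor via a float -> int -> float round trip, guarded by a magnitude check."""
    # NaN fails the comparison, so NaN, infinities, and values with
    # magnitude >= 2**52 (already integral) are returned unchanged.
    if not (abs(x) < NO_FRACTION_THRESHOLD):
        return x
    truncated = int(x)          # float -> int, truncates toward zero
    result = float(truncated)   # int -> float
    if result > x:              # truncation rounded a negative value up
        result -= 1.0
    # Copy the input's sign so that -0.0 stays -0.0 (the role the commit
    # message assigns to the final fsgnj).
    return math.copysign(result, x)
```

For example, `inline_floor(2.7)` yields `2.0` and `inline_floor(-0.5)` yields `-1.0`, while a value such as `float(1 << 60)` skips the conversion entirely.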

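The singleton search described in commit b115ba0050 (stop scanning as soon as a second match is found, rather than collecting all matches and counting them) can be sketched as follows. This is a Python illustration of the idea only, not the LLVM C++ helper; the name `find_singleton` and its signature are hypothetical.

```python
from typing import Callable, Iterable, Optional, TypeVar

T = TypeVar("T")


def find_singleton(items: Iterable[T], predicate: Callable[[T], bool]) -> Optional[T]:
    """Return the only item satisfying predicate, or None if there are zero or several."""
    found: Optional[T] = None
    have_one = False
    for item in items:
        if not predicate(item):
            continue
        if have_one:
            # A second match means there is no singleton; stop searching now.
            return None
        found = item
        have_one = True
    return found
```

Because the loop returns on the second match, the remaining elements of a long range are never visited, which is where the compile-time saving mentioned in the commit message would come from.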

240562 Commits