llvm-project

Author	SHA1	Message	Date
Louis Dionne	9783f28cbb	[libc++] Format the code base (#74334 ) This patch runs clang-format on all of libcxx/include and libcxx/src, in accordance with the RFC discussed at [1]. Follow-up patches will format the benchmarks, the test suite and remaining parts of the code. I'm splitting this one into its own patch so the diff is a bit easier to review. This patch was generated with: find libcxx/include libcxx/src -type f \ \| grep -v 'module.modulemap.in' \ \| grep -v 'CMakeLists.txt' \ \| grep -v 'README.txt' \ \| grep -v 'libcxx.imp' \ \| grep -v '__config_site.in' \ \| xargs clang-format -i A Git merge driver is available in libcxx/utils/clang-format-merge-driver.sh to help resolve merge and rebase issues across these formatting changes. [1]: https://discourse.llvm.org/t/rfc-clang-formatting-all-of-libc-once-and-for-all	2023-12-18 14:01:33 -05:00
Stanislav Mekhanoshin	e5c523e861	[AMDGPU] Produce better memoperand for LDS DMA (#75247 ) 1) It was marked as volatile. This is not needed and the only reason it was done is because it is both load and store and handled together with atomics. Global load to LDS was marked as volatile just because buffer load was done that way. 2) Preserve at least LDS (store) pointer which we always have with the intrinsics. 3) Use PoisonValue instead of nullptr for load memop as a Value.	2023-12-18 11:01:12 -08:00
Stanislav Mekhanoshin	94230ce548	[AMDGPU] Fix lack of LDS DMA check in the AA handling (#75249 ) SIInstrInfo::areMemAccessesTriviallyDisjoint does a DS offset checks, but does not account for LDS DMA instructions. Added these checks. Without it code falls through and returns true which is wrong. As a result mayAlias would always return false for LDS DMA and a regular LDS instruction or 2 LDS DMA instructions. At the moment this is NFCI because we do not use this AA in a context which may touch LDS DMA instructions. This is also unreacheable now because of the ordered memory ref checks just above in the function and LDS DMA is marked as volatile. This volatile marking is removed in PR #75247, therefore I'd submit this check before #75247.	2023-12-18 10:58:50 -08:00
Justin Bogner	4f54d71501	[HLSL][DirectX] Move handling of resource element types into the frontend Rather than shepherding a type name all the way to the backend as a string and attempting to parse it, get the element type out of the AST and store that in the resource annotation metadata directly. Pull Request: https://github.com/llvm/llvm-project/pull/75674	2023-12-18 11:43:52 -07:00
Ziqing Luo	4123b95949	[-Wunsafe-buffer-usage] Add a subgroup `-Wunsafe-buffer-usage-in-container` (#75665 ) Add a sub diagnostic group under `-Wunsafe-buffer-usage` controlled by `-Wunsafe-buffer-usage-in-container`. The subgroup will include warnings on misuses of `std::span`, `std::vector`, and `std::array`.	2023-12-18 10:39:15 -08:00
Arthur Eubanks	363f708fcb	[gn build] Manually port 945c645a and a0a3c793	2023-12-18 10:24:13 -08:00
lntue	791c5d0eb8	[libc] Improve get_object_files_for_test to reduce CMake configure time for tests. (#75552 ) Profiling cmake shows that a significant time configuring `libc` folder is spent on running `get_object_files_for_test` in the `test` folder (13 sec in `libc/test` folder / 16 sec in `libc` folder). By caching all needed objects for each target instead of resolving every time, the time cmake spends on configuring `libc/test` folder is reduced to ~1s.	2023-12-18 13:07:05 -05:00
srcarroll	b26ee97537	[MLIR][Linalg] Support dynamic sizes in `lower_unpack` (#75494 )	2023-12-18 19:02:04 +01:00
Fangrui Song	644e6d7d8b	[llvm-objdump] --disassemble-symbols: skip inline relocs from symbols that are not dumped (#75724 ) When a section contains two functions x1 and x2, we incorrectly display x1's relocations when dumping x2 for `--disassemble-symbols=x2 -r`. Fix #75539 by ignoring these relocations.	2023-12-18 09:53:44 -08:00
Fangrui Song	96aca7c517	[LTO] Improve diagnostics handling when parsing module-level inline assembly (#75726 ) Non-LTO compiles set the buffer name to "<inline asm>" (`AsmPrinter::addInlineAsmDiagBuffer`) and pass diagnostics to `ClangDiagnosticHandler` (through the `MCContext` handler in `MachineModuleInfoWrapperPass::doInitialization`) to ensure that the exit code is 1 in the presence of errors. In contrast, LTO compiles spuriously succeed even if error messages are printed. ``` % cat a.c void _start() {} asm("unknown instruction"); % clang -c a.c <inline asm>:1:1: error: invalid instruction mnemonic 'unknown' 1 \| unknown instruction \| ^ 1 error generated. % clang -c -flto a.c; echo $? # -flto=thin is the same error: invalid instruction mnemonic 'unknown' unknown instruction ^~~~~~~ error: invalid instruction mnemonic 'unknown' unknown instruction ^~~~~~~ 0 ``` `CollectAsmSymbols` parses inline assembly and is transitively called by both `ModuleSummaryIndexAnalysis::run` and `WriteBitcodeToFile`, leading to duplicate diagnostics. This patch updates `CollectAsmSymbols` to be similar to non-LTO compiles. ``` % clang -c -flto=thin a.c; echo $? <inline asm>:1:1: error: invalid instruction mnemonic 'unknown' 1 \| unknown instruction \| ^ 1 errors generated. 1 ``` The `HasErrors` check does not prevent duplicate warnings but assembler warnings are very uncommon.	2023-12-18 09:46:58 -08:00
Rik Huijzer	672f1a036a	[mlir][memref] Make `LoadOp::verify` error more clear (#75831 ) While debugging https://github.com/llvm/llvm-project/issues/71326, the `LoadOp::verify` code and error were very confusing. This PR improves that. This code was a part from the reverted PR https://github.com/llvm/llvm-project/pull/75519. Fixing the `-convert-vector-to-scf` issue is going to take a bit longer and this code was out of scope anyway. Co-authored-by: Benjamin Maxwell <macdue@dueutil.tech>	2023-12-18 18:41:05 +01:00
Mingming Liu	3aa5d71127	Revert "[PGO][GlobalValue][LTO]In GlobalValues::getGlobalIdentifier, use semicolon as delimiter for local-linkage varibles." (#75835 ) Reverts llvm/llvm-project#74008 The compiler-rt test failed due to `llvm-dis` not found (https://lab.llvm.org/buildbot/#/builders/127/builds/59884) Will revert and investigate how to require the proper dependency.	2023-12-18 09:39:55 -08:00
Shilei Tian	3768039913	[OpenMP] Directly use user's grid and block size in kernel language mode (#70612 ) In kernel language mode, use user's grid and blocks size directly. No validity check, which means if user's values are too large, the launch will fail, similar to what CUDA and HIP are doing right now.	2023-12-18 12:26:18 -05:00
Joseph Huber	8e2cc19e2e	[LinkerWrapper] Forward more arguments to the CPU offloading linker (#75757 ) Summary: The CPU target currently inherits all the libraries from the normal link job to ensure that it has access to the same envrionment that the host does. However, this previously was not respecting argument libraries that are passed by name rather than `-l` as well as the whole archive flags. This patch fixes this to allow the CPU linker to correctly pick up the libraries associated with things like address sanitizers. Fixes: https://github.com/llvm/llvm-project/issues/75651	2023-12-18 11:16:49 -06:00
Mingming Liu	245cddae70	[PGO][GlobalValue][LTO]In GlobalValues::getGlobalIdentifier, use semicolon as delimiter for local-linkage varibles. (#74008 ) Commit fe05193 (phab D156569), IRPGO names uses format `[<filepath>;]<linkage-name>` while prior format is `[<filepath>:<mangled-name>`. The format change would break the use case demonstrated in (updated) `llvm/test/Transforms/PGOProfile/thinlto_indirect_call_promotion.ll` and `compiler-rt/test/profile/instrprof-thinlto-indirect-call-promotion.cpp` This patch changes `GlobalValues::getGlobalIdentifer` to use the semicolon. To elaborate on the scenario how things break without this PR 1. IRPGO raw profiles stores (compressed) IRPGO names of functions in one section, and per-function profile data in another section. The [NameRef](`fc715e4cd9/compiler-rt/include/profile/InstrProfData.inc (L72)`) field in per-function profile data is the MD5 hash of IRPGO names. 2. When raw profiles are converted to indexed format profiles, the profiled address is [mapped](`fc715e4cd9/llvm/lib/ProfileData/InstrProf.cpp (L876-L885)`) to the MD5 hash of the callee. 3. In `pgo-instr-use` thin-lto prelink pipeline, MD5 hash of IRPGO names will be [annotated](`fc715e4cd9/llvm/lib/Transforms/Instrumentation/PGOInstrumentation.cpp (L1707)`) as value profiles, and used to import indirect-call-prom candidates. If the annotated MD5 hash is computed from the new format while import uses the prior format, the callee cannot be imported. `compiler-rt/test/profile/instrprof-thinlto-indirect-call-promotion.cpp` is added to have an end-to-end test. `llvm/test/Transforms/PGOProfile/thinlto_indirect_call_promotion.ll` is updated to have better test coverage from another aspect (as runtime tests are more sensitive to the environment and may be skipped by some contributors)	2023-12-18 09:10:39 -08:00
Nikita Popov	e400c59beb	Revert "[InstCombine] Favour `m_Poison` in `SimplifyDemandedVectorElts`" This reverts commit 318d5bff0b65aa7d52fc7004d49587416f0fb564. Has incomplete test updates.	2023-12-18 18:08:57 +01:00
Joseph Huber	913622d012	[Libomptarget] Remove remaining global constructors in plugins (#75814 ) Summary: This patch fixes the remaining global constructor in the plguins after addressing the ones in the JIT interface. This struct was mistakenly using global constructors as not all the members were being initialized properly. This was almost certainly being optimized out because it's trivial, but would still be present in debug builds and prevented us from compiling with `-Werror=global-constructors`. We will want to do that once offloading is moved to a runtimes only build.	2023-12-18 11:01:02 -06:00
Sam Tebbs	fade67565e	[AArch64][SME2] Enable bfm builtins for sme2 (#71927 ) This patch enables the following builtins for SME2 svbfmlslb_f32 svbfmlslb_lane_f32 svbfmlslt_f32 svbfmlslt_lane_f32 Patch by: Kerry McLaughlin <kerry.mclaughlin@arm.com> --------- Co-authored-by: Matthew Devereau <matthew.devereau@arm.com>	2023-12-18 16:51:10 +00:00
Momchil Velikov	35a77fc13c	[Clang][SVE2.1] Update names of the `svwhileXX` builtins with predicate-as-counter (#75200 ) The `_s64`/`_u64` part can be omitted now and the name variants do not include unsigned comparison mnemonics. Both are inferred from the argument types.	2023-12-18 16:49:59 +00:00
Mark de Wever	3b0705827d	[libc++][modules] Adds CMake 3.28 support. (#75700 ) This is a preparation to start using CMake 3.28 in the CI.	2023-12-18 17:49:11 +01:00
Kerry McLaughlin	f888e739d3	[Clang][SME2] Add multi-vector zip & unzip builtins (#74841 ) Adds the following SME2 builtins: - svzip (x2 & x4) - svzipq (x2 & x4) - svuzp (x2 & x4) - svuzpq (x2 & x4) See https://github.com/ARM-software/acle/pull/217/files Patch by David Sherwood <david.sherwood@arm.com>	2023-12-18 16:38:36 +00:00
Antonio Frighetto	318d5bff0b	[InstCombine] Favour `m_Poison` in `SimplifyDemandedVectorElts` A miscompilation issue has been addressed with refined checking.	2023-12-18 17:28:39 +01:00
Simon Pilgrim	7b1e4239b3	[DAG] Fold (vt trunc (extload (vt x))) -> (vt load x) (#75229 ) We were only folding cases which remained extloads, but DAG.getExtLoad can also handle the cases which don't need to extend at all (we just can't do truncloads). reduceLoadWidth can handle this for scalar loads, but not for vectors. Noticed while triaging D152928	2023-12-18 16:21:11 +00:00
Sam Tebbs	a0a3c793d2	[Clang][SME] Warn when a function doesn't have ZA state (#75805 ) This patch adds a warning that's emitted when a builtin call uses ZA state but the calling function doesn't provide any. Patch by David Sherwood <david.sherwood@arm.com>.	2023-12-18 16:14:25 +00:00
Oleksandr "Alex" Zinenko	4d9d105c70	[mlir] fix filecheck prefixes in a dataflow test (#75794 ) -SAME and -LITERAL do not compose in CHECK commands.	2023-12-18 17:11:21 +01:00
Nathan Sidwell	d0285a31c8	aarch64: fix testcase (#75723 ) Add missing < %s to RUN line.	2023-12-18 11:02:44 -05:00
Nikita Popov	cd54c47424	[InstCombine] Match poison instead of undef in foldVectorBinop() Some negative tests turn into positive tests, as the differences between undef and poison propagation allow additional transforms.	2023-12-18 17:01:59 +01:00
Momchil Velikov	fd527def7e	[Clang][SVE2.1] Add floating-point variants of `svrevd_XX` (#75117 )	2023-12-18 15:52:28 +00:00
Nikita Popov	ddd11537e2	[InstCombine] Match poison instead of undef in binop of same-mask shuffle fold	2023-12-18 16:41:38 +01:00
Abhina Sree	8a233d8cfd	[SystemZ][z/OS] Add guard for dl_info and dladdr (#75637 ) This patch fixes the following build error on z/OS `error: unknown type name 'Dl_info'` by adding a guard to check if we have dladdr.	2023-12-18 10:33:23 -05:00
Nikita Popov	9d25b28b9e	[InstCombine] Explicitly canonicalize splat shuffles to use poison RHS This is usually handled by demanded elements simplification. However, as that is not supported for scalable vectors, also handle it explicitly here.	2023-12-18 16:30:40 +01:00
Shilei Tian	0f5eef1fb3	[OpenMP][Clang] Force use of `num_teams` and `thread_limit` for bare kernel (#68373 ) This patch makes `num_teams` and `thread_limit` mandatory for bare kernels, similar to a reguar kernel language that when launching a kernel, the grid size has to be set explicitly.	2023-12-18 10:29:26 -05:00
Joseph Huber	1580877555	[Libomptarget] Remove bitcode image map used for JIT processing (#75672 ) Summary: Libomptarget supports JIT by treating an LLVM-IR file as a regular input image. The handling here used a global map to keep track of triples once it was parsed. This was done to same time, however this created a global constructor as well as an extra mutex to handle it. This patch removes the use of this map. Instead, we simply use the file magic to perform a quick check if the input image is valid bitcode. If not, we then create a lazy module. This should roughly equivalent to the old handling that create an IR symbol table. Here we can prevent the module from materializing everything but the single triple metadata we read in later.	2023-12-18 09:28:06 -06:00
Mark de Wever	27767959a5	[libc++][CI] Enable modules in no TZDB build. (#75704 ) This is one of the few build where we didn't test modules.	2023-12-18 10:27:13 -05:00
Dominik Wójt	e60167927b	[libc++] Add a picolibc test configuration with exceptions enabled (#75462 )	2023-12-18 10:25:50 -05:00
Joseph Huber	87b45f1ca6	[llvm-readobj] Print AMDGPU specific values in GNU mode (#75661 ) Summary: Currently, we don't emit any of the AMDGPU specific flags and options when printing in GNU-mode (the default). This patch adds extra code to handle outputting these values to match the output from GNU's `readelf` when used on an AMDGPU image.	2023-12-18 09:22:22 -06:00
Nikita Popov	a5f3415533	[InstCombine] Replace non-demanded undef vector with poison If an operand (esp to shufflevector or insertelement) is not demanded, canonicalize it from undef to poison.	2023-12-18 16:12:37 +01:00
Podchishchaeva, Mariya	5cda366221	Revert "[clang] Fix false positive -Wmissing-field-initializer for anonymous unions (#70829 )" This reverts commit a01307a6ee788fc6ac2e09e58f0f52e5666def86 and its follow-up fix 32d5221ec4810dd723ccebaabbda1df5d3b4cfcf. It caused unexpected warnings emitted for nested designators in C.	2023-12-18 07:08:27 -08:00
Gheorghe-Teodor Bercea	cd1038a46a	[OpenMP][libomptarget][Fix]Require presence of libomptarget-debug for newly added test (#75807 ) Require presence of libomptarget-debug fixes https://github.com/llvm/llvm-project/pull/75642	2023-12-18 10:07:52 -05:00
Louis Dionne	a35629cd8d	[libc++] Remove assumptions that std::array::iterator is a raw pointer (#74624 ) This patch removes assumptions that std::array's iterators are raw pointers in the source code and in our test suite. While this is true right now, this doesn't have to be true and ion the future we might want to enable bounded iterators in std::array, which would require this change. This is a pre-requisite for landing #74482	2023-12-18 10:00:47 -05:00
Nikita Popov	d0605e21af	[InstCombine] Canonicalize splat shuffles to use poison operand If the splat shuffle is represented using an undef RHS, replace it with poison.	2023-12-18 15:57:49 +01:00
Gheorghe-Teodor Bercea	4ef6587715	[Clang][OpenMP] Fix mapping of structs to device (#75642 ) Fix mapping of structs to device. The following example fails: ``` #include <stdio.h> #include <stdlib.h> struct Descriptor { int datum; long int x; int xi; long int arr[1][30]; }; int main() { Descriptor dat = Descriptor(); dat.datum = (int )malloc(sizeof(int)*10); dat.xi = 3; dat.arr[0][0] = 1; #pragma omp target enter data map(to: dat.datum[:10]) map(to: dat) #pragma omp target { dat.xi = 4; dat.datum[dat.arr[0][0]] = dat.xi; } #pragma omp target exit data map(from: dat) return 0; } ``` This is a rework of the previous attempt: https://github.com/llvm/llvm-project/pull/72410	2023-12-18 09:47:59 -05:00
Alexey Bataev	58a2c4e2f2	[SLP][NFC]Check for equal opcode preliminary to meet weak strict order requirement, NFC. This change does not affect functionality, just fixes the assertions in some standard c++ library implementations.	2023-12-18 06:42:03 -08:00
Nikita Popov	6d91905f97	[ValueTracking] Short-circuit on unknown bits in isKnownNonEqual() (NFC) Don't bother computing known bits for the second operand if we know nothing about the first.	2023-12-18 15:36:38 +01:00
Nikita Popov	e93d324adb	[InstCombine] Preserve poison in evaluateInDifferentElementOrder() Don't unnecessarily replace poison with undef.	2023-12-18 15:36:22 +01:00
Matt Arsenault	629412938b	Triple: Fix handling of macos with unexpected target arches (#75469 ) Some tools with a specified target arch, but no full triple default to the host triple. On macos hosts, this would then force using macho on targets that didn't expect it, resulting in assertions. We should also probably emit explicit errors if the object format is specified on targets which don't handle it.	2023-12-18 21:28:33 +07:00
Nikita Popov	6c9813aa02	[InstCombine] Check for poison instead of undef in shuffle combine Otherwise we may replace undef with poison. Note that a lot of tests regressing here already have variants that use poison instead of undef (often in a separate inseltpoison file), which is why I'm not adjusting them to the new pattern.	2023-12-18 15:19:16 +01:00
Jay Foad	7e5019e82b	[AMDGPU] Simplify WaitcntBrackets::getRegInterval with getPhysRegBaseClass (#74087 ) This means that getRegInterval no longer depends on the MCInstrDesc, so it could be simplified further to take just a MachineOperand or just a physical register. NFCI.	2023-12-18 14:16:02 +00:00
Nikita Popov	fda3a134bc	[InstCombine] Regenerate test checks (NFC)	2023-12-18 15:00:59 +01:00
Oleksandr "Alex" Zinenko	32a4e3fcca	[mlir] support non-interprocedural dataflow analyses (#75583 ) The core implementation of the dataflow anlysis framework is interpocedural by design. While this offers better analysis precision, it also comes with additional cost as it takes longer for the analysis to reach the fixpoint state. Add a configuration mechanism to the dataflow solver to control whether it operates inteprocedurally or not to offer clients a choice. As a positive side effect, this change also adds hooks for explicitly processing external/opaque function calls in the dataflow analyses, e.g., based off of attributes present in the the function declaration or call operation such as alias scopes and modref available in the LLVM dialect. This change should not affect existing analyses and the default solver configuration remains interprocedural. Co-authored-by: Jacob Peng <jacobmpeng@gmail.com>	2023-12-18 14:16:52 +01:00

1 2 3 4 5 ...

484225 Commits