llvm-project

Author	SHA1	Message	Date
Fangrui Song	4a6f59ac3c	[ELF] Replace lld::warn with Warn(ctx)	2024-11-16 13:38:43 -08:00
Fangrui Song	6c19fa4bfc	[ELF] Remove unneeded toString(Error) when using ELFSyncStream	2024-11-16 13:31:05 -08:00
Fangrui Song	38870fe124	[ELF] Remove unneeded toString(Error) when using ELFSyncStream	2024-11-16 13:22:06 -08:00
Fangrui Song	be5dad012e	[ELF] Replace internalLinkerError(getErrorLoc(ctx, buf) + ...) with InternalErr(ctx, buf) and simplify `+ toStr(ctx, x)` to `<< x`. The trailing '\n' << llvm::getBugReportMsg() is not very useful and therefore removed.	2024-11-16 13:07:17 -08:00
Kazu Hirata	7642759498	[Lex] Remove unused includes (NFC) (#116460 ) Identified with misc-include-cleaner.	2024-11-16 12:14:06 -08:00
Kazu Hirata	73e89cf66d	[Mips] Remove unused includes (NFC) (#116499 ) Identified with misc-include-cleaner.	2024-11-16 12:13:26 -08:00
Fangrui Song	a6755bdad1	[ELF] Replace global ctx with getCtx()	2024-11-16 12:11:00 -08:00
Lee Wei	64b9753d03	[llvm] Replace `UndefValue` placeholders with `PoisonValue` in unit tests [NFC] (#116453 ) This PR replaces all `UndefValue` act as placeholders with `PoisonValue` in `llvm/unittests`.	2024-11-16 20:01:27 +00:00
Fangrui Song	58a971f42f	[ELF] Replace contex-less toString(x) with toStr(ctx, x) so that we can remove the global `ctx` from toString implementations. Rename lld::toString (to lld:🧝:toStr) to simplify name lookup (we have many llvm::toString and another lld::toString(const llvm::opt::Arg &)).	2024-11-16 11:58:10 -08:00
David Green	549413fa40	[AArch64][GlobalISel] Protect against folding loads across basic blocks. isObviouslySafeToFold can look between a load and an instruction it can be folded into, to check that no other memory operations prevents the fold. It doesn't handle multiple basic blocks which we needs to guard against.	2024-11-16 19:52:44 +00:00
Mészáros Gergely	797330e96c	[ADT][NFCI]: Fix iterator category for graph iterators with external storage (#116403 ) Set the iterator category for graph iterators to input_iterator_tag when the visited set is stored externally. In that case we can't provide multi-pass guarantee, so we should not claim to be a forward iterator. Fixes: #116400	2024-11-16 20:46:15 +01:00
Fangrui Song	3fb83f65c4	[ELF] Replace toString(RelType) with operator<< while using ELFSyncStream	2024-11-16 11:45:46 -08:00
Kazu Hirata	612b779963	[memprof] Update a comment (NFC) (#116500 ) Note that Version0 has been removed in #116442.	2024-11-16 11:06:04 -08:00
David Green	935d753c6d	[AArch64][GlobalISel] Add test coverage fir ld1r combines. NFC	2024-11-16 18:36:14 +00:00
LLVM GN Syncbot	3f67544b4d	[gn build] Port dedc5159997e	2024-11-16 18:30:12 +00:00
Nikolas Klauser	dedc515999	[libc++] Avoid including <string> in <mutex> (#116254 )	2024-11-16 19:25:03 +01:00
Nikolas Klauser	adb80d8a4c	[libc++] Address post-commit comments for __scope_guard (#116291 ) Fixes #116204	2024-11-16 19:23:58 +01:00
Tobias Hieta	e2b4a700fd	[Utils] Fixed rebase in merge-release-pr script (#116340 ) Recently GitHub changed something on their side so we no longer can rebase release PR's with the API. This means that we now have to manually rebase the PR locally and then push the results. This fixes the script that I use to merge PRs to the release branch by changing the rebase part to do the local rebase and also adds a new option --rebase-only so that you can rebase the PRs easier. Minor change is that the script now can take a URL to the pull request as well as just the PR ID.	2024-11-16 19:14:56 +01:00
Fangrui Song	47e6673006	[ELF] Replace toString(RelType) with operator<< while using ELFSyncStream	2024-11-16 10:12:08 -08:00
Adrian Vogelsgesang	4f48a81a62	[lldb-dap] Support column breakpoints (#113787 ) This commit adds support for column breakpoints to lldb-dap. To do so, support for the `breakpointLocations` request was added. To find all available breakpoint positions, we iterate over the line table. The `setBreakpoints` request already forwarded the column correctly to `SBTarget::BreakpointCreateByLocation`. However, `SourceBreakpointMap` did not keep track of multiple breakpoints in the same line. To do so, the `SourceBreakpointMap` is now indexed by line+column instead of by line only. See http://jonasdevlieghere.com/post/lldb-column-breakpoints/ for a high-level introduction to column breakpoints.	2024-11-16 19:01:12 +01:00
LLVM GN Syncbot	15ca79998a	[gn build] Port 764834d63d5b	2024-11-16 17:25:47 +00:00
Nikolas Klauser	764834d63d	[libc++] Remove <istream> and <ostream> includes from <iomanip> (#116223 ) This reduces the include time of `<filesystem>` by ~50ms.	2024-11-16 18:20:55 +01:00
Nikolas Klauser	1636580b0a	[libc++] Avoid including <ostream> in <fstream> and <strstream> (#116014 ) This reduces the include time of `<fstream>` from ~800ms to ~500ms.	2024-11-16 18:19:17 +01:00
Maurice Heumann	71b3b32c6e	[Clang] [MS] Add /Zc:tlsGuards option to control tls guard emission (#113830 ) This adds an option to control whether guards for on-demand TLS initialization in combination with Microsoft's CXX ABI are emitted or not. The behaviour should match with Microsoft: https://learn.microsoft.com/en-us/cpp/build/reference/zc-tlsguards?view=msvc-170 This fixes #103484	2024-11-16 17:15:47 +01:00
GkvJwa	b88e938b1f	[NFC] Remove whitespaces in `llvm/CMakeLists.txt`	2024-11-16 23:51:57 +08:00
Kazu Hirata	46d750be2e	[Sema] Remove unused includes (NFC) (#116461 ) Identified with misc-include-cleaner.	2024-11-16 07:37:33 -08:00
Kazu Hirata	e8a6624325	[CodeGen] Remove unused includes (NFC) (#116459 ) Identified with misc-include-cleaner.	2024-11-16 07:37:13 -08:00
Florian Hahn	c95daac4c1	[LangRef] Spell out alias attribute/metadata violations are UB. (#116220 ) Update the documentation for the noalias attribute, !alias.scope and !loop.parallel_accesses metadata to clarify they are UB on voilation the noalias property. PR: https://github.com/llvm/llvm-project/pull/116220 --------- Co-authored-by: Nuno Lopes <nuno.lopes@tecnico.ulisboa.pt>	2024-11-16 13:38:58 +00:00
Simon Pilgrim	51809e4a26	[DAG] SimplifyDemandedVectorElts - add SimplifyMultipleUse handling to SEXT/ZEXT/TRUNC nodes (#116227 ) Allows us to bypass multiple uses of a SEXT/ZEXT/TRUNC node operand	2024-11-16 12:40:42 +00:00
agozillon	e508bacce4	[Flang][OpenMP] Derived type explicit allocatable member mapping (#113557 ) This PR is one of 3 in a PR stack, this is the primary change set which seeks to extend the current derived type explicit member mapping support to handle descriptor member mapping at arbitrary levels of nesting. The PR stack seems to do this reasonably (from testing so far) but as you can create quite complex mappings with derived types (in particular when adding allocatable derived types or arrays of allocatable derived types) I imagine there will be hiccups, which I am more than happy to address. There will also be further extensions to this work to handle the implicit auto-magical mapping of descriptor members in derived types and a few other changes planned for the future (with some ideas on optimizing things). The changes in this PR primarily occur in the OpenMP lowering and the OMPMapInfoFinalization pass. In the OpenMP lowering several utility functions were added or extended to support the generation of appropriate intermediate member mappings which are currently required when the parent (or multiple parents) of a mapped member are descriptor types. We need to map the entirety of these types or do a "deep copy" for lack of a better term, where we map both the base address and the descriptor as without the copying of both of these we lack the information in the case of the descriptor to access the member or attach the pointers data to the pointer and in the latter case we require the base address to map the chunk of data. Currently we do not segment descriptor based derived types as we do with regular non-descriptor derived types, we effectively map their entirety in all cases at the moment, I hope to address this at some point in the future as it adds a fair bit of a performance penalty to having nestings of allocatable derived types as an example. The process of mapping all intermediate descriptor members in a members path only occurs if a member has an allocatable or object parent in its symbol path or the member itself is a member or allocatable. This occurs in the createParentSymAndGenIntermediateMaps function, which will also generate the appropriate address for the allocatable member within the derived type to use as a the varPtr field of the map (for intermediate allocatable maps and final allocatable mappings). In this case it's necessary as we can't utilise the usual Fortran::lower functionality such as gatherDataOperandAddrAndBounds without causing issues later in the lowering due to extra allocas being spawned which seem to affect the pointer attachment (at least this is my current assumption, it results in memory access errors on the device due to incorrect map information generation). This is similar to why we do not use the MLIR value generated for this and utilise the original symbol provided when mapping descriptor types external to derived types. Hopefully this can be rectified in the future so this function can be simplified and more closely aligned to the other type mappings. We also make use of fir::CoordinateOp as opposed to the HLFIR version as the HLFIR version doesn't support the appropriate lowering to FIR necessary at the moment, we also cannot use a single CoordinateOp (similarly to a single GEP) as when we index through a descriptor operation (BoxType) we encounter issues later in the lowering, however in either case we need access to intermediate descriptors so individual CoordinateOp's aid this (although, being able to compress them into a smaller amount of CoordinateOp's may simplify the IR and perhaps result in a better end product, something to consider for the future). The other large change area was in the OMPMapInfoFinalization pass, where the pass had to be extended to support the expansion of box types (or multiple nestings of box types) within derived types, or box type derived types. This requires expanding each BoxType mapping from one into two maps and then modifying all of the existing member indices of the overarching parent mapping to account for the addition of these new members alongside adjusting the existing member indices to support the addition of these new maps which extend the original member indices (as a base address of a box type is currently considered a member of the box type at a position of 0 as when lowered to LLVM-IR it's a pointer contained at this position in the descriptor type, however, this means extending mapped children of this expanded descriptor type to additionally incorporate the new member index in the correct location in its own index list). I believe there is a reasonable amount of comments that should aid in understanding this better, alongside the test alterations for the pass. A subset of the changes were also aimed at making some of the utilities for packing and unpacking the DenseIntElementsAttr containing the member indices shareable across the lowering and OMPMapInfoFinalization, this required moving some functions to the Lower/Support/Utils.h header, and transforming the lowering structure containing the member index data into something more similar to the version used in OMPMapInfoFinalization. There we also some other attempts at tidying things up in relation to the member index data generation in the lowering, some of which required creating a logical operator for the OpenMP ID class so it can be utilised as a map key (it simply utilises the symbol address for the moment as ordering isn't particularly important). Otherwise I have added a set of new tests encompassing some of the mappings currently supported by this PR (unfortunately as you can have arbitrary nestings of all shapes and types it's not very feasible to cover them all).	2024-11-16 12:28:37 +01:00
agozillon	b5db75bfce	[OpenMP][MLIR] Descriptor explicit member map lowering changes (#113556 ) This is one of 3 PRs in a PR stack that aims to add support for explicit mapping of allocatable members in derived types. The primary changes in this PR are the OpenMPToLLVMIRTranslation.cpp changes, which are small and seek to alter the current member mapping to add an additional map insertion for pointers. Effectively, if the member is a pointer (currently indicated by having a varPtrPtr field) we add an additional map for the pointer and then alter the subsequent mapping of the member (the data) to utilise the member rather than the parents base pointer. This appears to be necessary in certain cases when mapping pointer data within record types to avoid segfaulting on device (due to incorrect data mapping). In general this record type mapping may be simplifiable in the future. There are also additions of tests which should help to showcase the affect of the changes above.	2024-11-16 12:26:29 +01:00
agozillon	3723449955	[OpenMP] Allocatable explicit member mapping fortran offloading tests (#113555 ) This PR is one in a series of 3 that aim to add support for explicit member mapping of allocatable components in derived types within OpenMP+Fortran for Flang. This PR provides all of the runtime tests that are currently upstreamable, unfortunately some of the other tests would require linking of the fortran runtime for offload which we currently do not do. But regardless, this is plenty to ensure that the mapping is working in most cases.	2024-11-16 12:22:33 +01:00
Louis Dionne	0fd6f684b9	[libc++] Adjust workflow file for building the libc++ docker image (#116366 )	2024-11-16 12:05:12 +01:00
David Green	100376a2fa	[AArch64] Add a test for phis of different types. NFC	2024-11-16 10:40:06 +00:00
Serge Pavlov	f97f96492d	[GlobalISel][ARM] Legalize reset_fpmode (#115859 ) Implement lowering intrinsic `reset_fpmode` in Global Selector for ARM target.	2024-11-16 17:21:33 +07:00
Sergei Barannikov	b69f646c46	[AArch64] Remove unused SDNodes (NFC) (#116236 ) The corresponding enum members were only used by `EmitMOPS`, which immediately translated them to machine opcodes. Just pass the machine opcodes instead.	2024-11-16 13:14:42 +03:00
Jay Foad	89cb0eefcb	[AMDGPU] Move GCNPreRAOptimizations after MachineScheduler (#116211 ) This is in preparation for adding a new optimization to the pass that cares about the order of instructions. The existing optimization does not care, so this just causes minor codegen differences.	2024-11-16 09:40:46 +00:00
Martin Storsjö	dc3156d8e6	[OpenMP] Don't hardcode _WIN32_WINNT for MinGW targets (#115708 ) Instead respect what the toolchain default is (or what the user sets via CMAKE_CXX_FLAGS). This fixes builds with libcxx, with mingw toolchains targeting msvcrt.dll, after 5d8be4c036aa5ce4a94f1f37a9155d5c877e23db; after that commit, the libcxx public headers reference symbols such as iswspace_l, which are unavailable when targeting msvcrt.dll on older versions of Windows (it's only available in msvcrt.dll since Windows Vista).	2024-11-16 11:23:15 +02:00
Kunwar Grover	db115ba3ef	[mlir][Linalg] Fix non-matmul linalg structured ops (#116412 ) `3ad0148020` broke linalg structured ops other than MatmulOp. The patch: - Changes the printer to hide additional attributes, which weren't hidden before: "indexing_maps". - Changes the build of every linalg structured op to have an indexing map for matmul. These changes combined, hide the problem until you print the operation in it's generic form. Reproducer: ```mlir func.func public @bug(%arg0 : tensor<5x10x20xf32>, %arg1 : tensor<5x20x40xf32>, %arg3 : tensor<5x10x40xf32>) -> tensor<5x10x40xf32> { %out = linalg.batch_matmul ins(%arg0, %arg1 : tensor<5x10x20xf32>, tensor<5x20x40xf32>) outs(%arg3 : tensor<5x10x40xf32>) -> tensor<5x10x40xf32> func.return %out : tensor<5x10x40xf32> } ``` Prints fine, with `mlir-opt <file>`, but if you do `mlir-opt --mlir-print-op-generic <file>`: ``` #map = affine_map<(d0, d1, d2) -> (d0, d2)> #map1 = affine_map<(d0, d1, d2) -> (d2, d1)> #map2 = affine_map<(d0, d1, d2) -> (d0, d1)> #map3 = affine_map<(d0, d1, d2, d3) -> (d0, d1, d3)> #map4 = affine_map<(d0, d1, d2, d3) -> (d0, d3, d2)> #map5 = affine_map<(d0, d1, d2, d3) -> (d0, d1, d2)> "builtin.module"() ({ "func.func"() <{function_type = (tensor<5x10x20xf32>, tensor<5x20x40xf32>, tensor<5x10x40xf32>) -> tensor<5x10x40xf32>, sym_name = "bug", sym_visibility = "public"}> ({ ^bb0(%arg0: tensor<5x10x20xf32>, %arg1: tensor<5x20x40xf32>, %arg2: tensor<5x10x40xf32>): %0 = "linalg.batch_matmul"(%arg0, %arg1, %arg2) <{operandSegmentSizes = array<i32: 2, 1>}> ({ ^bb0(%arg3: f32, %arg4: f32, %arg5: f32): %1 = "arith.mulf"(%arg3, %arg4) <{fastmath = #arith.fastmath<none>}> : (f32, f32) -> f32 %2 = "arith.addf"(%arg5, %1) <{fastmath = #arith.fastmath<none>}> : (f32, f32) -> f32 "linalg.yield"(%2) : (f32) -> () }) {indexing_maps = [#map, #map1, #map2], linalg.memoized_indexing_maps = [#map3, #map4, #map5]} : (tensor<5x10x20xf32>, tensor<5x20x40xf32>, tensor<5x10x40xf32>) -> tensor<5x10x40xf32> "func.return"(%0) : (tensor<5x10x40xf32>) -> () }) : () -> () }) : () -> () ``` The batch_matmul operation's builder now always inserts a indexing_map which is unrelated to the operation itself. This was caught when a transformation from one LinalgStructuredOp to another, tried to pass it's attributes to the other ops builder and there were multiple indexing_map attributes in the result. This patch fixes this by specializing the builders for MatmulOp with indexing map information.	2024-11-16 08:13:10 +00:00
Thorsten Schütt	2906fcadb8	[GlobalISel] Combine G_MERGE_VALUES of x and zero (#116283 ) into zext x LegalizerHelper has two padding strategies: undef or zero. see LegalizerHelper:273 see LegalizerHelper:315 This PR is about zero sugar and Coke Zero. ; CHECK-NEXT: [[MV2:%[0-9]+]]:_(s64) = G_MERGE_VALUES %a(s32), [[C]](s32) Please continue padding merge values. // %bits_8_15:(s8) = G_CONSTANT i8 0 // %0:(s16) = G_MERGE_VALUES %bits_0_7:(s8), %bits_8_15:(s8) %bits_8_15 is defined by zero. For optimization, we pick zext. // %0:_(s16) = G_ZEXT %bits_0_7:(s8) The upper bits of %0 are zero and the lower bits come from %bits_0_7.	2024-11-16 08:00:21 +01:00
Julian Schmidt	ec0a27f658	Revert "Reland: [clang][test] add testing for the AST matcher reference" (#116477 ) Reverts llvm/llvm-project#112168	2024-11-16 07:34:20 +01:00
Valentin Clement	42be165dde	Reland '[flang][cuda] Specialize entry point for scalar to desc data transfer'	2024-11-15 19:13:55 -08:00
Matthias Springer	309c890921	[llvm] `APFloat`: Add helpers to query NaN/inf semantics (#116315 ) `APFloat` changes extracted from #116176 as per reviewer comments.	2024-11-16 11:48:05 +09:00
Valentin Clement (バレンタインクレメン)	70b9440c88	Revert "[flang][cuda] Specialize entry point for scalar to desc data transfer" (#116458 ) Reverts llvm/llvm-project#116457	2024-11-15 17:44:48 -08:00
Valentin Clement (バレンタインクレメン)	43cb424a54	[flang][cuda] Specialize entry point for scalar to desc data transfer (#116457 ) The runtime Assign function is not meant to initialize an array from a scalar. For that we need to use DoAssignFromSource. Update the data transfer from scalar to descriptor to use a new entry point that use this function underneath.	2024-11-15 17:41:23 -08:00
Kyungwoo Lee	ab27253ad3	[CGData][lld-macho] Merge CG Data by LLD (#112674 ) LLD now processes raw CG data for stable functions, similar to how it handles raw CG data for the outliner's hash tree. This data is encoded in the custom section (`__llvm_merge`) within object files. LLD merges this information into the indexed CG data file specified by the `-codegen-data-generate-path={path}` option. For the linker that does not support this feature, we could use `llvm-cgdata` tool -- https://github.com/llvm/llvm-project/blob/main/llvm/docs/CommandGuide/llvm-cgdata.rst. Depends on #115750. This is a patch for https://discourse.llvm.org/t/rfc-global-function-merging/82608.	2024-11-15 17:24:35 -08:00
Craig Topper	6a0905d11e	[RISCV][GISel] Add isel patterns for i16 load/store (#116293 ) In order to support f16 load/store we need to make load/stores with s16 register type legal. If regbank selection doesn't pick the FPR bank, we'll be left with a GPR load or store which we don't have isel patterns for from SelectionDAG. In order to add the patterns we need to make i16 a legal type for the GPR register class. Tests are currently disabling the legality check because I haven't update the legalizer yet.	2024-11-15 17:23:46 -08:00
Craig Topper	131d73ed34	[RegAlloc] Remove redundant prints of LiveInterval weight. (#116451 ) LiveInterval::print has included the weight since early 2018. We don't need to print again after we print the interval.	2024-11-15 16:43:30 -08:00
vporpo	1be9827754	[SandboxVec][BottomUpVec] Implement packing of vectors (#116447 ) Up until now we could only support packing of scalar elements. This patch fixes this by implementing packing of vector elements, by generating extractelement and insertelement instruction pairs.	2024-11-15 16:12:22 -08:00
Kazu Hirata	0d38f64e7d	[memprof] Remove MemProf format Version 0 (#116442 ) This patch removes MemProf format Version 0 now that version 2 and 3 seem to be working well. I'm not touching version 1 for now because some tests still rely on version 1. Note that Version 0 is identical to Version 1 except that the MemProf section of the indexed format has a MemProf version field.	2024-11-15 15:37:00 -08:00

1 2 3 4 5 ...

518451 Commits