llvm-project

Author	SHA1	Message	Date
Slava Zakharin	d9527be914	[NFC][flang] Added engineering option for triaging local-alloc-tbaa. (#149587 ) I triaged a benchmark that showed inaccurate results, when local-alloc-tbaa was enabled. It turned out to be not a real TBAA issue, but rather TBAA affecting optimizations that affect FMA generation, which introduced an expected accuracy variation. I would like to keep this threshold control for future uses.	2025-07-21 10:57:35 -07:00
Krzysztof Parzyszek	2aa1e54fa1	[flang][OpenMP] Parse OpenMP 6.0 map modifiers (#149134 ) OpenMP 6.0 has changed the modifiers on the MAP clause: - map-type-modifier has been split into individual modifiers, - map-type "delete" has become a modifier, - new modifiers have been added. This patch adds parsing support for all of the OpenMP 6.0 modifiers. The old "map-type-modifier" is retained, but is no longer created in parsing. It will remain to take advantage of the preexisting modifier validation for older versions: when the OpenMP version is < 6.0, the modifiers will be rewritten back as map-type-modifiers (or map- type in case of "delete"). In this patch the modifiers will always be rewritten in the older format to isolate these changes to parsing as much as possible.	2025-07-21 10:55:37 -05:00
Michael Kruse	ed2bfd1325	[Flang] Add -fintrinsic-modules-path= alias (#149417 ) Add the syntax `-fintrinsic-modules-path=<dir>` as an alias to the existing option `-fintrinsic-modules-path <dir>`. gfortran also supports both alternatives. This is particularly useful with CMake which de-duplicates command-line options. For instance, `-fintrinsic-modules-path /path/A -fintrinsic-modules-path /path/B` is de-duplicated to `-fintrinsic-modules-path /path/A /path/B` since it conisiders the second `-fintrinsic-modules-path` "redundant". This can be avoided using the syntax `-fintrinsic-modules-path=/path/A -fintrinsic-modules-path=/path/B`.	2025-07-21 17:18:33 +02:00
Koakuma	38fc453afd	[SPARC][Driver] Move feature mode selection to Arch/Sparc.cpp (#149652 ) This is so that it's performed also for flang and not just for clang. This should fix https://github.com/llvm/llvm-project/issues/138494.	2025-07-20 21:33:06 -04:00
Kelvin Li	2e67dcfdcd	[flang] update ppc lit tests after using vector.insert and vector.extract (NFC) (#148775 ) See https://github.com/llvm/llvm-project/pull/143272	2025-07-18 14:43:15 -07:00
Andre Kuhlenschmidt	abdd4536ce	[flang][openacc] fix bugs with default(none) checking (#149220 ) A report of the following code not generating an error led to fixing two bugs in directive checking. - We should treat CombinedConstructs as OpenACC Constructs - We should treat DoConstruct index variables as private. ```fortran subroutine sub(nn) integer :: nn, ii !$acc serial loop default(none) do ii = 1, nn end do !$acc end serial loop end subroutine ``` Here `nn` should be flagged as needing a data clause while `ii` should still get one implicitly.	2025-07-18 13:50:09 -07:00
Peter Klausler	9e5b2fbe86	[flang][runtime] Preserve type when remapping monomorphic pointers (#149427 ) Pointer remappings unconditionally update the element byte size and derived type of the pointer's descriptor. This is okay when the pointer is polymorphic, but not when a pointer is associated with an extended type. To communicate this monomorphic case to the runtime, add a new entry point so as to not break forward binary compatibility.	2025-07-18 13:45:05 -07:00
Eugene Epshteyn	2c2567da95	[flang] Fixed a crash with undeclared variable in implicit-do loop (#149513 ) Fixed a crash in the following example: ``` subroutine sub() implicit none print *, (i, i = 1, 2) ! Problem: using undefined var in implied-do loop end subroutine sub ``` The error message was already generated, but the compiler crashed before it could display it.	2025-07-18 12:58:09 -04:00
Kelvin Li	ff5f3ae02a	[flang] convert program name to upper case (NFC) (#149508 )	2025-07-18 11:20:00 -04:00
Valentin Clement (バレンタインクレメン)	6c63316ee1	[flang][cuda] Support device component in a pointer or allocatable derived-type (#149418 )	2025-07-18 04:17:15 -07:00
Kelvin Li	df56b1a2cf	[flang] handle allocation of zero-sized objects (#149165 ) This PR handles the allocation of zero-sized objects for different implementations. One byte is allocated for the zero-sized objects.	2025-07-17 23:52:48 -04:00
Connector Switch	cd6311b50d	[flang] Implement `COSPI` (#149343 ) This feature is added in the Fortran 2023 standard.	2025-07-18 08:09:38 +08:00
Krzysztof Parzyszek	73d4cea68c	[flang][OpenMP] Generalize isOpenMPPrivatizingConstruct (#148654 ) Instead of treating all block and all loop constructs as privatizing, actually check if the construct allows a privatizing clause.	2025-07-17 13:41:04 -05:00
Eugene Epshteyn	413e71b700	[flang] Main program symbol no longer conflicts with the other symbols (#149169 ) The following code is now accepted: ``` module m end program m use m end ``` The PROGRAM name doesn't really have an effect on the compilation result, so it shouldn't result in symbol name conflicts. This change makes the main program symbol name all uppercase in the cooked character stream. This makes it distinct from all other symbol names that are all lowercase in cooked character stream. Modified the tests that were checking for lower case main program name.	2025-07-17 14:18:21 -04:00
delaram-talaashrafi	0dae924c1f	[openacc][flang] Support two type bindName representation in acc routine (#149147 ) Based on the OpenACC specification — which states that if the bind name is given as an identifier it should be resolved according to the compiled language, and if given as a string it should be used unmodified — we introduce two distinct `bindName` representations for `acc routine` to handle each case appropriately: one as an array of `SymbolRefAttr` for identifiers and another as an array of `StringAttr` for strings. To ensure correct correspondence between bind names and devices, this patch also introduces two separate sets of device attributes. The routine operation is extended accordingly, along with the necessary updates to the OpenACC dialect and its lowering.	2025-07-17 09:38:02 -07:00
nvptm	a7f595efd8	[flang][acc] Create UseDeviceOp for both results of hlfir.declare (#148017 ) A sample such as ``` program test integer :: N = 100 real*8 :: b(-1:N) !$acc data copy(b) !$acc host_data use_device(b) call vadd(b) !$acc end host_data !$acc end data end ``` is lowered to ``` %13:2 = hlfir.declare %11(%12) {uniq_name = "_QFEb"} : (!fir.ref<!fir.array<?xf64>>, !fir.shapeshift<1>) -> (!fir.box<!fir.array<?xf64>>, !fir.ref<!fir.array<?xf64>>) %14 = acc.copyin var(%13#0 : !fir.box<!fir.array<?xf64>>) -> !fir.box<!fir.array<?xf64>> {dataClause = #acc<data_clause acc_copy>, name = "b"} acc.data dataOperands(%14 : !fir.box<!fir.array<?xf64>>) { %15 = acc.use_device var(%13#0 : !fir.box<!fir.array<?xf64>>) -> !fir.box<!fir.array<?xf64>> {name = "b"} acc.host_data dataOperands(%15 : !fir.box<!fir.array<?xf64>>) { fir.call @_QPvadd(%13#1) fastmath<contract> : (!fir.ref<!fir.array<?xf64>>) -> () acc.terminator } acc.terminator } acc.copyout accVar(%14 : !fir.box<!fir.array<?xf64>>) to var(%13#0 : !fir.box<!fir.array<?xf64>>) {dataClause = #acc<data_clause acc_copy>, name = "b"} ``` Note that while the use_device clause is applied to %13#0, the argument passed to vadd is %13#1. To avoid problems later in lowering, this change additionally applies the use_device clause to %13#1, so that the resulting MLIR is ``` %13:2 = hlfir.declare %11(%12) {uniq_name = "_QFEb"} : (!fir.ref<!fir.array<?xf64>>, !fir.shapeshift<1>) -> (!fir.box<!fir.array<?xf64>>, !fir.ref<!fir.array<?xf64>>) %14 = acc.copyin var(%13#0 : !fir.box<!fir.array<?xf64>>) -> !fir.box<!fir.array<?xf64>> {dataClause = #acc<data_clause acc_copy>, name = "b"} acc.data dataOperands(%14 : !fir.box<!fir.array<?xf64>>) { %15 = acc.use_device var(%13#0 : !fir.box<!fir.array<?xf64>>) -> !fir.box<!fir.array<?xf64>> {name = "b"} %16 = acc.use_device varPtr(%13#1 : !fir.ref<!fir.array<?xf64>>) -> !fir.ref<!fir.array<?xf64>> {name = "b"} acc.host_data dataOperands(%15, %16 : !fir.box<!fir.array<?xf64>>, !fir.ref<!fir.array<?xf64>>) { fir.call @_QPvadd(%13#1) fastmath<contract> : (!fir.ref<!fir.array<?xf64>>) -> () acc.terminator } acc.terminator } acc.copyout accVar(%14 : !fir.box<!fir.array<?xf64>>) to var(%13#0 : !fir.box<!fir.array<?xf64>>) {dataClause = #acc<data_clause acc_copy>, name = "b"} ```	2025-07-17 09:04:44 -07:00
Valentin Clement (バレンタインクレメン)	34951f7de8	[flang][cuda] Use NVVM op for clock64 (#149223 )	2025-07-16 18:12:18 -07:00
Valentin Clement (バレンタインクレメン)	4cf7670b01	[flang][cuda] Lower clock() to NNVM op (#149228 ) Also use a same gen function for all NVVM time ops.	2025-07-16 17:24:17 -07:00
Valentin Clement (バレンタインクレメン)	210cf010c3	[flang][cuda] Lower globaltimer to NVVM op (#149217 )	2025-07-16 17:02:43 -07:00
Valentin Clement (バレンタインクレメン)	1e4e2b332d	[flang][cuda] Import type descriptor in the gpu module when needed (#149157 )	2025-07-16 14:12:27 -07:00
Andre Kuhlenschmidt	b3c72a97c5	[flang][driver] -Werror promotes warnings to error and interopts with -Wfatal-errors (#148748 ) This PR changes how `-Werror` promotes warnings to errors so that it interoperates with `-Wfatal-error`. It maintains the property that warnings and other messages promoted to errors are displayed as there original message.	2025-07-16 12:26:38 -07:00
Slava Zakharin	bcee18a2e2	[flang] Handle SEQUENCE derived types for array repacking. (#148777 ) It is possible that a non-polymorphic dummy argument has a dynamic type that does not match its static type in a valid Fortran program, e.g. when the actual and the dummy arguments have different compatible derived SEQUENCE types: module mod type t sequence integer x end type contains subroutine test(x) type t sequence integer x end type type(t) :: x(:) end subroutine end module 'test' may be called with an actual argument of type 'mod::t', which is the dynamic type of 'x' on entry to 'test'. If we create the repacking temporary based on the static type of 'x' ('test::t'), then the runtime will report the types mismatch as an error. Thus, we have to create the temporary using the dynamic type of 'x'. The fact that the dummy's type has SEQUENCE or BIND attribute is not easily computable at this stage, so we use the dynamic type for all derived type cases. As long as this is done only when the repacking actually happens, the overhead should not be noticeable.	2025-07-16 12:11:15 -07:00
Peter Klausler	3de11b7062	[flang] Catch bad members of BIND(C) COMMON block (#148971 ) Variables that can't be BIND(C), like pointers, can't be in a BIND(C) common block, either. Fixes https://github.com/llvm/llvm-project/issues/148922.	2025-07-16 09:10:26 -07:00
Peter Klausler	52a46dc57f	[flang] Allow -fdefault-integer-8 with defined I/O (#148927 ) Defined I/O subroutines have UNIT= and IOSTAT= dummy arguments that are required to have type INTEGER with its default kind. When that default kind is modified via -fdefault-integer-8, calls to defined I/O subroutines from the runtime don't work. Add a flag to the two data structures shared between the compiler and the runtime support library to indicate that a defined I/O subroutine was compiled under -fdefault-integer-8. This has been done in a compatible manner, so that existing binaries are compatible with the new library and new binaries are compatible with the old library, unless of course -fdefault-integer-8 is used. Fixes https://github.com/llvm/llvm-project/issues/148638.	2025-07-16 09:09:49 -07:00
Peter Klausler	fc7f9d795d	[flang] Better error message for ambiguous ASSIGNMENT(=) (#148720 ) When a type-bound generic ASSIGNMENT(=) procedure is ambiguous for a particular reference, say so, rather than claiming that no specific procedure matched the types and ranks of the LHS and RHS. Fixes https://github.com/llvm/llvm-project/issues/148675.	2025-07-16 09:09:29 -07:00
Peter Klausler	6e0b0ec66a	[flang] Fix crash in Semantics (#148706 ) Allow for renaming in USE association of Cray pointers. Fixes https://github.com/llvm/llvm-project/issues/148559.	2025-07-16 09:09:05 -07:00
Akash Banerjee	1742966c0d	[Flang] Force lowering to Complex for AMDGPU (#144927 )	2025-07-16 13:59:20 +01:00
Valentin Clement (バレンタインクレメン)	b4e2272271	[flang][cuda] Move cuf.set_allocator_idx after derived-type init (#148936 ) Derived type initialization overwrite the component descriptor. Place the `cuf.set_allocator_idx` after the initialization is performed.	2025-07-15 13:52:00 -07:00
Tom Eccles	c2e41be50f	[flang][Parser] Add whitespace token after the sentinel in fixed form (#148825 ) Fixes #148386 The first time the line was classified (using `Prescanner::ClassifyLine(const char *)`) the line was correctly classified as a compiler directive. But then later on the token form is invoked (`Prescanner::ClassifyLine(TokenSequence, Provenance)`). This one incorrectly classified the line as a comment because there was no whitespace token right after the sentinel. This fixes the issue by ensuring this whitespace is added.	2025-07-15 16:39:34 +01:00
Krzysztof Parzyszek	51b6f64b89	[flang][OpenMP] Avoid unnecessary parsing of OpenMP constructs (#148629 ) When parsing a specification part, the parser will look ahead to see if the next construct is an executable construct. In doing so it will invoke OpenMPConstruct parser, whereas the only necessary thing to check would be the directive alone.	2025-07-15 09:05:56 -05:00
Valentin Clement (バレンタインクレメン)	90ef114a33	[flang][cuda] Add cuf.set_allocator_idx for device component (#148750 )	2025-07-14 19:31:44 -07:00
Valentin Clement (バレンタインクレメン)	2c6771889a	[flang][cuda] Introduce cuf.set_allocator_idx operation (#148717 )	2025-07-14 17:23:18 -07:00
Peter Klausler	69f38443e5	[flang] Extension: TRANSFER(boz, integer or real scalar) (#147604 ) Interpret TRANSFER(SOURCE=BOZ literal, MOLD=integer or real scalar) as if it had been a reference to the standard INT or REAL intrinsic functions, for which a BOZ literal is an acceptable argument, with a warning about non-conformance. It's a needless extension that has somehow crept into some other compilers and real applications.	2025-07-14 11:12:50 -07:00
Peter Klausler	4dceb25dd1	[flang] Don't create bogus tokens from token pasting (##) (#147596 ) When blank tokens arise from macro replacement in token sequences with token pasting (##), the preprocessor is producing some bogus tokens (e.g., "name(") that can lead to subtle bugs later when macro names are not recognized as such. The fix is to not paste tokens together when the result would not be a valid Fortran or C token in the preprocessing context.	2025-07-14 11:11:43 -07:00
Slava Zakharin	4775b96898	[flang] Optimize redundant array repacking. (#147881 ) This patch allows optimizing redundant array repacking, when the source array is statically known to be contiguous. This is part of the implementation plan for the array repacking feature, though, it does not affect any real life use case as long as FIR inlining is not a thing. I experimented with simple cases of FIR inling using `-inline-all`, and I recorded these cases in optimize-array-repacking.fir tests.	2025-07-14 09:41:42 -07:00
Slava Zakharin	9e56d0d2ab	[flang] Fixed TBAA tags for derived types with descriptors. (#148093 ) We cannot attach any "data" or "descriptor" tag to accesses of derived types that contain descriptors, because this will make them non-aliasing with any generic "data" or "descriptor" accesses, which is not correct. We have to skip TBAA tags attachment for such accesses same way we do it for boxes.	2025-07-14 09:41:15 -07:00
Slava Zakharin	fc99ef7411	[flang] Allow embox's source_box to be a !fir.box. (#148305 ) In order to create temporary copies of assumed-type arrays (e.g. for `-frepack-arrays`), we have to allow the source_box to be a !fir.box. This patch replaces #147618.	2025-07-14 09:40:42 -07:00
Kareem Ergawy	a716cc0961	[flang] Support `do concurrent ... reduce` for associating names (#148597 ) Extends reduction support for `do concurrent`, in particular, for associating names. Consider the following input: ```fortran subroutine dc_associate_reduce integer :: i real, allocatable, dimension(:) :: x associate(x_associate => x) do concurrent (i = 1:10) reduce(+: x_associate) end do end associate end subroutine ``` The declaration of `x_associate` is emitted as follows: ```mlir %13:2 = hlfir.declare %10(%12) {uniq_name = "...."} : (!fir.heap<!fir.array<?xf32>>, !fir.shapeshift<1>) -> (!fir.box<!fir.array<?xf32>>, !fir.heap<!fir.array<?xf32>>) ``` where the HLFIR base type is an array descriptor (i.e. the allocatable/heap attribute is dropped as stipulated by the spec; section 11.1.3.3). The problem here is that `declare_reduction` ops accept only reference types. This restriction is already partially handled for `fir::BaseBoxType`'s by allocating a stack slot for the descriptor and storing the box in that stack allocation. We have to modify this a littble bit for `associate` since the HLFIR and FIR base types are different (unlike most scenarios).	2025-07-14 12:18:16 +02:00
Tom Eccles	51d3bc67d0	[flang][tco] Add more flags for generating codegen tests (#148171 ) Built on top of https://github.com/llvm/llvm-project/pull/146533 Adding some more options I use to generate minimal testcases for MLIR->LLVMIR conversion. Hopefully this will save us all some time when writing these tests. The following options are added - `-simplify-mlir` runs CSE and canonicalization at the end of the MLIR pipeline - `-enable-aa` allows toggling whether TBAA is enabled - `-test-gen` is shorthand for `-emit-final-mlir -simplify-mlir -enable-aa=false`	2025-07-14 10:24:58 +01:00
Christian Ulmann	374d5da214	[MLIR][Interfaces] Remove negative branch weight verifier (#148234 ) This commit removes the verifier that checked if branch weights are negative. This check was too strict because weights are interpreted as unsigned integers. This showed up when running the verifier on LLVM dialect modules that were imported from LLVM IR.	2025-07-14 07:34:29 +02:00
Valentin Clement (バレンタインクレメン)	aa7bf60618	[flang][cuda] Fix false positive on unsupported CUDA data transfer (#148295 ) The switch to `GetSymbolVector` introduced a regression on detecting implicit data transfer when the rhs is a function call. Make sure the symbol we are looking at are of interest to detect data transfer.	2025-07-11 14:30:57 -07:00
Krzysztof Parzyszek	61a9d2c22d	[flang][OpenMP] Use OmpDirectiveSpecification in DISPATCH (#148008 ) Dispatch is the last construct (after ATOMIC and ALLOCATORS) where the associated block requires a specific form. Using OmpDirectiveSpecification for the begin and the optional end directives will make the structure of all block directives more uniform.	2025-07-11 07:28:54 -05:00
Krzysztof Parzyszek	638943b27e	[flang][OpenMP] Convert AST node for ALLOCATORS to use Block as body (#148005 ) The ALLOCATORS construct is one of the few constructs that require a special form of the associated block. Convert the AST node to use OmpDirectiveSpecification for the directive and the optional end directive, and to use parser::Block as the body: the form of the block is checked in the semantic checks (with a more meaningful message).	2025-07-11 06:45:11 -05:00
Kareem Ergawy	ab1c4905f4	[flang][do concurrent] Extned `getAllocaBlock()` and emit yields correctly (#146853 ) Handles some loose ends in `do concurrent` reduction declarations. This PR extends `getAllocaBlock` to handle declare ops, and also emit `fir.yield` in all regions.	2025-07-11 10:17:17 +02:00
Kareem Ergawy	0e9b7b054c	[flang][OpenMP] Basic mapping of `do concurrent ... reduce` to OpenMP (#146033 ) Now that we have changes introduced by #145837, mapping reductions from `do concurrent` to OpenMP is almost trivial. This PR adds such mapping. PR stack: - https://github.com/llvm/llvm-project/pull/145837 - https://github.com/llvm/llvm-project/pull/146025 - https://github.com/llvm/llvm-project/pull/146028 - https://github.com/llvm/llvm-project/pull/146033 (this one)	2025-07-11 09:19:16 +02:00
Kareem Ergawy	eba35cc1c0	[flang][do concurrent] Re-model `reduce` to match reductions are modelled in OpenMP and OpenACC (#145837 ) This PR proposes re-modelling `reduce` specifiers to match OpenMP and OpenACC. In particular, this PR includes the following: * A new `fir` op: `fir.delcare_reduction` which is identical to OpenMP's `omp.declare_reduction` op. * Updating the `reduce` clause on `fir.do_concurrent.loop` to use the new op. * Re-uses the `ReductionProcessor` component to emit reductions for `do concurrent` just like we do for OpenMP. To do this, the `ReductionProcessor` had to be refactored to be more generalized. * Upates mapping `do concurrent` to `fir.loop ... unordered` nests using the new reduction model. Unfortunately, this is a big PR that would be difficult to divide up in smaller parts because the bottom of the changes are the `fir` table-gen changes to `do concurrent`. However, doing these MLIR changes cascades to the other parts that have to be modified to not break things. This PR goes in the same direction we went for `private/local` speicifiers. Now the `do concurrent` and OpenMP (and OpenACC) dialects are modelled in essentially the same way which makes mapping between them more trivial, hopefully. PR stack: - https://github.com/llvm/llvm-project/pull/145837 (this one) - https://github.com/llvm/llvm-project/pull/146025 - https://github.com/llvm/llvm-project/pull/146028 - https://github.com/llvm/llvm-project/pull/146033	2025-07-11 06:39:30 +02:00
Razvan Lupusoru	4859b92b7f	[flang][acc] Update FIR ref, heap, and pointer to be MappableType (#147834 ) The MappableType OpenACC type interface is a richer interface that allows OpenACC dialect to be capable to better interact with a source dialect, FIR in this case. fir.box and fir.class types already implemented this interface. Now the same is being done with the other FIR types that represent variables. One additional notable change is that fir.array no longer implements this interface. This is because MappableType is primarily intended for variables - and FIR variables of this type have storage associated and thus there's a pointer-like type (fir.ref/heap/pointer) that holds the array type. The end goal of promoting these FIR types to MappableType is that we will soon implement ability to generate recipes outside of the frontend via this interface.	2025-07-10 15:23:57 -07:00
Valentin Clement (バレンタインクレメン)	9a0e03f430	[flang][cuda] Update implicit data transfer for device component (#147882 ) Update the detection of implicit data transfer when a device resident allocatable derived-type component is involved and remove the TODOs.	2025-07-10 09:50:31 -07:00
Krzysztof Parzyszek	9b0ae6ccd6	[flang][OpenMP] Issue a warning when parsing future directive spelling (#147765 ) OpenMP 6.0 introduced alternative spelling for some directives, with the previous spellings still allowed. Warn the user when a new spelling is encountered with OpenMP version set to an older value.	2025-07-10 09:57:03 -05:00
Daniel Paoliello	154de3e1bd	[flang] Don't check the '-mframe-pointer' flag (#147837 ) The `-mframe-pointer` flag is not explicitly set in the original `flang` invocation and so the value passed to `flang -fc1` can vary depending on the host machine, so don't verify it in the output. `-mframe-pointer` forwarding is already verified by `flang/test/Driver/frame-pointer-forwarding.f90`.	2025-07-10 07:27:33 -07:00

... 2 3 4 5 6 ...

6210 Commits