llvm-project

Author	SHA1	Message	Date
Ian McInerney	28d5bc5649	[Flang][Driver] Predefine pic/pie macros based on configured level (#153449 ) Predefine the `__pic__/__pie__/__PIC__/__PIE__` macros based on the configured relocation level. This logic mirrors that of the clang driver, where `__pic__/__PIC__` are defined for both PIC and PIE modes, but `__pie__/__PIE__` are only defined for PIE mode. Fixes https://github.com/llvm/llvm-project/issues/135275	2025-08-14 17:48:20 +01:00
Kajetan Puchalski	d3d96e2057	[flang][OpenMP] Add -f[no]-openmp-simd (#150269 ) Both clang and gfortran support the -fopenmp-simd flag, which enables OpenMP support only for simd constructs, while disabling the rest of OpenMP. Implement the appropriate parse tree rewriting to remove non-SIMD OpenMP constructs at the parsing stage. Add a new SimdOnly flang OpenMP IR pass which rewrites generated OpenMP FIR to handle untangling composite simd constructs, and clean up OpenMP operations leftover after the parse tree rewriting stage. With this approach, the two parts of the logic required to make the flag work can be self-contained within the parse tree rewriter and the MLIR pass, respectively. It does not need to be implemented within the core lowering logic itself. The flag is expected to have no effect if -fopenmp is passed explicitly, and is only expected to remove OpenMP constructs, not things like OpenMP library functions calls. This matches the behaviour of other compilers. --------- Signed-off-by: Kajetan Puchalski <kajetan.puchalski@arm.com>	2025-08-14 14:20:15 +01:00
Michael Kruse	38853a0146	[flang][OpenMP] MSVC buildbot fix PR #153488 caused the msvc build (https://lab.llvm.org/buildbot/#/builders/166/builds/1397) to fail: ``` ..\llvm-project\flang\include\flang/Evaluate/rewrite.h(78): error C2668: 'Fortran::evaluate::rewrite::Identity::operator ()': ambiguous call to overloaded function ..\llvm-project\flang\include\flang/Evaluate/rewrite.h(43): note: could be 'Fortran::evaluate::Expr<Fortran::evaluate::SomeType> Fortran::evaluate::rewrite::Identity::operator ()<Fortran::evaluate::SomeType,S>(Fortran::evaluate::Expr<Fortran::evaluate::SomeType> &&,const U &)' with [ S=Fortran::evaluate::value::Integer<128,true,32,unsigned int,unsigned __int64,128>, U=Fortran::evaluate::value::Integer<128,true,32,unsigned int,unsigned __int64,128> ] ..\llvm-project\flang\lib\Semantics\check-omp-atomic.cpp(174): note: or 'Fortran::evaluate::Expr<Fortran::evaluate::SomeType> Fortran::semantics::ReassocRewriter::operator ()<Fortran::evaluate::SomeType,S,void>(Fortran::evaluate::Expr<Fortran::evaluate::SomeType> &&,const U &,Fortran::semantics::ReassocRewriter::NonIntegralTag)' with [ S=Fortran::evaluate::value::Integer<128,true,32,unsigned int,unsigned __int64,128>, U=Fortran::evaluate::value::Integer<128,true,32,unsigned int,unsigned __int64,128> ] ..\llvm-project\flang\include\flang/Evaluate/rewrite.h(78): note: while trying to match the argument list '(Fortran::evaluate::Expr<Fortran::evaluate::SomeType>, const S)' with [ S=Fortran::evaluate::value::Integer<128,true,32,unsigned int,unsigned __int64,128> ] ..\llvm-project\flang\include\flang/Evaluate/rewrite.h(78): note: the template instantiation context (the oldest one first) is ..\llvm-project\flang\lib\Semantics\check-omp-atomic.cpp(814): note: see reference to function template instantiation 'U Fortran::evaluate::rewrite::Mutator<Fortran::semantics::ReassocRewriter>::operator ()<const Fortran::evaluate::Expr<Fortran::evaluate::SomeType>&,Fortran::evaluate::Expr<Fortran::evaluate::SomeType>>(T)' being compiled with [ U=Fortran::evaluate::Expr<Fortran::evaluate::SomeType>, T=const Fortran::evaluate::Expr<Fortran::evaluate::SomeType> & ] ``` The reason is that there is an ambiguity between operator() of ReassocRewriter itself and operator() of the base class `Identity` through `using Id::operator();`. By the C++ specification, method declarations in ReassocRewriter hide methods with the same signature from a using declaration, but this does not apply to ``` evaluate::Expr<T> operator()(..., NonIntegralTag = {}) ``` which has a different signature due to an additional tag parameter. Since it has a default value, it is ambiguous with operator() without tag parameter. GCC and Clang both accept this, but in my understanding MSVC is correct here. Since the overloads of ReassocRewriter cover all cases (integral and non-integral), removing the using declaration to avoid the ambiguity.	2025-08-14 12:30:59 +02:00
Terapines MLIR	c164e6309b	[flang][fir] Add conversion of `fir.iterate_while` to `scf.while`. (#152439 ) This commmit is a supplement for https://github.com/llvm/llvm-project/pull/140374. RFC:https://discourse.llvm.org/t/rfc-add-fir-affine-optimization-fir-pass-pipeline/86190/6	2025-08-14 13:39:55 +08:00
Valentin Clement (バレンタインクレメン)	f9b9e9b7d5	[flang][cuda] Fix buildbot failure after #153242 (#153500 )	2025-08-13 14:54:33 -07:00
Peter Klausler	442ae603c5	[flang] Warn about inexact real literal implicit widening pitfall (#152799 ) When a REAL or COMPLEX literal appears without an explicit kind suffix or a kind-determining exponent letter, and the conversion of that literal from decimal to binary is inexact, emit a warning if that constant is later implicitly widened to a more precise kind, since it will have a different value than was probably intended. Values that convert exactly from decimal to default real, e.g. 1.0 and 0.125, do not elicit this warning. There are many contexts in which Fortran implicitly converts constants. This patch covers name constant values, variable and component initialization, constants in expressions, structure constructor components, and array constructors. For example, "real(8) :: tenth = 0.1" is a common Fortran bug that's hard to find, and is one that often trips up even experienced Fortran programmers. Unlike C and C++, the literal constant 0.1 is not double precision by default, and it does not have the same value as 0.1d0 or 0.1_8 do when it is converted from decimal to real(4) and then to real(8).	2025-08-13 14:36:13 -07:00
Krzysztof Parzyszek	1e7772abcb	[flang][OpenMP] Reassociate ATOMIC update expressions (#153488 ) An atomic update expression of form x = x + a + b is technically illegal, since the right-hand side is parsed as (x+a)+b, and the atomic variable x should be an argument to the top-level +. When the type of x is integer, the result of (x+a)+b is guaranteed to be the same as x+(a+b), so instead of reporting an error, the compiler can treat (x+a)+b as x+(a+b). This PR implements this kind of reassociation for integral types, and for the two arithmetic associative/commutative operators: + and *. Reinstate PR153098 one more time with fixes for the issues that came up: - unused variable "lsrc", - use of ‘outer1’ before deduction of ‘auto’.	2025-08-13 15:53:59 -05:00
Valentin Clement (バレンタインクレメン)	a2899c457e	[flang][cuda] Support data transfer with conversion (#153242 ) When the rhs of the data transfer is from a different type, allocate a new temp on the host and first transfer the rhs to it. Then, use the elemental op created to do the conversion.	2025-08-13 10:55:15 -07:00
Krzysztof Parzyszek	bb6f16ecd7	Revert "[flang][OpenMP] Reassociate ATOMIC update expressions (#153450 )" This reverts commit 7e125b9892f26d2028c3570f537e5a26f8c94447. The fixes helped gcc-8, but not 7.5.	2025-08-13 12:17:13 -05:00
Krzysztof Parzyszek	7e125b9892	[flang][OpenMP] Reassociate ATOMIC update expressions (#153450 ) An atomic update expression of form x = x + a + b is technically illegal, since the right-hand side is parsed as (x+a)+b, and the atomic variable x should be an argument to the top-level +. When the type of x is integer, the result of (x+a)+b is guaranteed to be the same as x+(a+b), so instead of reporting an error, the compiler can treat (x+a)+b as x+(a+b). This PR implements this kind of reassociation for integral types, and for the two arithmetic associative/commutative operators: + and *. Reinstate PR153098 with fixes for the issues that came up: - unused variable "lsrc", - use of ‘outer1’ before deduction of ‘auto’.	2025-08-13 12:00:05 -05:00
Tom Eccles	d02ad56015	[flang][OpenMP] Catch assumed-rank/size variables for privatization a… (#152764 ) …nd reduction Fixes #152312 Assumed size is valid for privatization and simple examples generate sensible looking HLFIR.	2025-08-13 16:59:17 +01:00
Krzysztof Parzyszek	6b5c38dbbb	Revert "[flang][OpenMP] Reassociate ATOMIC update expressions (#153098 )" This reverts commit 4f6ae2af3563a7eefbe4179eabe10ef5898a5963. This PR causes build breaks with older versions of GCC.	2025-08-13 10:27:53 -05:00
Krzysztof Parzyszek	ed2fa6fd89	Revert "[flang][OpenMP] Fix unused variable warning (#153409 )" This reverts commit 6b20b16b2f796b01bc5c3d5d8013a8e65b40080a. This patch depends on PR153098 which is the one being reverted.	2025-08-13 10:27:53 -05:00
Krzysztof Parzyszek	6b20b16b2f	[flang][OpenMP] Fix unused variable warning (#153409 )	2025-08-13 08:59:20 -05:00
Krzysztof Parzyszek	4f6ae2af35	[flang][OpenMP] Reassociate ATOMIC update expressions (#153098 ) An atomic update expression of form x = x + a + b is technically illegal, since the right-hand side is parsed as (x+a)+b, and the atomic variable x should be an argument to the top-level +. When the type of x is integer, the result of (x+a)+b is guaranteed to be the same as x+(a+b), so instead of reporting an error, the compiler can treat (x+a)+b as x+(a+b). This PR implements this kind of reassociation for integral types, and for the two arithmetic associative/commutative operators: + and *.	2025-08-13 08:07:11 -05:00
yanming	02ab6f358c	[flang][fir][NFC] unify flang's code style with the rest.	2025-08-13 15:11:06 +08:00
Connector Switch	f4dd442395	[flang] Optimize `tanpi` precision (#153215 ) Part of #150452.	2025-08-13 10:07:17 +08:00
Connector Switch	12e0d524bc	[flang] Optimize `sinpi` precision (#153211 ) Part of #150452.	2025-08-13 10:06:29 +08:00
Connector Switch	d9074db137	[flang] Optimize `cospi` precision (#153208 ) Part of #150452.	2025-08-13 10:06:09 +08:00
Connector Switch	4537f0ee61	[flang] Optimize `atanpi` precision (#153207 ) Part of #150452.	2025-08-13 10:05:48 +08:00
Connector Switch	c664ce49e3	[flang] Optimize `asinpi` precision (#153203 ) Part of #150452.	2025-08-13 10:05:25 +08:00
Slava Zakharin	b8e4232bd2	[flang] Cast fir.select[_rank] selector to i64. (#153239 ) Properly cast the selector to `i64` regardless of its integer type. We used to generate llvm.trunc always. We have to use `i64` as long as the case values may exceed INT_MAX. Fixes #153050.	2025-08-12 16:43:44 -07:00
Slava Zakharin	252efeba90	[flang] Change traversal order for OptimizeArrayRepackingPass. (#153136 ) A long chain of fir.pack_arrays might require multiple iterations of the greedy rewriter, if we use down-top traversal. The rewriter may not converge in 10 (default) iterations. It is not an error, but it was reported as such. This patch changes the traversal to top-down and also disabled the hard error, if the rewriter does not converge soon enough.	2025-08-12 13:13:10 -07:00
Akash Banerjee	1c7720ef78	Revert "[MLIR][OpenMP] Add a new AutomapToTargetData conversion pass in FIR (#153048 )" This reverts commit 4e6d510eb3ec5b5e5ea234756ea1f0b283feee4a.	2025-08-12 20:19:45 +01:00
Krzysztof Parzyszek	4b7f3806f6	[flang][OpenMP] Move rewriting of min/max from Lower to Semantics (#153038 ) There semantic analysis of the ATOMIC construct will require additional rewriting (reassociation of certain expressions for user convenience), and that will be driven by diagnoses made in the semantic checks. While the rewriting of min/max is not required to be done in semantic analysis, moving it there will make all rewriting for ATOMIC construct be located in a single location.	2025-08-12 12:13:50 -05:00
Akash Banerjee	4e6d510eb3	[MLIR][OpenMP] Add a new AutomapToTargetData conversion pass in FIR (#153048 ) Add a new AutomapToTargetData pass. This gathers the declare target enter variables which have the AUTOMAP modifier. And adds omp.declare_target_enter/exit mapping directives for fir.alloca and fir.free oeprations on the AUTOMAP enabled variables. Automap Ref: OpenMP 6.0 section 7.9.7.	2025-08-12 15:18:15 +01:00
Kareem Ergawy	31387d69e2	[flang][OpenMP] Don't privatize implicit symbols declare by nested `BLOCK`s (#152973 ) Fixes a bug when a block variable is marked as implicit private. In such case, we can simply ignore privatizing that symbol within the context of the currrent OpenMP construct since the "private" allocation for the symbol will be emitted within the nested block anyway.	2025-08-12 06:27:42 +02:00
Krzysztof Parzyszek	ea14ee4464	[flang][OpenMP] Refactor creating atomic analysis, NFC (#153036 ) Turn it into a class that combines the information and generates the analysis instead of having independent functions do it.	2025-08-11 12:20:51 -05:00
Connector Switch	7b41c2f824	[flang] Optimize `acospi` precision (#152869 ) This patch implements the solution for `acospi` proposed in #150452, improving precision for r16 and removing a redundant cast for r4.	2025-08-12 00:28:42 +08:00
Akash Banerjee	0998da27e9	Revert "[MLIR][OpenMP] Add a new AutomapToTargetData conversion pass in FIR (#151989 )" This reverts commit 5a5e8ba0c388d57aecb359ed67919cda429fc7b1.	2025-08-11 13:52:39 +01:00
Akash Banerjee	5a5e8ba0c3	[MLIR][OpenMP] Add a new AutomapToTargetData conversion pass in FIR (#151989 ) Add a new `AutomapToTargetData` pass. This gathers the declare target enter variables which have the `AUTOMAP` modifier. And adds `omp.declare_target_enter/exit` mapping directives for `fir.allocmem` and `fir.freemem` oeprations on the `AUTOMAP` enabled variables. Automap Ref: OpenMP 6.0 section 7.9.7.	2025-08-11 13:18:38 +01:00
Akash Banerjee	a9f9c7db4f	Fix build error from #151513 .	2025-08-11 13:16:39 +01:00
Akash Banerjee	3b10b9a2b0	[MLIR][OpenMP] Add lowering support for AUTOMAP modifier (#151513 ) Add Automap modifier to the MLIR op definition for the DeclareTarget directive's Enter clause. Also add lowering support in Flang. Automap Ref: OpenMP 6.0 section 7.9.7.	2025-08-11 12:45:22 +01:00
Kareem Ergawy	98ffdf3958	[flang][OpenMP] Don't privatize pre-determined symbols declare by nested `BLOCK`s (#152652 ) Fixes a bug when a block variable is marked as pre-determined private. In such case, we can simply ignore privatizing that symbol within the context of the currrent OpenMP construct since the "private" allocation for the symbol will be emitted within the nested block anyway.	2025-08-11 10:21:07 +02:00
Nikita Popov	c23b4fbdbb	[IR] Remove size argument from lifetime intrinsics (#150248 ) Now that #149310 has restricted lifetime intrinsics to only work on allocas, we can also drop the explicit size argument. Instead, the size is implied by the alloca. This removes the ability to only mark a prefix of an alloca alive/dead. We never used that capability, so we should remove the need to handle that possibility everywhere (though many key places, including stack coloring, did not actually respect this).	2025-08-08 11:09:34 +02:00
Kareem Ergawy	d7d0d7a80f	[flang] Skip processing reductions for unstructured `do concurrent` loops (#150188 ) Fixes #149563 When emitting unstructured `do concurrent` loops, reduction processing should be skipped since we are not emitting `fir.do_concurrent` loop in the first place.	2025-08-08 06:22:53 +02:00
Valentin Clement (バレンタインクレメン)	4394a0ca4a	[flang][cuda] Fix detection of implicit data transfer with a global (#152604 )	2025-08-07 17:39:22 -07:00
Krzysztof Parzyszek	bc814348ec	[flang][OpenMP] Break up ResolveOmpObject for readability, NFC (#151957 ) The function ResolveOmpObject had a lot of highly-indented code in two variant visitors. Extract the visitors into their own functions, and reformat the code. Replace !(\|\|) with !&&! in a couple of places to make the formatting a bit nicer. Use llvm::enumerate instead of manually maintaining iteration index.	2025-08-07 15:29:51 -05:00
Krzysztof Parzyszek	e368b5343d	[flang][OpenMP] Make OpenMPCriticalConstruct follow block structure (#152007 ) This allows not having the END CRITICAL directive in certain situations. Update semantic checks and symbol resolution.	2025-08-07 08:10:25 -05:00
Valentin Clement (バレンタインクレメン)	35f003d13b	[flang][cuda] Fix buildbot after #152418 (#152437 )	2025-08-06 22:24:35 -07:00
Valentin Clement (バレンタインクレメン)	eb0ddba26b	Reland "[flang][cuda] Set the allocator of derived type component after allocation" (#152418 ) Reviewed in #152379 - Move the allocator index set up after the allocate statement otherwise the derived type descriptor is not allocated. - Support array of derived-type with device component	2025-08-06 21:49:55 -07:00
Valentin Clement (バレンタインクレメン)	a196281896	[flang][cuda] Remove meaningless warning on CUDA shared arguments (#152404 ) The warning in issued during the compatibility check makes little sense. Just remove it as it is confusing.	2025-08-06 18:50:07 -07:00
Valentin Clement (バレンタインクレメン)	2696e8c149	[flang][cuda] Remove too restrictive assert for data transfer (#152398 ) When the rhs is a an array element, the assert was triggered but this is still a valid transfer. Remove the assert. The operation has a verifier to check its validity.	2025-08-06 18:49:52 -07:00
Valentin Clement (バレンタインクレメン)	7d3134f6cc	Revert "[flang][cuda] Set the allocator of derived type component after allocation" (#152402 ) Reverts llvm/llvm-project#152379 Buildbot failure https://lab.llvm.org/buildbot/#/builders/207/builds/4905	2025-08-06 15:55:53 -07:00
Valentin Clement (バレンタインクレメン)	d897355876	[flang][cuda] Set the allocator of derived type component after allocation (#152379 ) - Move the allocator index set up after the allocate statement otherwise the derived type descriptor is not allocated. - Support array of derived-type with device component	2025-08-06 15:14:00 -07:00
Krzysztof Parzyszek	1d23005b8e	[flang][OpenMP] Insert CRITICAL construct names into global scope (#152004 ) They were inserted in the current scope. OpenMP spec (all versions): The names of critical constructs are global entities of the program. If a name conflicts with any other entity, the behavior of the program is unspecified.	2025-08-06 10:16:43 -05:00
agozillon	255be51f3f	[Flang][OpenMP] Fix missing dereference in isConstructWithTopLevelTarget This was causing false negatives, yielding incorrect programs.	2025-08-05 23:46:31 -05:00
khaki3	8470027f25	[flang][acc] Add a semantic check for the validity of nested parallelism (#152225 ) This PR implements a semantic checker to ensure the legality of nested OpenACC parallelism. The following are quotes from Spec 3.3. We need to disallow loops from having parallelism at the same level as or at a sub-level of child loops. >2.9.2 gang clause >[2064] <ins>When the parent compute construct is a parallel construct</ins>, or on an orphaned loop construct, the gang clause behaves as follows. (...) The associated dimension is the value of the dim argument, if it appears, or is dimension one. The dim argument must be a constant positive integer with value 1, 2, or 3. >[2112] The region of a loop with a gang(dim:d) clause may not contain a loop construct with a gang(dim:e) clause where e >= d unless it appears within a nested compute region. >[2074] <ins>When the parent compute construct is a kernels construct</ins>, the gang clause behaves as follows. (...) >[2148] The region of a loop with the gang clause may not contain another loop with a gang clause unless within a nested compute region. >2.9.3 worker clause >[2122]/[2129] The region of a loop with the worker clause may not contain a loop with the gang or worker clause unless within a nested compute region. >2.9.4 vector clause >[2141]/[2148] The region of a loop with the vector clause may not contain a loop with a gang, worker, or vector clause unless within a nested compute region. https://openacc.org/sites/default/files/inline-images/Specification/OpenACC-3.3-final.pdf	2025-08-05 18:21:31 -07:00
Peter Klausler	867602742c	[flang] Minor speed-up to module file parsing (#152178 ) Module files shouldn't ever produce parsing errors, and if they did in the case of a badly-generated module file, the compiler will notice and crash. So we can run the parser on module files with message deferral enabled, and that saves time that would otherwise be spent generating messages on failed parsing alternatives that are discarded anyway when backtracking. It's not a big savings (single digit percentage on overall compilation time for a big application with lots of modules), but worth doing.	2025-08-05 13:40:30 -07:00
agozillon	88283a6e46	[Flang][OpenMP] Fix MSVC missing include caused by #147442 Windows/MSVC build bots are having issues with a missing header where linux is not. So, attempt to fix by adding missing header explicitly to file.	2025-08-05 10:13:32 -05:00

1 2 3 4 5 ...

8060 Commits