llvm-project

Author	SHA1	Message	Date
Maksim Levental	46f6df0848	[mlir][NFC] update `flang/Optimizer/Transforms` create APIs (11/n) (#149915 ) See https://github.com/llvm/llvm-project/pull/147168 for more info.	2025-07-21 19:37:17 -04:00
Slava Zakharin	da959c92c5	[flang] Fixed out-of-bounds access in SimplifyIntrinsics. (#136171 ) When the mask is scalar, it is incorrect to cast it to !fir.box<!fir.array<1xlogical<>>>, because the coordinate operation will try to read the dim-1 stride from the box to get the address of the first element. Even though the stride value will be multiplied by 0, and does not matter, it is still a read past the allocated box object. Instead, we should just use box_addr to get the address of the scalar mask.	2025-04-17 11:46:06 -07:00
Valentin Clement (バレンタインクレメン)	478e516140	[flang][cuda] Sync double descriptor after c_f_pointer call (#130194 ) After a global device pointer is set through `c_f_pointer`, we need to sync the double descriptor so the version on the device is also up to date.	2025-03-06 19:19:51 -08:00
Kelvin Li	996092d5a5	[flang] probably convert Fortran logical to i1 in expanding hlfir.maxloc/hlfir.minloc opcodes (#129791 ) If mask is a scalar, it always converts to !fir.box<!fir.array<1xi1>>. The wrong value may be picked up when passing to the function on the big endian platform. This patch is to do the conversion based on the original type of the mask and convert the value to i1 after the load.	2025-03-06 15:47:44 -05:00
Michael Kruse	b815a3942a	[Flang] Move non-common headers to FortranSupport (#124416 ) Move non-common files from FortranCommon to FortranSupport (analogous to LLVMSupport) such that * declarations and definitions that are only used by the Flang compiler, but not by the runtime, are moved to FortranSupport * declarations and definitions that are used by both ("common"), the compiler and the runtime, remain in FortranCommon * generic STL-like/ADT/utility classes and algorithms remain in FortranCommon This allows a for cleaner separation between compiler and runtime components, which are compiled differently. For instance, runtime sources must not use STL's `<optional>` which causes problems with CUDA support. Instead, the surrogate header `flang/Common/optional.h` must be used. This PR fixes this for `fast-int-sel.h`. Declarations in include/Runtime are also used by both, but are header-only. `ISO_Fortran_binding_wrapper.h`, a header used by compiler and runtime, is also moved into FortranCommon.	2025-02-06 15:29:10 +01:00
Valentin Clement (バレンタインクレメン)	4b17a8b10e	[flang][cuda] Add operation to sync global descriptor (#121520 ) Introduce cuf.sync_descriptor to be used to sync device global descriptor after pointer association. Also move CUFCommon so it can be used in FIRBuilder lib as well.	2025-01-02 17:02:45 -08:00
Valentin Clement (バレンタインクレメン)	a76609dd72	[flang][cuda] Avoid intrinsics simplification in device context (#117026 )	2024-11-21 10:37:38 -08:00
Christian Sigg	bd9fdce69b	[flang] Use `isa/dyn_cast/cast/...` free functions. (#90432 ) The corresponding member functions are deprecated.	2024-04-29 09:16:22 +02:00
Christian Sigg	fac349a169	Reapply "[mlir] Mark `isa/dyn_cast/cast/...` member functions depreca… (#90406 ) …ted. (#89998)" (#90250) This partially reverts commit 7aedd7dc754c74a49fe84ed2640e269c25414087. This change removes calls to the deprecated member functions. It does not mark the functions deprecated yet and does not disable the deprecation warning in TypeSwitch. This seems to cause problems with MSVC.	2024-04-28 22:01:42 +02:00
dyung	7aedd7dc75	Revert "[mlir] Mark `isa/dyn_cast/cast/...` member functions deprecated. (#89998 )" (#90250 ) This reverts commit 950b7ce0b88318f9099e9a7c9817d224ebdc6337. This change is causing build failures on a bot https://lab.llvm.org/buildbot/#/builders/216/builds/38157	2024-04-26 12:09:13 -07:00
Christian Sigg	950b7ce0b8	[mlir] Mark `isa/dyn_cast/cast/...` member functions deprecated. (#89998 ) See https://mlir.llvm.org/deprecation and https://discourse.llvm.org/t/preferred-casting-style-going-forward.	2024-04-26 16:28:30 +02:00
Tom Eccles	81442f8d97	[flang][NFC] Use tablegen to create SimplifyIntrinsics constructor (#89963 ) This pass runs on ModuleOp, internally walking all func::CallOps so it shouldn't need anything special to work on other top level operations.	2024-04-25 10:26:05 +01:00
jeanPerier	a4798bb0b6	[flang][NFC] use mlir::SymbolTable in lowering (#86673 ) Whenever lowering is checking if a function or global already exists in the mlir::Module, it was doing module->lookup. On big programs (~5000 globals and functions), this causes important slowdowns because these lookups are linear. Use mlir::SymbolTable to speed-up these lookups. The SymbolTable has to be created from the ModuleOp and maintained in sync. It is therefore placed in the converter, and FirOPBuilders can take a pointer to it to speed-up the lookups. This patch does not bring mlir::SymbolTable to FIR/HLFIR passes, but some passes creating a lot of runtime calls could benefit from it too. More analysis will be needed. As an example of the speed-ups, this patch speeds-up compilation of Whizard compare_amplitude_UFO.F90 from 5 mins to 2 mins on my machine (there is still room for speed-ups).	2024-04-02 14:29:29 +02:00
David Green	2a95fe481d	[Flang] Allow Intrinsic simpification with min/maxloc dim and scalar result (#81619 ) This makes an adjustment to the existing fir minloc/maxloc generation code to handle functions with a dim=1 that produce a scalar result. This should allow us to get the same benefits as the existing generated minmax reductions. This is a recommit of #76194 with an extra alteration to the end of genRuntimeMinMaxlocBody to make sure we convert the output array to the correct type (a `box<heap<i32>>`, not `box<heap<array<1xi32>>>`) to prevent writing the wrong type of box into it. This still allocates the data as a `array<1xi32>`, converting it into a i32 assuming that is safe. An alternative would be to allocate the data as a i32 and change more of the accesses to it throughout genRuntimeMinMaxlocBody.	2024-03-02 14:39:59 +00:00
David Green	7242896233	[Flang] Attempt to fix Nan handling in Minloc/Maxloc intrinsic simplification (#82313 ) In certain case "extreme" values like Nan, Inf and 0xffffffff could lead to generating different code via the inline-generated intrinsics vs the versions in the runtimes (and other compilers like gfortran). There are some examples I was using for testing in https://godbolt.org/z/x4EfqEss5. This changes the generation for the intrinsics to be more like the runtimes, using a condition that is similar to: isFirst \|\| (prev != prev && elem == elem) \|\| elem < prev The middle part is only used for floating point operations, and checks if the values are Nan. This should then hopefully make the logic closer to - return the first element with the lowest value, with Nans ignored unless there are only Nans. The initial limit value for floats are also changed from the largest float to Inf, to make sure it is handled correctly. The integer reductions are also changed to use a similar scheme to make sure they work with masked values. This means that the preamble after the loop can be removed.	2024-02-21 09:31:29 +00:00
David Green	815a846552	[Flang] Move genMinMaxlocReductionLoop to Transforms/Utils.cpp (#81380 ) This is one option for attempting to move genMinMaxlocReductionLoop to a better location. It moves it into Transforms and makes HLFIRTranforms depend upon FIRTransforms. It passes a build locally, both with and without -DBUILD_SHARED_LIBS, and does OK on the windows CI.	2024-02-13 08:31:07 +00:00
David Green	202917f86e	[Flang] Move genMinMaxlocReductionLoop to a common location. The shared library build doesn't like references of genMinMaxlocReductionLoop, in Optimizer/Transforms, from HLFIR/Optimizer/Transforms. For the moment I've moved the code to the header file where it can be shared, like other methods in Utils.h	2024-01-25 13:31:18 +00:00
David Green	223d3dabc8	[Flang] Minloc elemental intrinsic lowering (#74828 ) Currently the lowering of a minloc intrinsic with a mask will look something like: %e = hlfir.elemental %shape ({ ... }) %m = hlfir.minloc %array mask %e hlfir.assign %m to %result hlfir.destroy %m The elemental will be expanded into a temporary+loop, the minloc into a FortranAMinloc call (which hopefully gets simplified to a specialized call that can be inlined at the call site), and the assign might get expanded to a FortranAAssign. It would be better to generate the entire construct as single loop if we can - one that performs the minloc calculation with the mask elemental computed inline. This patch attempt to do that, adding a hlfir version of the expansion code from SimplifyIntrinsics that turns an minloc+elemental into a single combined loop nest. It attempts to reuse the methods in genMinlocReductionLoop for constructing the loop with a modified loop body. The declaration for the function is currently in Optimizer/Support/Utils.h, but there might be a better place for it. It is added as part of the OptimizedBufferizationPass, like the similar count/any/all that have been added recently.	2024-01-25 12:17:12 +00:00
Pete Steinfeld	4f59a38821	Revert #76194 (#76987 ) [Flang] Revert "Allow Intrinsic simpification with min/maxloc dim and…scalar result (#76194)" This reverts commit 9b7cf5bfb08b6e506216ef354dfd61adb15acbff. See merge request #76194. This change was causing several failures in our internal tests. I'm reverting now and will work on creating a test that David Green can use to reproduce the problem.	2024-01-04 10:19:50 -08:00
David Green	9b7cf5bfb0	[Flang] Allow Intrinsic simpification with min/maxloc dim and scalar result (#76194 ) This makes an adjustment to the existing fir minloc/maxloc generation code to handle functions with a dim=1 that produce a scalar result. This should allow us to get the same benefits as the existing generated minmax reductions. This is a recommit of #75820 with the typename added to the generated function.	2024-01-02 11:09:18 +00:00
Pete Steinfeld	0cf3af0c51	Revert "[Flang] Allow Intrinsic simpification with min/maxloc dim and… (#76184 ) … scalar result. (#75820)" This reverts commit 701f64790520790f75b1f948a752472d421ddaa3. The commit breaks some uses of the 'maxloc' intrinsic. See PR #75820	2023-12-21 13:14:05 -08:00
David Green	701f647905	[Flang] Allow Intrinsic simpification with min/maxloc dim and scalar result. (#75820 ) This makes an adjustment to the existing fir minloc/maxloc generation code to handle functions with a dim=1 that produce a scalar result. This should allow us to get the same benefits as the existing generated minmax reductions.	2023-12-20 12:12:12 +00:00
David Green	9bb47f7f8b	[Flang] Add Maxloc to fir simplify intrinsics pass (#75463 ) This takes the code from D144103 and extends it to maxloc, to allow the simplifyMinMaxlocReduction method to work with both min and max intrinsics by switching condition and limit/initial value.	2023-12-18 07:59:51 +00:00
Kazu Hirata	11efccea8f	[flang] Use StringRef::{starts,ends}_with (NFC) This patch replaces uses of StringRef::{starts,ends}with with StringRef::{starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. I'm planning to deprecate and eventually remove StringRef::{starts,ends}with.	2023-12-13 23:48:53 -08:00
Slava Zakharin	89b98c13e0	[flang] Fixed simplification for FP maxval. On x86, a simplified F128 maxval ends up calling fmaxl that does not work properly for F128 arguments. It is probably an LLVM issue, but we also should not use arith.maxf if NaN or -0.0 operands are possible. The change is to use cmpf and select. Unfortunately, these arith ops do not support FastMathFlags currently, so I will have to fix this sooner or later (depending on how this affects performance). Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D158200	2023-08-21 19:33:56 -07:00
David Truby	f52c64b115	[flang] Add fastmath flags to localBuilder in IntrinsicCall Currently the local builder used in IntrinsicCall doesn't have the fastmath flags passed to it. This results in the fastmath attribute not being added to certain runtime calls. This patch simply forwards the fastmath flags from the parent builder. Differential Revision: https://reviews.llvm.org/D154611	2023-07-11 18:53:31 +01:00
Slava Zakharin	7a607e253d	[flang] Removed unnecessary llvm/CodeGen/SelectionDAGNodes.h include. Required after D148767 for flang+debug+slibs build. Reviewed By: chapuni, clementval Differential Revision: https://reviews.llvm.org/D149764	2023-05-03 15:10:09 -07:00
Renaud-K	b07ef9e7cd	Break circular dependency between FIR dialect and utilities	2023-03-09 15:24:51 -08:00
Sacha Ballantyne	242bb0b652	[flang] Fix a bug with simplified minloc that treated logicals with even values > 1 as 0 Previously the mask would be loaded as the appropriate integer type and cast to I1 to pass to fir.if, however this truncates the integer and so would cast 6 to 0. By loading values as logicals and casting to I1 this problem is avoided. Reviewed By: Leporacanthicus Differential Revision: https://reviews.llvm.org/D144974	2023-02-28 17:15:36 +00:00
Sacha Ballantyne	79dccded69	[flang] Change COUNT intrinsic to support different kind logicals Previously COUNT would cast the mask input to logical<4> before passing it to the runtime function, this has been changed to allow different types of logical. Reviewed By: tblah Differential Revision: https://reviews.llvm.org/D144867	2023-02-28 12:26:33 +00:00
Sacha Ballantyne	614cd721e1	[Flang] Add Minloc to simplify intrinsics pass This patch adds minloc to the simplify intrinsics pass, supporting calls with KIND or MASK arguments while calls which have BACK, DIM or have a CHARACTER input array are rejected. This patch is targeting exchange2, and in benchmarks provides a ~11% improvement in performance. Also included are some minor style changes / cleanup in simplifyIntrinsics.cpp. Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D144103	2023-02-27 11:36:55 +00:00
Sacha Ballantyne	98ecc3ac77	[Flang] Fix for Any/All simplification to properly propogate the inital value When rank > 1, the inital value would be lost on inner loops, leading to the wrong value to be returned, e.g. This would return T. This patch fixes this to use the correct inital value for all cases. ``` Integer :: m(0,10) Any(m .eq 0) ``` Reviewed By: vdonaldson Differential Revision: https://reviews.llvm.org/D143899	2023-02-14 10:28:56 +00:00
Sacha Ballantyne	20fba03f96	[Flang] Add Any and All intrinsics to simplify intrinsics pass This patch provides a simplified version of the Any intrinsic as well as the All intrinsic that can be used for inlining or simpiler use cases. These changes are targeting exchange2, and provide a ~9% performance increase. Reviewed By: Leporacanthicus, vzakhari Differential Revision: https://reviews.llvm.org/D142977	2023-02-09 19:52:15 +00:00
Sacha Ballantyne	bb94d33aac	[flang] Fix simplify intrinsic for count not checking for rank = 0 properly Simple fix to check for rank in the same way as other intrinsics to allow runtime count to take over when dealing with unknown dimension arrays. Fixes #60356 Reviewed By: Leporacanthicus Differential Revision: https://reviews.llvm.org/D142877	2023-01-30 12:23:37 +00:00
Sacha Ballantyne	7d2e198729	[flang] Add Count to simplified intrinsics This patch adds a simplfiied version of count for the simplify intrinsics pass, allowing the function to be inlined. This was done specifically to help improve performance for exchange2, and provides a ~12% performance increase. Reviewed By: vzakhari, Leporacanthicus Differential Revision: https://reviews.llvm.org/D142209	2023-01-27 16:30:11 +00:00
Kazu Hirata	0db88db5d9	flang] Remove remaining uses of llvm::Optional (NFC) This patch removes the unused "using" declaration and removes #include "llvm/ADT/Optional.h". This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-01-07 22:32:38 -08:00
Kazu Hirata	c09215860f	[flang] Use std::optional instead of llvm::Optional (NFC) This patch replaces (llvm::\|)Optional< with std::optional<. I'll post a separate patch to remove #include "llvm/ADT/Optional.h". This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-01-07 22:26:48 -08:00
Kazu Hirata	4d4d4785e0	[flang] Add #include <optional> (NFC) This patch adds #include <optional> to those files containing llvm::Optional<...> or Optional<...>. I'll post a separate patch to actually replace llvm::Optional with std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-01-07 20:55:47 -08:00
Kazu Hirata	c15a925ada	[flang] Use std::optional instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-10 17:00:21 -08:00
Kazu Hirata	9a41739565	[flang] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-03 12:14:21 -08:00
Slava Zakharin	ffe1661fab	[flang] Propagate fastmath flags during intrinsics simplification. In general, the meaning of fastmath flags on a call during inlining is that the call's operation flags must be ignored. For user functions that means that the fastmath flags used for the function definition override any call site's fastmath flags. For intrinsic functions we can use the call site's fastmath flags, but we have to make sure that the call sites with different flags produce/use different simplified versions of the same intrinsic function. Differential Revision: https://reviews.llvm.org/D138048	2022-11-17 10:16:47 -08:00
David Truby	d983f5f39e	[flang] Add cpowi function to runtime and use instead of pgmath This patch adds a cpowi function to the flang runtime, and switches to using that function instead of pgmath for complex number to integer power operations. Differential Revision: https://reviews.llvm.org/D134889	2022-10-11 12:34:58 +00:00
Slava Zakharin	8bd76ac151	[flang] Support multidimensional reductions in SimplifyIntrinsicsPass. Create simplified functions for each rank with "x<rank>" suffix that implement multidimensional reductions. To enable this I had to fix an issue with taking incorrect box shape in cases of sliced embox/rebox. Differential Revision: https://reviews.llvm.org/D133820	2022-09-19 12:16:23 -07:00
Slava Zakharin	2b138567e0	[flang] Support more data types for reduction in SimplifyIntrinsicsPass. All floating point (not complex) and integer types should be supported now. Differential Revision: https://reviews.llvm.org/D133818	2022-09-19 12:16:22 -07:00
Mats Petersson	aa94eb3877	[FLANG][NFC]Use RTNAME instead of hard-coding for simplify intrinsics Use the RTNMAE macro (via stringify macros) to generate the name strings for runtime functions, instead of using strings. The sequence of macros generate exactly the same string as the ones used previously, but this will support future changes in runtime function names. No functional change. Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D132652	2022-09-05 13:06:44 +01:00
Mats Petersson	43159b5808	[FLANG][NFCI]De-duplicate code in SimplifyIntrinsics This removes a bunch of duplicated code, by adding an intermediate function simplifyReduction that takes a std::function argument for the actual replacement of the code. No functional change intended. Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D132588	2022-09-02 10:49:25 +01:00
Michele Scuttari	67d0d7ac0a	[MLIR] Update pass declarations to new autogenerated files The patch introduces the required changes to update the pass declarations and definitions to use the new autogenerated files and allow dropping the old infrastructure. Reviewed By: mehdi_amini, rriddle Differential Review: https://reviews.llvm.org/D132838	2022-08-31 12:28:45 +02:00
Michele Scuttari	039b969b32	Revert "[MLIR] Update pass declarations to new autogenerated files" This reverts commit 2be8af8f0e0780901213b6fd3013a5268ddc3359.	2022-08-30 22:21:55 +02:00
Michele Scuttari	2be8af8f0e	[MLIR] Update pass declarations to new autogenerated files The patch introduces the required changes to update the pass declarations and definitions to use the new autogenerated files and allow dropping the old infrastructure. Reviewed By: mehdi_amini, rriddle Differential Review: https://reviews.llvm.org/D132838	2022-08-30 21:56:31 +02:00
Mats Petersson	5653884e34	[FLANG]Remove experimental flag from SUM simplification The SUM function does appear to be safe to use, so remove the experimental flag for the SUM operation. Reviewed By: vzakhari, awarzynski Differential Revision: https://reviews.llvm.org/D132567	2022-08-25 14:11:41 +01:00

1 2

58 Commits