llvm-project

Author	SHA1	Message	Date
jeanPerier	3643d11988	[flang][hlfir] Support box in user defined assignments (#77578 ) When dealing with overlaps in user defined assignments, some entities with descriptors (fir.box) may be saved without descriptors. The current code was replacing the original box entity with the "raw" copy with a simple cast instead of creating a box for the copy. This patch ensures a fir.embox is emitted instead.	2024-01-11 08:50:35 +01:00
Slava Zakharin	69d9ad1cee	[flang][hlfir] Fixed cleanup code placement indeterminism in OrderedAssignments. (#66811 ) I had to remove test3() case in 73086dab9e9870f49f3a9cc2763a1e8b1a20aa20 to fix the buildbots. This patch brings it back with proper fix.	2023-09-20 08:34:11 -07:00
Slava Zakharin	73086dab9e	Revert "Revert "[flang][hlfir] Fixed assignment/finalization order for user-defined assignments. (#66736 )"" This reverts commit 775754e32840c6b6ca64c8bc0b7ae2c778b97d1e. Relanding with removing part of the LIT test. There seems to be operations ordering indeterminism that is unrelated to my change. I will address this issue separately.	2023-09-19 11:40:58 -07:00
Slava Zakharin	775754e328	Revert "[flang][hlfir] Fixed assignment/finalization order for user-defined assignments. (#66736 )" This reverts commit a9a1f849a99ea1d7c44fe0419f6b11c0e5b13495.	2023-09-19 11:31:58 -07:00
Slava Zakharin	a9a1f849a9	[flang][hlfir] Fixed assignment/finalization order for user-defined assignments. (#66736 ) This patch places the finalization code for the RHS of a user-defined assignment after the assignment code. The change only affects standalone RegionAssignOp operations.	2023-09-19 10:57:40 -07:00
Jean Perier	0446bfcc5c	[flang][hlfir] Codegen of hlfir.region_assign where LHS conflicts When the analysis of hlfir.region_assign determined that the LHS region evaluation may be impacted by the assignment effects, all LHS must be fully evaluated and saved before any assignment is done. This patch adds TemporaryStorage variants to save address, including vector subscripted entities addresses whose shape must be saved. It uses the DescriptorStack runtime to deal with complex cases inside forall. For the sake of simplicity, this is also used for vector subscripted LHS outside of foralls (each element address is saved as a descriptor on this stack. This is a bit suboptimal, but it is a safe start that will work with all kinds of type (polymorphic, PDTs...) without further work). Another approach would be to saved only the values that are conflicting in the LHS computation, but this would require a much more complex analysis of the LHS region DAG. Differential Revision: https://reviews.llvm.org/D154057	2023-06-30 09:20:52 +02:00
Slava Zakharin	39e87db192	[flang][hlfir] Codegen for unordered elemental operations. Depends on D154031, D154032 Reviewed By: jeanPerier, tblah Differential Revision: https://reviews.llvm.org/D154033	2023-06-29 10:35:43 -07:00
Jean Perier	fc2c8fed0b	[flang][hlfir] Do not reuse hlfir.expr mask when saving RHS. In WHERE and masked FORALL assignment, both the mask and the RHS may need to be saved in some temporary storage before evaluating the assignment. The code was trying to "optimize" that case when evaluating the RHS by not fetching the mask temporary that was just created, but in simple cases of WHERE construct where the evaluated mask is an hlfir.expr, this caused the hlfir.expr to be both used in an hlfir.associate and later in an hlfir.apply to create the fir.if to mask the RHS evaluation. This double usage prevents codegen from inlining the hlfir.expr at the hlfir.apply, and from "moving" the hlfir.expr storage into the temp during hlfir.associate bufferization. So this is pessimizing the code: this would lead to created two mask array temporary storages This was caught by the unexpectedly high number of "not yet implemented: hlfir.associate of hlfir.expr with more than one use" that were firing. Use the mask temporary instead (the hlfir.associate result) when possible. Some temporary (the "inlined stack") do not support fetching and pushing in the same run (a single counter is used to keep track of the fetching and pushing position). Add a canBeFetchedAfterPush() for safety, but this limitation is anyway not relevant for hlfir.expr since the inlined stack is only used to save "trivial" scalars. Also update the temporary storage name to only indicate "forall" if the top level construct is a FORALL. This is not a very precise name, but it should at least give a correct context to indicate in the IR why some temporary array storage was created. Differential Revision: https://reviews.llvm.org/D153880	2023-06-28 08:34:22 +02:00
Jean Perier	6c14e84926	[flang][hlfir] Add codegen for vector subscripted LHS This patch adds support for vector subscripted assignment left-hand side. It does not yet add support for the cases where the LHS must be saved because its evaluation could be impacted by the assignment. The implementation adds an hlfir::ElementalOpInterface to share the elemental inlining utility and some other tools between hlfir::ElementalOp and hlfir::ElelemntalAddrOp. It adds generateYieldedLHS() to allow retrieving the LHS value in lowering, whether or not it is vector subscripted. If it is vector subscripted, this utility creates a loop nest iterating over the elements and returns the address of an element. Differential Revision: https://reviews.llvm.org/D153759	2023-06-27 13:30:24 +02:00
Jean Perier	9231134708	[flang][hlfir] user defined assignment codegen Add codegen support for hlfir.region_assign with user defined assignment. It is currently a bit pessimistic, because outside of forall, it does not use the PURE aspect, if any, of the assignment routine to rule out that the routine can write to something else than the LHS that could overlap with the RHS. However, the current lowering is anyway adding parenthesis around the RHS, so this should not cause performance regressions. Differential Revision: https://reviews.llvm.org/D153516	2023-06-26 13:24:36 +02:00
Tom Eccles	53cc33b00b	[flang] Store KindMapping by value in FirOpBuilder Previously only a constant reference was stored in the FirOpBuilder. However, a lot of code was merged using FirOpBuilder builder{rewriter, getKindMapping(mod)}; This is incorrect because the KindMapping returned will go out of scope as soon as FirOpBuilder's constructor had run. This led to an infinite loop running some tests using HLFIR (because the stack space containing the kind mapping was re-used and corrupted). One solution would have just been to fix the incorrect call sites, however, as a large number of these had already made it past review, I decided to instead change FirOpBuilder to store its own copy of the KindMapping. This is not costly because nearly every time we construct a KindMapping is exclusively to construct a FirOpBuilder. To make this common pattern simpler, I added a new constructor to FirOpBuilder which calls getKindMapping(). Differential Revision: https://reviews.llvm.org/D151881	2023-06-05 09:57:57 +00:00
Kazu Hirata	00c7bdf80c	[flang] Fix an unused variable warning This patch fixes: flang/lib/Optimizer/HLFIR/Transforms/LowerHLFIROrderedAssignments.cpp:911:10: error: unused variable 'inserted' [-Werror,-Wunused-variable]	2023-05-25 22:36:48 -07:00
Jean Perier	b361e1c025	[flang][hlfir] address post-commit comments from D151247 and D151251 Addresses comments not addressed in https://reviews.llvm.org/D151251 and https://reviews.llvm.org/D151247 - Fix typo in comments. - Update an expected test output to include the fir.allocmem argument. - Make a more generic type comparisons and cast when fetching value back from the AnyValueStack temporary storage. Differential Revision: https://reviews.llvm.org/D151428	2023-05-25 16:34:54 +02:00
Jean Perier	f3c3f63672	[flang][hlfir] Generate temporary storage in Forall/Where [2/2] Generate temporary storage inside WHERE and FORALL using the temporary stack runtime. This covers all cases outside of LHS temporary, where the descriptor stack will have to be used. Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D151251	2023-05-25 11:56:29 +02:00
Jean Perier	d7578116b8	[flang][hlfir] Generate temporary storage in Forall/Where [1/2] Generate temporary storage inline inside WHERE and FORALL when possible. A following patch will use the runtime to cover the generic cases. Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D151247	2023-05-25 11:51:32 +02:00
Jean Perier	96a003b9bf	[flang][hlfir] Hoist forall bounds computation when possible When inner forall bound computations do not depend on previous forall indices, they can be hoisted. This is possible because: - bound computation are required to be pure (so evaluating them only once is possible). - If the bound computation depends on a value previously assigned, the forall scheduling analysis created different run for it: the assignment impacting the bounds value is not part of the current loop nest. The reason this optimization is done at that point and not as part of generic loop hoisting optimization is that having the all the loop bound computation hoisted will allow allocating simple temporary storages. The number of iteration can be pre-computed and used as the extent for the temporary. Differential Revision: https://reviews.llvm.org/D151110	2023-05-23 09:17:44 +02:00
Jean Perier	5d0c5c5928	[flang][hlfir] Lower hlfir.where when there are no conflicts Lower hlfir.where when the scheduling analysis determined that no temporary storage is needed. Differential Revision: https://reviews.llvm.org/D150881	2023-05-22 12:50:54 +02:00
Jean Perier	222a8a1b2d	[flang][hlfir] Enable WHERE scheduling in LowerHLFIROrderedAssignments Nothing special is needed, other than adding the logging code for where masks and to plug the pattern. This tests mainly adds test. Note that some of the justifications to create temps shows some lacks of side effect interface on operations (like hlfir.transpose), or on some transparent llvm intrinsic calls (llvm.stacksave/restore). I think we should as much as possible try to improve this on the ops generate code rather than special casing it here. Differential Revision: https://reviews.llvm.org/D150581	2023-05-22 12:49:33 +02:00
Jean Perier	c9e3840c83	[flang][hlfir] Implement hlfir.forall codegen when no temp is required The patch applies the schedule built with the utility added in the previous D150455 patch to generate the code for an ordered assignment tree. For now, it only supports forall that do not contain user defined assignments or assignments to vector subscripted entities, and for which the scheduling analysis does not require temporary storages. Support for temporary, WHERE, and user-defined/vector subscript assignment will be added in later patches. This enables end-to-end support with HLFIR for forall where the schedule analysis can prove there is no need to create temporary storage. Differential Revision: https://reviews.llvm.org/D150564	2023-05-19 15:55:40 +02:00
Jean Perier	4f30a63ca2	[flang][hlfir] Implement the scheduling part of hlfir.forall codegen The lowering of hlfir.forall to loops (and later hlfir.where) requires doing a data dependency analysis to avoid creating temporary storage for every control/mask/rhs/lhs expressions. The added code implements a data dependency analysis for the hlfir ordered assignment trees (it is not specific to Forall since these nodes includes Where, user defined assignments, and assignment to vector subscripted entities, but the added code is only plugged and tested with hlfir.forall in this patch). This data dependency analysis returns a "schedule", which is a list of runs containing actions. Each runs will result in a single loop nest evaluating all its action "at the same time" inside the loop body. Actions may either evaluating an assignment, or saving some expression evaluation (the value yielded inside the ordered assignment hlfir operations) in a temporary storage before doing the assignment that requires this expression value but may "conflict" with it. A "conflict" is a read in an expression E to a variable that is, or may be (analysis is conservative), written by an assignment that depends on E. The analysis is based on MLIR SideEffectInterface and fir AliasAnalysis which makes it generic. For now, the codegen that will apply the schedule and rewrite the hlfir.forall into a set of loops is not implemented, but the scheduling is tested on its own (from Fortran, because it allows testing many cases in very readable fashions). The current scheduling has limitations, for instance "forall(i=1, 10) x(i)=2*x(i)" does not require saving the RHS values for all "i" before doing the assignments since the RHS does not depend on values computed during previous iterations. Any user call will also trigger a conservative assumption that there is a conflict. Finally, a lot of operations are missing memory effect interfaces (especially in HLFIR). This patch adds a few so that it can be tested, but more will be added in later patches. Differential Revision: https://reviews.llvm.org/D150455	2023-05-17 14:23:33 +02:00
Jean Perier	88ed439562	[flang][hlfir] Add ordered assignment pass with TODOs I plan to implement lowering from parse tree to HLFIR first for forall and where to ease testing of the rewrite pass while writing it. To avoid cryptic errors in ConvertToFir pass about unhandled operations, this patch already defines the pass that will further lower these operations and make it throw clear TODO messages. Differential Revision: https://reviews.llvm.org/D149852	2023-05-05 09:21:43 +02:00

21 Commits