llvm-project

Author	SHA1	Message	Date
Leandro Lupori	ddb36a8102	[flang] Preserve dynamic length of characters in ALLOCATE (#152564 ) Fixes #151895	2025-08-19 09:25:08 -03:00
Valentin Clement (バレンタインクレメン)	eb0ddba26b	Reland "[flang][cuda] Set the allocator of derived type component after allocation" (#152418 ) Reviewed in #152379 - Move the allocator index set up after the allocate statement otherwise the derived type descriptor is not allocated. - Support array of derived-type with device component	2025-08-06 21:49:55 -07:00
Valentin Clement (バレンタインクレメン)	7d3134f6cc	Revert "[flang][cuda] Set the allocator of derived type component after allocation" (#152402 ) Reverts llvm/llvm-project#152379 Buildbot failure https://lab.llvm.org/buildbot/#/builders/207/builds/4905	2025-08-06 15:55:53 -07:00
Valentin Clement (バレンタインクレメン)	d897355876	[flang][cuda] Set the allocator of derived type component after allocation (#152379 ) - Move the allocator index set up after the allocate statement otherwise the derived type descriptor is not allocated. - Support array of derived-type with device component	2025-08-06 15:14:00 -07:00
Valentin Clement (バレンタインクレメン)	9b195dc3ef	[flang][cuda] Generate cuf.allocate for descriptor with CUDA components (#152041 ) The descriptor for derived-type with CUDA components are allocated in managed memory. The lowering was calling the standard runtime on allocate statement where it should be a `cuf.allocate` operation.	2025-08-04 16:51:11 -07:00
Valentin Clement (バレンタインクレメン)	05b52ef909	[flang][cuda][NFC] Update to the new create APIs (#152050 ) Some operation creations were updated in flang directory but not all. Migrate the CUF ops to the new create APIs introduce in #147168	2025-08-04 16:09:24 -07:00
Maksim Levental	a3a007ad5f	[mlir][NFC] update `flang/Lower` create APIs (8/n) (#149912 ) See https://github.com/llvm/llvm-project/pull/147168 for more info.	2025-07-21 19:54:29 -04:00
Kazu Hirata	938cdb30f1	[flang] Migrate away from std::nullopt (NFC) (#145928 ) ArrayRef has a constructor that accepts std::nullopt. This constructor dates back to the days when we still had llvm::Optional. Since the use of std::nullopt outside the context of std::optional is kind of abuse and not intuitive to new comers, I would like to move away from the constructor and eventually remove it. This patch replaces std::nullopt with {}. There are a couple of places where std::nullopt is replaced with TypeRange() to accommodate perfect forwarding.	2025-06-26 12:41:49 -07:00
Valentin Clement (バレンタインクレメン)	f5609aa1b0	[flang][cuda] Use a reference for asyncObject (#140614 ) Switch from `int64_t` to `int64_t*` to fit with the rest of the implementation. New tentative with some fix. The previous was reverted some time ago. Reviewed in #138010	2025-05-19 15:02:53 -07:00
Asher Mancinelli	30f7a6cc42	[flang] Correctly prepare allocatable runtime call arguments (#138727 ) When lowering allocatables, the generated calls to runtime functions were not using the runtime::createArguments utility which handles the required conversions. createArguments is where I added the implicit volatile casts to handle converting volatile variables to the appropriate type based on their volatility in the callee. Because the calls to allocatable runtime functions were not using this function, their arguments were not casted to have the appropriate volatility. Add a test to demonstrate that volatile and allocatable class/box/reference types are appropriately casted before calling into the runtime library. Instead of using a recursive variadic template to perform the conversions in createArguments, map over the arguments directly so that createArguments can be called with an ArrayRef of arguments. Some cases in Allocatable.cpp already had a vector of values at the point where createArguments needed to be called - the new overload allows calling with a vector of args or the variadic version with each argument spelled out at the callsite. This change resulted in the allocatable runtime calls having their arguments converted left-to-right, which changed some of the test results. I used CHECK-DAG to ignore the order. Add some missing handling of volatile class entities, which I previously missed because I had not yet enabled volatile class entities in Lower.	2025-05-08 06:36:39 -07:00
Valentin Clement (バレンタインクレメン)	9b6b144438	Revert "[flang][cuda] Use a reference for asyncObject" (#138221 ) Reverts llvm/llvm-project#138186	2025-05-01 17:41:44 -07:00
Valentin Clement (バレンタインクレメン)	7f922f1400	[flang][cuda] Use a reference for asyncObject (#138186 ) Switch from `int64_t` to `int64_t*` to fit with the rest of the implementation. New tentative with some fix. The previous was reverted yesterday.	2025-05-01 17:04:12 -07:00
Valentin Clement (バレンタインクレメン)	01a18809ee	Revert "[flang][cuda] Use a reference for asyncObject (#138010 )" (#138082 ) This reverts commit 9b0eaf71e674a28ee55be3afa11b5f7d4da732c0.	2025-04-30 22:03:26 -07:00
Valentin Clement (バレンタインクレメン)	9b0eaf71e6	[flang][cuda] Use a reference for asyncObject (#138010 ) Switch from `int64_t` to `int64_t*` to fit with the rest of the implementation.	2025-04-30 14:02:29 -07:00
Valentin Clement (バレンタインクレメン)	f4d87c42a6	[flang][cuda] Add asyncId to allocate entry point (#134947 )	2025-04-09 10:52:02 -07:00
Valentin Clement (バレンタインクレメン)	478e516140	[flang][cuda] Sync double descriptor after c_f_pointer call (#130194 ) After a global device pointer is set through `c_f_pointer`, we need to sync the double descriptor so the version on the device is also up to date.	2025-03-06 19:19:51 -08:00
Valentin Clement (バレンタインクレメン)	2130285564	[flang][cuda] Make sure allocator id is set for pointer allocate (#129950 )	2025-03-05 17:29:09 -08:00
Valentin Clement (バレンタインクレメン)	b7637a8557	[flang][cuda] Set PINNED variable to false in ALLOCATE (#121593 ) When `PINNED=` is used with variables that don't have the `PINNED` attribute, the logical value must be set to false when host allocation is performed.	2025-01-03 15:27:41 -08:00
Valentin Clement (バレンタインクレメン)	9165848c82	[flang][cuda] Sync global descriptor when nullifying pointer (#121595 )	2025-01-03 14:37:14 -08:00
Valentin Clement (バレンタインクレメン)	4b17a8b10e	[flang][cuda] Add operation to sync global descriptor (#121520 ) Introduce cuf.sync_descriptor to be used to sync device global descriptor after pointer association. Also move CUFCommon so it can be used in FIRBuilder lib as well.	2025-01-02 17:02:45 -08:00
Valentin Clement (バレンタインクレメン)	4cb2a519db	Revert "Reland '[flang] Allow to pass an async id to allocate the descriptor (#118713 )' and #118733 " (#121029 ) This still cause issue for device runtime build.	2024-12-23 21:27:34 -08:00
Valentin Clement (バレンタインクレメン)	5b74fb75d9	Reland '[flang] Allow to pass an async id to allocate the descriptor (#118713 )' and #118733 (#120997 ) Device runtime build have been fixed. Attempt to re-land these patches that have been approved before. https://github.com/llvm/llvm-project/pull/118713 https://github.com/llvm/llvm-project/pull/118733	2024-12-23 12:13:56 -08:00
Valentin Clement (バレンタインクレメン)	16c2a1016e	Revert "[flang] Allow to pass an async id to allocate the descriptor (#118713 )" (#119109 ) This reverts commit 7d1c661381d36018fd105f4ad4c2d6dc45e7288b. This commit breaks some device runtime builds. Need time to investigate.	2024-12-07 19:55:12 -08:00
Valentin Clement (バレンタインクレメン)	7d1c661381	[flang] Allow to pass an async id to allocate the descriptor (#118713 ) This is a patch in preparation for the support stream ordered memory allocator in CUDA Fortran. This patch adds an asynchronous id to the AllocatableAllocate runtime function and to Descriptor::Allocate so it can be passed down to the registered allocator. It is up to the allocator to use this value or not. A follow up patch will implement that asynchronous allocator for CUDA Fortran.	2024-12-04 18:24:40 -08:00
Valentin Clement (バレンタインクレメン)	d4c519e7b2	[flang][cuda] Do inline allocation/deallocation in device code (#106628 ) ALLOCATE and DEALLOCATE statements can be inlined in device function. This patch updates the condition that determined to inline these actions in lowering. This avoid runtime calls in device function code and can speed up the execution. Also move `isCudaDeviceContext` from `Bridge.cpp` so it can be used elsewhere.	2024-08-29 22:37:20 -07:00
Valentin Clement (バレンタインクレメン)	bbdb1e400f	[flang][cuda] Set the allocator on fir.embox operation (#101722 ) This patch set the `allocator_idx` attribute for allocatable descriptor that have specific CUDA attribute.	2024-08-02 14:00:26 -07:00
Tom Eccles	a56f37d3bc	[flang][Lower] get ultimate symbol when querying if pointer or allocatable (#99528 ) This fixes a bug in OpenMP privatisation. The privatised variables are created as though they are host associated clones of the original variables. These privatised variables do not contain the allocatable attribute themselves and so we need to check if the ultimate symbol is allocatable. Having or not having this flag influences whether lowering determines that this is a whole allocatable assignment, which then causes hlfir.assign not to get the realloc flag, which cases the allocatable not to be allocated when it is assigned to (leading to a segfault running the newly added test). I also did the same for pointer variables because I would imagine they could experience the same issue. There is no fallout on tests outside of OpenMP, and the gfortran test suite still passes, so I think this doesn't break host other kinds of host associated symbols.	2024-07-19 19:01:27 +01:00
Alexander Shaposhnikov	77d8cfb3c5	[Flang] Switch to common::visit more call sites (#90018 ) Switch to common::visit more call sites. Test plan: ninja check-all	2024-06-17 12:59:04 -07:00
jeanPerier	74faa402cc	[flang] lower allocatable assumed-rank specification parts (#93682 ) Lower allocatable and pointers specification parts. Nothing special is required to allocate the descriptor given they are required to be dummy arguments, however, care must be taken with INTENT(OUT) to use the runtime to deallocate them (inlined fir.embox + store is not possible).	2024-05-30 09:31:18 +02:00
Valentin Clement (バレンタインクレメン)	45daa4fdc6	[flang][cuda] Move CUDA Fortran operations to a CUF dialect (#92317 ) The number of operations dedicated to CUF grew and where all still in FIR. In order to have a better organization, the CUF operations, attributes and code is moved into their specific dialect and files. CUF dialect is tightly coupled with HLFIR/FIR and their types. The CUF attributes are bundled into their own library since some HLFIR/FIR operations depend on them and the CUF dialect depends on the FIR types. Without having the attributes into a separate library there would be a dependency cycle.	2024-05-17 09:37:53 -07:00
Christian Sigg	fac349a169	Reapply "[mlir] Mark `isa/dyn_cast/cast/...` member functions depreca… (#90406 ) …ted. (#89998)" (#90250) This partially reverts commit 7aedd7dc754c74a49fe84ed2640e269c25414087. This change removes calls to the deprecated member functions. It does not mark the functions deprecated yet and does not disable the deprecation warning in TypeSwitch. This seems to cause problems with MSVC.	2024-04-28 22:01:42 +02:00
dyung	7aedd7dc75	Revert "[mlir] Mark `isa/dyn_cast/cast/...` member functions deprecated. (#89998 )" (#90250 ) This reverts commit 950b7ce0b88318f9099e9a7c9817d224ebdc6337. This change is causing build failures on a bot https://lab.llvm.org/buildbot/#/builders/216/builds/38157	2024-04-26 12:09:13 -07:00
Christian Sigg	950b7ce0b8	[mlir] Mark `isa/dyn_cast/cast/...` member functions deprecated. (#89998 ) See https://mlir.llvm.org/deprecation and https://discourse.llvm.org/t/preferred-casting-style-going-forward.	2024-04-26 16:28:30 +02:00
Valentin Clement (バレンタインクレメン)	7c0da7993e	[flang][cuda] Use fir.cuda_deallocate for automatic deallocation (#89662 ) Automatic deallocation of allocatable that are cuda device variable must use the fir.cuda_deallocate operation. This patch update the automatic deallocation code generation to use this operation when the variable is a cuda variable. This patch has also the side effect to correctly call `attachDeclarePostDeallocAction` for OpenACC declare variable on automatic deallocation as well. Update the code in `attachDeclarePostDeallocAction` so we do not attach on fir.result but on the correct last op.	2024-04-24 08:43:54 -07:00
Valentin Clement	f35e1931be	Revert "[flang][cuda] Use fir.cuda_deallocate for automatic deallocation (#89450 )" This reverts commit 2a632d3d9f5c70db38c617b0816deb37ef722a7b. This has some implication on OpenACC postDeallocate action	2024-04-19 17:25:47 -07:00
Valentin Clement (バレンタインクレメン)	2a632d3d9f	[flang][cuda] Use fir.cuda_deallocate for automatic deallocation (#89450 ) Automatic deallocation of allocatable that are cuda device variable must use the fir.cuda_deallocate operation. This patch update the automatic deallocation code generation to use this operation when the variable is a cuda variable.	2024-04-19 14:49:56 -07:00
Valentin Clement (バレンタインクレメン)	9435edf628	[flang][cuda] Lower DEALLOCATE for device variables (#89091 ) Replace the runtime call to `AllocatableDeallocate` for CUDA device variable to the newly added `fir.cuda_deallocate` operation. This is similar with #88980 A third patch will handle the case of automatic dealloctaion of device allocatable variables	2024-04-17 13:45:22 -07:00
Valentin Clement (バレンタインクレメン)	da70f2cdcd	[flang][cuda] Lower ALLOCATE for device variable (#88980 ) Replace the runtime call to `AllocatableAllocate` for CUDA device variable to the newly added `fir.cuda_allocate` operation.	2024-04-17 08:43:25 -07:00
Valentin Clement (バレンタインクレメン)	b1a278dd87	[flang][cuda] Add a proper TODO for allocate statement for cuda var (#88034 ) Allocate statement for variable with CUDA attributes need to allocate memory on the device and not the host. Add a proper TODO so we keep track of work to be done for it.	2024-04-09 09:44:55 -07:00
jeanPerier	6a7da2e30d	[flang] Fix source allocation to explicit length after deferred length object (#87785 ) Flang supports source allocation to allocatable or pointers with a non deferred length that do not match the source length. This documented at: `9708d09003/flang/docs/Extensions.md (L312)` The current lowering code was bugged when such explicit length allocate object appeared after a deferred length object in the source allocation list: Since "lenParams" had been computed when generating allocation of the deferred length object, the call to genSetDeferredLengthParameters was not a no-op on when lowering the explicit length allocation, and the explicit length was overridden with the source length. The output of the program added in test was: ``` ZZheZZ ZZhelloZZ ZZhelloZZ ``` Instead of: ``` ZZheZZ ZZhelloZZ ZZhello ZZ ``` Skip genSetDeferredLengthParameters when the allocate object has non deferred length.	2024-04-08 10:22:44 +02:00
Peter Klausler	dc15524f61	[flang] DEALLOCATE(pointer) should use PointerDeallocate() (#79702 ) A DEALLOCATE statement on a pointer should always use PointerDeallocate() in the runtime, even if there's no STAT= or polymorphism or derived types, so that it can be checked to ensure that it is indeed a whole allocation of a pointer.	2024-01-31 11:50:09 -08:00
Valentin Clement (バレンタインクレメン)	3c8a5800f5	[flang][openacc] Place post allocate/deallocate attribute correctly (#79883 ) The `acc.declate_action` attribute was sometime misplaced as reported in #79770. This patch updates the lowering code to place the postAllocate/postDeallocate actions at the correct place.	2024-01-29 14:56:26 -08:00
Peter Klausler	a3bbe627d2	[flang][runtime] Validate pointer DEALLOCATE (#78612 ) The standard requires a compiler to diagnose an incorrect use of a pointer in a DEALLOCATE statement. The pointer must be associated with an entire object that was allocated as a pointer (not allocatable) by an ALLOCATE statement. Implement by appending a validation footer to pointer allocations. This is an extra allocated word that encodes the base address of the allocation. If it is not found after the data payload when the pointer is deallocated, signal an error. There is a chance of a false positive result, but that should be vanishingly unlikely. This change requires all pointer allocations (not allocatables) to take place in the runtime in PointerAllocate(), which might be slower in cases that could otherwise be handled with a native memory allocation operation. I believe that memory allocation of pointers is less common than with allocatables, which are not affected. If this turns out to become a performance problem, we can inline the creation and initialization of the footer word. Fixes https://github.com/llvm/llvm-project/issues/78391.	2024-01-25 14:44:09 -08:00
Pete Steinfeld	5db4779c3f	[flang] Regularize TODO messages for coarray related features (#69227 ) I want to make "not yet implemented" messages for features related to coarrays easy to identify and make them easy for users to read.	2023-10-16 12:37:57 -07:00
jeanPerier	2cb31fe8ea	[flang] Centralize automatic deallocation code in lowering (#67003 ) There are currently several places that automatically deallocate allocatble if they are allocated: - INTENT(OUT) allocatable are deallocated on entry in the callee - INTENT(OUT) allocatable are also deallocated on the caller side of BIND(C) function in case the implementation is in C. - Results of function returning allocatable are deallocated after usage. - OPENMP privatized allocatable are deallocated at the end of OPENMP region. Introduce genDeallocateIfAllocated that centralize all this code, except for the function return that use genFreememIfAllocated since finalization is done separately currently. `fir:🏭:genFinalization` and `fir:🏭:genInlinedDeallocation` are removed and replaced by genFreemem since their name were misleading: finalization was not called. There is a fallout in the tests because previous generated code did not check the allocated status when doing inline deallocation. This was OK since free(null) is guaranteed to be a no-op, but this makes compiler code more complex, is a bit surprising in the generated IR IMHO, and it relied on knowing when genDeallocateBox inserts runtime calls or uses inlined code.	2023-09-21 18:38:23 +02:00
Valentin Clement	2e1982f31d	[flang][openacc] Add acc.declare_action attributes on operation This patches adds the acc.declare_action attrbites on post allocate operation and pre/post deallocate operations. Reviewed By: razvanlupusoru Differential Revision: https://reviews.llvm.org/D157915	2023-08-15 09:44:42 -07:00
Peter Klausler	4ad7279392	[flang] CUDA Fortran - part 1/5: parsing Begin upstreaming of CUDA Fortran support in LLVM Flang. This first patch implements parsing for CUDA Fortran syntax, including: - a new LanguageFeature enum value for CUDA Fortran - driver change to enable that feature for .cuf and .CUF source files - parse tree representation of CUDA Fortran syntax - dumping and unparsing of the parse tree - the actual parsers for CUDA Fortran syntax - prescanning support for !@CUF and !$CUF - basic sanity testing via unparsing and parse tree dumps ... along with any minimized changes elsewhere to make these work, mostly no-op cases in common::visitors instances in semantics and lowering to allow them to compile in the face of new types in variant<> instances in the parse tree. Because CUDA Fortran allows the kernel launch chevron syntax ("call foo<<<blocks, threads>>>()") only on CALL statements and not on function references, the parse tree nodes for CallStmt, FunctionReference, and their shared Call were rearranged a bit; this caused a fair amount of one-line changes in many files. More patches will follow that implement CUDA Fortran in the symbol table and name resolution, and then semantic checking. Differential Revision: https://reviews.llvm.org/D150159	2023-05-31 09:48:59 -07:00
Jean Perier	ffe4029d92	[flang] Turn on use-desc-for-alloc by default Currently, local allocatables and contiguous/scalar pointers (and some other conditions) are lowered to a set of independent variables in FIR (one for the address, one for each bound and one for character length). The intention was to help LLVM get rids of descriptors. But LLVM knows how to do that anyway in those cases: ``` subroutine foo(x) real, target :: x(100) real, pointer, contiguous :: p(:) p => x call bar(p(50)) end subroutine ``` The output fir the option on or off is the same after llvm opt -O1, there is no descriptor anymore, the indirection is removed. ``` define void @foo_(ptr %0) local_unnamed_addr { %2 = getelementptr [100 x float], ptr %0, i64 0, i64 49 tail call void @bar_(ptr %2) ret void } ``` So the benefit of not using a descriptor in lowering is questionable, and although it is abstracted as much as possible in the so called MutableBoxValue class that represent allocatable/pointer in lowering it is still causing bugs from time to time, and will also be a bit problematic when emitting debug info for the pointer/allocatable. In HLFIR lowering, the simplification to always use a descriptor in lowering was already made. This patch allows decorrelating the impact from this change from the bigger impact HLFIR will have so that it is easier to get feedback if this causes performance issues. The lowering tests relying on the previous behavior are only updated to set back this option to true. The reason is that I think we should preserve coverage of the code dealing with the "non descriptor" approach in lowering until we actually get rid of it. The other reason is that the test will have to be or are already covered by equivalent HLFIR tests, which use descriptors. Differential Revision: https://reviews.llvm.org/D148910	2023-04-24 09:07:30 +02:00
Valentin Clement	029313cc97	[flang] Update allocate lowering to use AllocatableInit.*ForAllocate functions Update lowering of allocate statement to use the new functions defined in D146290. Depends on D146290 Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D146291	2023-03-20 10:01:51 +01:00
Peter Steinfeld	34ed7db9e1	[Flang] Fix ALLOCATE with MOLD where MOLD is a scalar We were failing tests where an ALLOCATE statement that allocated an array had a non-character scalar MOLD argument. I fixed this by merging the code for ALLOCATE statements with MOLD and SOURCE arguments. Differential Revision: https://reviews.llvm.org/D145418	2023-03-09 06:07:50 -08:00

1 2

86 Commits