llvm-project

Author	SHA1	Message	Date
jeanPerier	bfcd05317d	[flang][hlfir] Do not emit extra declare for dummy used in BLOCK (#69184 ) When a variable is used in a specification expression in a scope, it is added to the list of variables that must be instantiated when lowering the scope. When lowering a BLOCK, this caused instantiateVar to be called again on all the host block variables appearing in block variable specification expressions. This caused an extra declare to be emitted for dummy inside block (for non dummy, instantiateVar is a no-op if the symbol is already mapped). Only call instantiateVar if the symbol is not mapped when lowering BLOCK variables.	2023-10-17 09:11:53 +02:00
Pete Steinfeld	5db4779c3f	[flang] Regularize TODO messages for coarray related features (#69227 ) I want to make "not yet implemented" messages for features related to coarrays easy to identify and make them easy for users to read.	2023-10-16 12:37:57 -07:00
jeanPerier	4ccd57ddb1	[flang][nfc] replace fir.dispatch_table with more generic fir.type_info (#68309 ) The goal is to progressively propagate all the derived type info that is currently in the runtime type info globals into a FIR operation that can be easily queried and used by FIR/HLFIR passes. When this will be complete, the last step will be to stop generating the runtime info global in lowering, but to do that later in or just before codegen to keep the FIR files readable (on the added type-info.f90 tests, the lowered runtime info globals takes a whooping 2.6 millions characters on 1600 lines of the FIR textual output. The fir.type_info that contains all the info required to generate those globals for such "trivial" types takes 1721 characters on 9 lines). So far this patch simply starts by replacing the fir.dispatch_table operation by the fir.type_info operation and to add the noinit/ nofinal/nodestroy flags to it. These flags will soon be used in HLFIR to better rewrite hlfir.assign with derived types.	2023-10-06 09:29:57 +02:00
jeanPerier	43d2ef2856	[flang][lowering] propagate location info of macro expansions (#67446 ) Currently flang-new -g is failing when compiling code containing a call in a macro to a function defined in the same file. The verification added in https://reviews.llvm.org/D157447 is valid, flang lowering was failing to propagate location information in code from macro expansion because GetSourcePositionRange does not work with them (it fails to come with an end location), but we do not need a range for the MLIR location, only the start. Use GetSourcePosition instead that works with code from macro expansion. Note that the source location is the one of the statement where the macro appeared, if needed some FusedLocation could be later built to keep a link to the macro location in the debug info.	2023-09-26 20:33:01 +02:00
jeanPerier	2cb31fe8ea	[flang] Centralize automatic deallocation code in lowering (#67003 ) There are currently several places that automatically deallocate allocatble if they are allocated: - INTENT(OUT) allocatable are deallocated on entry in the callee - INTENT(OUT) allocatable are also deallocated on the caller side of BIND(C) function in case the implementation is in C. - Results of function returning allocatable are deallocated after usage. - OPENMP privatized allocatable are deallocated at the end of OPENMP region. Introduce genDeallocateIfAllocated that centralize all this code, except for the function return that use genFreememIfAllocated since finalization is done separately currently. `fir:🏭:genFinalization` and `fir:🏭:genInlinedDeallocation` are removed and replaced by genFreemem since their name were misleading: finalization was not called. There is a fallout in the tests because previous generated code did not check the allocated status when doing inline deallocation. This was OK since free(null) is guaranteed to be a no-op, but this makes compiler code more complex, is a bit surprising in the generated IR IMHO, and it relied on knowing when genDeallocateBox inserts runtime calls or uses inlined code.	2023-09-21 18:38:23 +02:00
Leandro Lupori	3dbb055f54	[flang] Generate valid IR on GOTO DO body (#66084 ) Flang was generating invalid IR when there was a GOTO to the body of a DO loop. This happened because the value of step, computed at the beginning of the loop, was being reused at the end of the loop, that, for unstructured loops, is in another basic block. Because of this, a GOTO could skip the beginning of the loop, that defined step, and yet try to use it at the end of the loop, which is invalid. Instead of reusing the step value, it can be recomputed if it is a constant, or stored and loaded to/from a temporary variable, for non-constant step expressions. Note that, while this change prevents the generation of invalid IR on the presence of jumps to DO loop bodies, what happens if the program reaches the end of a DO loop without ever passing through its beginning is undefined behavior, as some control variables, such as trip, will be uninitialized. It doesn't seem worth the effort and overhead to ensure this legacy extension will behave correctly in this case. This is consistent with at least gfortran, that doesn't behave correctly if step is not equal to one. Fixes: https://github.com/llvm/llvm-project/issues/65036	2023-09-21 15:59:35 +02:00
Andrew Gozillon	8fde6f41a0	[Flang][OpenMP] Add lowering from PFT to new MapEntry and Bounds operations and tie them to relevant Target operations This patch builds on top of a prior patch in review which adds a new map and bounds operation by modifying the OpenMP PFT lowering to support these operations and generate them from the PFT. A significant amount of the support for the Bounds operation is borrowed from OpenACC's own current implementation and lowering, just ported over to OpenMP. The patch also adds very preliminary/initial support for lowering to a new Capture attribute, which is stored on the new Map Operation, which helps the later lowering from OpenMP -> LLVM IR by indicating how a map argument should be handled. This capture type will influence how a map argument is accessed on device and passed by the host (different load/store handling etc.). It is reflective of a similar piece of information stored in the Clang AST which performs a similar role. As well as some minor adjustments to how the map type (map bitshift which dictates to the runtime how it should handle an argument) is generated to further support more use-cases for future patches that build on this work. Finally it adds the map entry operation creation and tying it to the relevant target operations as well as the addition of some new tests and alteration of previous tests to support the new changes. Depends on D158732 reviewers: kiranchandramohan, TIFitis, clementval, razvanlupusoru Differential Revision: https://reviews.llvm.org/D158734	2023-09-19 08:26:46 -05:00
Slava Zakharin	47025af639	[flang][hlfir] Alias analysis for host associated accesses. (#65919 ) This patch adds `host_assoc` attribute for operations that implement FortranVariableInterface (e.g. `hlfir.declare`). The attribute is used by the alias analysis to make better conclusions about memory overlap. For example, a dummy argument of an inner subroutine and a host's variable used inside the inner subroutine cannot refer to the same object (if the dummy argument does not satisify exceptions in F2018 15.5.2.13). This closes a performance gap between HLFIR optimization pipeline and FIR ArrayValueCopy for Polyhedron/nf.	2023-09-18 09:59:06 -07:00
jeanPerier	99a54b839a	[flang] Lower PRIVATE component names safely (#66076 ) It is possible for a derived type extending a type with private components to define components with the same name as the private components. This was not properly handled by lowering where several fir.record type component names could end-up being the same, leading to bad generated code (only the first component was accessed via fir.field_index, leading to bad generated code). This patch handles the situation by adding the derived type mangled name to private component.	2023-09-18 14:59:56 +02:00
Sergio Afonso	29aa749087	[OpenMP][Flang][MLIR] Lowering of OpenMP requires directive from parse tree to MLIR This patch implements the lowering of the OpenMP 'requires' directive from Flang parse tree to MLIR attributes attached to the top-level module. Target-related 'requires' clauses are gathered and combined for each top-level unit during semantics. Lastly, a single module-level `omp.requires` attribute is attached to the MLIR module with that information at the end of the process. The `atomic_default_mem_order` clause is not addressed by this patch, but rather it will come as a separate patch and follow a different approach. Depends on D147214, D150328, D150329 and D157983. Differential Revision: https://reviews.llvm.org/D147218	2023-09-14 10:34:54 +01:00
Razvan Lupusoru	e070ea47a9	[flang][openacc] Enable lowering support for OpenACC atomic operations (#65776 ) Since the OpenACC atomics specification is a subset of OpenMP atomics, the same lowering implementation can be used. This change extracts out the necessary pieces from the OpenMP lowering and puts them in a shared spot. The shared spot is a header file so that each implementation can template specialize directly. After putting the OpenMP implementation in a common spot, the following changes were needed to make it work for OpenACC: * Ensure parsing works correctly by avoiding hardcoded offsets. * Templatize based on atomic type. * The checking whether it is OpenMP or OpenACC is done by checking for OmpAtomicClauseList (OpenACC does not implement this so we just templatize with void). It was preferable to check this instead of atomic type because in some cases, like atomic capture, the read/write/update implementations are called - and we want compile time evaluation of these conditional parts. * The memory order and hint are used only for OpenMP. * Generate acc dialect operations instead of omp dialect operations.	2023-09-11 13:58:10 -07:00
jeanPerier	6ffea74f7c	[flang] Use BIND name, if any, when consolidating common blocks (#65613 ) This patch changes how common blocks are aggregated and named in lowering in order to: * fix one obvious issue where BIND(C) and non BIND(C) with the same Fortran name were "merged" * go further and deal with a derivative where the BIND(C) C name matches the assembly name of a Fortran common block. This is a bit unspecified IMHO, but gfortran, ifort, and nvfortran "merge" the common block without complaints as a linker would have done. This required getting rid of all the common block mangling early in FIR (\_QC) instead of leaving that to the phase that emits LLVM from FIR because BIND(C) common blocks did not have mangled names. Care has to be taken to deal with the underscoring option of flang-new. See added flang/test/Lower/HLFIR/common-block-bindc-conflicts.f90 for an illustration.	2023-09-08 10:43:55 +02:00
Valentin Clement (バレンタインクレメン)	20f4a5a313	[flang][openacc][NFC] Clean up lowering api (#65678 ) Remove unused argument `pft::Evaluation` from higher level lowering API.	2023-09-07 14:54:38 -07:00
Slava Zakharin	f8843efbb2	[flang][hlfir] Lower Cray pointee references. (#65563 ) A Cray pointee reference must be done using the characteristics (bounds, type params) of the original pointee declaration, but using the actual address value of the associated Cray pointer. There might be multiple Cray pointees associated with the same Cray pointer. The proposed solution is to lower each Cray pointee into a POINTER variable with a descriptor. The descriptor is initialized at the point of declaration of the pointee, though its base_addr is set to null. Before each reference of the Cray pointee its descriptor's base_addr is updated to the current value of the Cray pointer. The update of the base_addr is done using PointerAssociateScalar runtime call, which just updates the base_addr of the descriptor. This is a temporary solution just to make Cray pointers work to the same extent they work with FIR lowering.	2023-09-07 11:41:22 -07:00
jeanPerier	d26c78b2ad	[flang] handle indirect module variable use in internal procedure (#65324 ) When a module variable is referenced inside an internal procedure, but the use statement for the module is inside the host, semantics may not create any symbols with HostAssocDetails directly under the internal procedure scope. So pft::getScopeVariableList, that is called in the bridge when lowering the internal procedure scope, failed to instantiate the module variables. This lead to "symbol is not mapped to any IR value" compile time errors. This patch fixes the issue by adding the variables to the list of "captured" global variables from the host program, so that they are instantiated as part of the `internalProcedureBindings` in the bridge. The rational of doing it that way instead of changing `getScopeVariableList` is that `getScopeVariableList` would have to import all the module variables used inside the host since it cannot know which ones are referenced inside the internal procedure from the semantics::Scope information. The fix in this patch only instantiates the module variables from the host that are actually referenced inside the internal procedure.	2023-09-06 09:07:45 +02:00
Slava Zakharin	de8939ffca	[flang] Reset lbounds for allocatable function results. (#65286 ) With HLFIR the lbounds for the ALLOCATABLE result are taken from the mutable box created for the result, so the non-default lbounds might be propagated further causing incorrect result, e.g.: ``` program p real, allocatable :: p5(:) allocate(p5, source=real_init()) print *, lbound(p5, 1) ! must print 1, but prints 7 contains function real_init() real, allocatable :: real_init(:) allocate(real_init(7:8)) end function real_init end program p ``` With FIR lowering the box passed for `source` has explicit lower bound 1 at the call site, but the runtime box initialized by `real_init` call still has lower bound 7. I am not sure if the runtime box initialized by `real_init` will ever be accessed in a debugger via Fortran variable names, but I think that having the right runtime bounds that can be accessible via examining registers/stack might be good in general. So I decided to update the runtime bounds at the point of return. This change fixes the test above for HLFIR. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D156187	2023-09-05 10:26:16 -07:00
Kiran Chandramohan	90f58eb37b	[Flang][OpenMP] Fix loop index privatisation in Parallel region in HLFIR HLFIR lowering always adds hlfir.declare when symbols are bound to their address allocated on the stack. Ensure that the declare is placed along with the alloca if it is hoisted. And always return the mlir value that is bound to the symbol (i.e the alloca in FIR lowering and the declare in HLFIR lowering). Context: Loop index variables in OpenMP parallel regions should be privatised to work correctly. Reviewed By: tblah Differential Revision: https://reviews.llvm.org/D158594	2023-09-01 10:59:14 +00:00
Peter Klausler	031b4e5e79	[flang] Support SELECT RANK on allocatables & pointers Unlike other executable constructs with associating selectors, the selector of a SELECT RANK construct can have the ALLOCATABLE or POINTER attribute, and will work as an allocatable or object pointer within each rank case, so long as there is no RANK(*) case. Getting this right exposed a correctness risk with the popular predicate IsAllocatableOrPointer() -- it will be true for procedure pointers as well as object pointers, and in many contexts, a procedure pointer should not be acceptable. So this patch adds the new predicate IsAllocatableOrObjectPointer(), and updates some call sites of the original function to use the new one. Differential Revision: https://reviews.llvm.org/D159043	2023-08-29 14:56:15 -07:00
Kazu Hirata	a678ed41d2	[flang] Use DenseMap::lookup (NFC)	2023-08-27 08:26:48 -07:00
Kiran Chandramohan	8b834caa62	[Flang][OpenMP] Fix HLFIR lowering for commonblock threadprivate Commonblock names are not variables, but they can be marked as threadprivate in OpenMP. This requires the commonblock name to be bound to the address of the Commonblock. hlfir.declares are not required for these, but we should be able to retrieve the mlir Value corresponding to the Commonblock. This patch enables this by special casing the Commonblocks like procedures. Reviewed By: tblah, vzakhari Differential Revision: https://reviews.llvm.org/D158070	2023-08-23 11:37:17 +00:00
Mark Danial	a1c736ec08	[Flang] Cray pointer Lowering This patch is to add cray pointer (aka integer pointer) support to flang. Syntax and semantic checking were already available in flang. Cray pointers reference (https://gcc.gnu.org/onlinedocs/gfortran/Cray-pointers.html) In order to implement the feature we create the following sequence for a simple scalar load and store: ``` integer pte, i pointer(ptr, pte) i = pte ``` ``` %1 = fir.alloca i32 {bindc_name = "i", uniq_name = "_QFEi"} %2 = fir.alloca i32 {bindc_name = "pte", uniq_name = "_QFEpte"} %3 = fir.alloca i64 {bindc_name = "ptr", uniq_name = "_QFEptr"} ... %7 = fir.embox %3 : (!fir.ref<i64>) -> !fir.box<i64> %8 = fir.box_addr %7 : (!fir.box<i64>) -> !fir.ref<i64> %9 = fir.convert %8 : (!fir.ref<i64>) -> !fir.ref<!fir.ptr<i32>> %10 = fir.load %9 : !fir.ref<!fir.ptr<i32>> %11 = fir.load %10 : !fir.ptr<i32> fir.store %11 to %1 : !fir.ref<i32> ``` ``` integer pte, i pointer(ptr, pte) pte = i ``` ``` %1 = fir.alloca i32 {bindc_name = "i", uniq_name = "_QFEi"} %2 = fir.alloca i32 {bindc_name = "pte", uniq_name = "_QFEpte"} %3 = fir.alloca i64 {bindc_name = "ptr", uniq_name = "_QFEptr"} %7 = fir.load %1 : !fir.ref<i32> %8 = fir.embox %3 : (!fir.ref<i64>) -> !fir.box<i64> %9 = fir.box_addr %8 : (!fir.box<i64>) -> !fir.ref<i64> %10 = fir.convert %9 : (!fir.ref<i64>) -> !fir.ref<!fir.ptr<i32>> %11 = fir.load %10 : !fir.ref<!fir.ptr<i32>> fir.store %7 to %11 : !fir.ptr<i32> ``` The sequence is very similar for array element cases with the addition of fir.coordinate_of for the specific element. The whole array case is slightly different but uses the same sequence before the fir.array_load and fir.array_merge_store. Reviewed By: kkwli0 Differential Revision: https://reviews.llvm.org/D151478	2023-08-22 12:10:49 -04:00
Valentin Clement	4d04baeca5	[flang][openacc] Lower acc declare to the new acc.declare function Lower the acc delcare directive in function/subroutine to the newly introduced acc.declare operation. Only a single acc.declare operation is procduced in a function or subroutine so they don't end up nested. Depends on D158314 Reviewed By: razvanlupusoru Differential Revision: https://reviews.llvm.org/D158315	2023-08-21 12:39:02 -07:00
Valentin Clement	69a6bd5f05	[flang][openacc] Lower acc routine with function name The routine directive can appear in the specification part of a subroutine, function or module and therefore appear before the function or subroutine is lowered. We keep track of the created routine info attribute and attach them to the function at the end of the lowering if the directive appeared before the function was lowered. Reviewed By: razvanlupusoru Differential Revision: https://reviews.llvm.org/D158204	2023-08-17 14:25:26 -07:00
V Donaldson	335b3990ef	[flang] Do concurrent locality specifiers	2023-08-08 10:09:38 -07:00
Valentin Clement	14741ef88f	[flang][openacc] Lower the exit part for OpenACC declare in function/subroutine This patch adds lowering for the exit part of the OpenACC declare construct in function/subroutine. Depends on D156560 Reviewed By: razvanlupusoru Differential Revision: https://reviews.llvm.org/D156568	2023-08-01 14:10:33 -07:00
Nimish Mishra	f752265231	[flang][OpenMP] Support for privatization in common block This patch provides support for usage of common block in private/firstprivate and lastprivate clauses. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D156120	2023-07-31 16:46:18 +05:30
Peixin Qiao	b4c54b2027	[flang][OpenMP] Support common block in OpenMP private clause This supports the common block in OpenMP privat clause by making each common block member host-associated privatization and adds the test case. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D127215	2023-07-31 16:24:12 +05:30
Valentin Clement	c217ff8794	[flang][openacc] Add basic lowering for OpenACC declare construct in module This patch adds the skeleton and the basic lowering for OpenACC declare construct when located in the module declaration. This patch just lower the create clause with or without modifier. Other clause and global descrutor lowering will come in follow up patches to keep this one small enough for review. Reviewed By: razvanlupusoru Differential Revision: https://reviews.llvm.org/D156266	2023-07-26 09:56:19 -07:00
Andrew Gozillon	e909a2c1ca	[Flang][OpenMP][Lower] Program level implicit SAVE variable handling for declare target This is an attempt at mimicing the method in which threadprivate handles the following type of variables: program main integer :: i !$omp declare target to(i) end Which essentially generates a GlobalOp for the variable (which would normally only be an alloca) when it's instantiated. The main difference is there is no operation generated within the function, instead the declare target attribute is appended later within handleDeclareTarget. Reviewers: kiranchandramohan Differential Revision: https://reviews.llvm.org/D152037	2023-07-13 12:07:21 -05:00
Jan Sjodin	45a9604417	[Flang][OpenMP][MLIR] Add early outlining pass for omp.target operations to flang This patch implements an early outlining transform of omp.target operations in flang. The pass is needed because optimizations may cross target op region boundaries, but with the outlining the resulting functions only contain a single omp.target op plus a func.return, so there should not be any opportunity to optimize across region boundaries. The patch also adds an interface to be able to store and retrieve the parent function name of the original target operation. This is needed to be able to create correct kernel function names when lowering to LLVM-IR. Reviewed By: kiranchandramohan, domada Differential Revision: https://reviews.llvm.org/D154879	2023-07-13 09:14:42 -04:00
Dmitriy Smirnov	bc4586da6e	[Flang][OpenMP] Lower allocatable or pointer in private clause This patch lowers allocatables and pointers named in "private" OpenMP clause. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D148570	2023-07-03 16:46:02 +00:00
Ethan Luis McDonough	9bf5093623	[flang][openmp] Parallel reduction FIR lowering This patch extends the logic for lowering loop construct reductions to parallel block reductions. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D154182	2023-06-30 15:36:27 -05:00
V Donaldson	09ea692d16	[flang] IEEE_ARITHMETIC intrinsic module procedures Implement - IEEE_CLASS - IEEE_COPY_SIGN - IEEE_GET_ROUNDING_MODE - IEEE_IS_FINITE - IEEE_IS_NAN - IEEE_IS_NEGATIVE - IEEE_IS_NORMAL - IEEE_SET_ROUNDING_MODE - IEEE_SIGNBIT - IEEE_SUPPORT_ROUNDING - IEEE_UNORDERED - IEEE_VALUE for all REAL kinds (2, 3, 4, 8, 10, 16) where applicable.	2023-06-29 16:46:22 -07:00
Slava Zakharin	7b4aa95d7c	[flang][hlfir] Set/propagate 'unordered' attribute for elementals. This patch adds 'unordered' attribute handling the HLFIR elementals' builders and fixes the attribute handling in lowering and transformations. Depends on D154031, D154032 Reviewed By: jeanPerier, tblah Differential Revision: https://reviews.llvm.org/D154035	2023-06-29 11:16:38 -07:00
Peter Klausler	e12ffe6a93	[flang] Honor #line and related preprocessing directives Extend the SourceFile class to take account of #line directives when computing source file positions for error messages. Adjust the output of #line directives to -E output so that they reflect any #line directives that were in the input. Differential Revision: https://reviews.llvm.org/D153910	2023-06-29 08:27:37 -07:00
Jean Perier	23fbe525ce	[flang] do not merge block after lowering Lowering relies on dead code generation / unreachable block deletion to delete some code that is potentially invalid. However, calling mlir::simplifyRegion also merges block, which may promote SSA values to block arguments. Not all FIR types are intended to be block arguments. The added test shows an example where block merging led to fir.shape<> being block arguments (and a failure later in codegen). Reviewed By: tblah, clementval, vdonaldson Differential Revision: https://reviews.llvm.org/D153858	2023-06-28 08:28:07 +02:00
Jean Perier	6716923332	[flang][hlfir] Lower user defined assignment Lower user defined assignment inside the hlfir.region_assign "userDefinedAssignment" mlir region. This is done by adding an entry point to ConvertCall.h in order to call genUserCall with the region block arguments as arguments. The codegen for hlfir.region_assign with user defined assignment will be added in a later patch. Differential Revision: https://reviews.llvm.org/D153404	2023-06-26 13:06:59 +02:00
Tom Eccles	569716fc5c	[flang][hlfir] Fix multiple return declaration type When the ENTRY statement is used, the same source can return different types depending on the entry point. These different return values are storage associated (share the same storage). Previously, this led to the declaration of the results to all have the largest type. This patch adds a convert between the stack allocation and the declaration so that the hlfir.decl gets the right type. I haven't managed to generate code where this convert converted a reference to an allocation for a smaller type into an allocation for a larger one, but I have added an assert just in case. This is a different solution to https://reviews.llvm.org/D152725, see discussion there. Differential Revision: https://reviews.llvm.org/D152931	2023-06-19 09:09:01 +00:00
Dhruv Chawla	6e3a872047	[SetVector] Improve performance for small sizes SmallSetVector has an inefficiency where it does set insertions regardless of the number of elements present within it. This contrasts with other "Small-" containers where they use linear scan up to a certain size "N", after which they switch to another strategy. This patch implements this functionality in SetVector, adding a template parameter "N" which specifies the number of elements upto which the SetVector follows the "small" strategy. Due to the use of "if constexpr", there is no "small" code emitted when N is 0 which makes this a zero overhead change for users using the default behaviour. This change also allows having SmallSetVector use DenseSet instead of SmallDenseSet by default, which helps a little with performance. The reason for implementing this functionality in SetVector instead of SmallSetVector is that it allows reusing all the code that is already there and it is just augmented with the "isSmall" checks. This change gives a good speedup (0.4%): https://llvm-compile-time-tracker.com/compare.php?from=086601eac266ec253bf313c746390ff3e5656132&to=acd0a72a4d3ee840f7b455d1b35d82b11ffdb3c0&stat=instructions%3Au Differential Revision: https://reviews.llvm.org/D152497	2023-06-10 12:36:37 +05:30
Kiran Chandramohan	ca81808cc3	[Flang][OpenMP] Refactor to properly fix privatisation of loop bounds The OpenMP loop Operations have the bounds attached to them. If the loop bounds are privatised then the privatisation has to happen before the loop operation is created. To do this the privatisation is split into two steps. The first step performs cloning and firstprivate handling, the second step performs lastprivate handling. This also reverts the changes in the temporary fix (D127137). Fixes https://github.com/flang-compiler/f18-llvm-project/issues/1171#issuecomment-1143880545 Fixes https://github.com/flang-compiler/f18-llvm-project/issues/1171#issuecomment-1119997442 Fixes #60872 Reviewed By: NimishMishra Differential Revision: https://reviews.llvm.org/D151504	2023-06-05 16:04:24 +00:00
Peter Klausler	4ad7279392	[flang] CUDA Fortran - part 1/5: parsing Begin upstreaming of CUDA Fortran support in LLVM Flang. This first patch implements parsing for CUDA Fortran syntax, including: - a new LanguageFeature enum value for CUDA Fortran - driver change to enable that feature for .cuf and .CUF source files - parse tree representation of CUDA Fortran syntax - dumping and unparsing of the parse tree - the actual parsers for CUDA Fortran syntax - prescanning support for !@CUF and !$CUF - basic sanity testing via unparsing and parse tree dumps ... along with any minimized changes elsewhere to make these work, mostly no-op cases in common::visitors instances in semantics and lowering to allow them to compile in the face of new types in variant<> instances in the parse tree. Because CUDA Fortran allows the kernel launch chevron syntax ("call foo<<<blocks, threads>>>()") only on CALL statements and not on function references, the parse tree nodes for CallStmt, FunctionReference, and their shared Call were rearranged a bit; this caused a fair amount of one-line changes in many files. More patches will follow that implement CUDA Fortran in the symbol table and name resolution, and then semantic checking. Differential Revision: https://reviews.llvm.org/D150159	2023-05-31 09:48:59 -07:00
Carlos Eduardo Seo	9ceb0a7bc0	Fix nested block constructs for SELECT CASE In some scenarios, a SELECT CASE could cause an error while lowering to FIR. This was caused by a spurious extra branch added after the end statement. Fixes #62726 Differential Revision: https://reviews.llvm.org/D151118	2023-05-25 16:52:32 +00:00
Kelvin Li	ef93417470	[flang] Support for PowerPC vector type The following PowerPC vector type syntax is added: VECTOR ( element-type-spec ) where element-type-sec is integer-type-spec, real-type-sec or unsigned-type-spec. Two opaque types (__VECTOR_PAIR and __VECTOR_QUAD) are also added. A finite set of functionalities are implemented in order to support the new types: 1. declare objects 2. declare function result 3. declare type dummy arguments 4. intrinsic assignment between the new type objects (e.g. v1=v2) 5. reference functions that return the new types Submit on behalf of @tislam @danielcchen Authors: @tislam @danielcchen Differential Revision: https://reviews.llvm.org/D150876	2023-05-24 13:10:56 -04:00
V Donaldson	6f7a3b0781	[flang] Non-type-bound defined IO lowering Generate supporting data structures and calls to new runtime IO functions for defined IO that accesses non-type-bound procedures, such as `wft` in: module m1 type t integer n end type interface write(formatted) module procedure wft end interface contains subroutine wft(dtv, unit, iotype, v_list, iostat, iomsg) class(t), intent(in) :: dtv integer, intent(in) :: unit character(), intent(in) :: iotype integer, intent(in) :: v_list(:) integer, intent(out) :: iostat character(), intent(inout) :: iomsg iostat = 0 write(unit,,iostat=iostat,iomsg=iomsg) 'wft was called: ', dtv%n end subroutine end module module m2 contains subroutine test1 use m1 print , 'test1, should call wft: ', t(1) end subroutine subroutine test2 use m1, only: t print , 'test2, should not call wft: ', t(2) end subroutine end module use m1 use m2 call test1 call test2 print , 'main, should call wft: ', t(3) end	2023-05-17 09:22:13 -07:00
Slava Zakharin	4eab303404	[flang][hlfir] Fixed symbol lookup for character returns. Symbols corresponding to entries returning character results must be mapped to EmboxCharOp, first, before we can map them to DeclareOp. The code may be reworked after HLFIR is enabled by default, but right now it seems like an acceptable solution to me. Differential Revision: https://reviews.llvm.org/D150749	2023-05-17 08:59:33 -07:00
Peter Klausler	7f7bbc7317	[flang] Correct overriding (or not) of inaccessible bindings Fortran doesn't allow inaccessible procedure bindings to be overridden, and this needs to apply to generic resolution. When resolving a type-bound generic procedure from another module, ensure only that the most extended override from its module is used if it is PRIVATE, not a later apparent override from another module. Differential Revision: https://reviews.llvm.org/D150721	2023-05-16 14:32:48 -07:00
Slava Zakharin	be5747e516	[flang] Fixed global name creation for literal constants. The global names were created using a hash based on the address of std::vector::data address. Since the memory may be reused by different std::vector's, this may cause non-equivalent constant expressions to map to the same name. This is what is happening in the modified flang/test/Lower/constant-literal-mangling.f90 test. I changed the name creation to use a map between the constant expressions and corresponding unique names. The uniquing is done using a name counter in FirConverter. The effect of this change is that the equivalent constant expressions are now mapped to the same global, and the naming is "stable" (i.e. it does not change from compilation to compilation). Though, the issue is not HLFIR specific it was affecting several tests when using HLFIR lowering. Differential Revision: https://reviews.llvm.org/D150380	2023-05-12 13:40:22 -07:00
Jean Perier	c7ff45a529	[flang][hlfir] Lower left-hand side vector subscripts to HLFIR This patch lowers assignments to vector subscripted designators into the newly added hlfir.elemental_addr and hlfir.region_assign. Note that the codegen of these operation to FIR is still TODO and will still emit a TODO message when trying to compile programs end to end. Differential Revision: https://reviews.llvm.org/D149962	2023-05-09 09:22:37 +02:00
Jean Perier	54c88fc9df	[flang][hlfir] Lower WHERE to HLFIR Lower WHERE to the newly added hlfir.where and hlfir.elsewhere operations. Differential Revision: https://reviews.llvm.org/D149950	2023-05-09 09:21:27 +02:00
Jean Perier	b87e65531c	[flang][hlfir] Lower forall to HLFIR Lower Forall to the previously added hlfir.forall, hlfir.forall_mask. hlfir.forall_index, and hlfir.region_assign operations. The HLFIR assignment code lowering is moved into genDataAssignment for more readability and so that user defined assignment (still a TODO), will be able to share most of the logic. Differential Revision: https://reviews.llvm.org/D149878	2023-05-09 09:20:23 +02:00

1 2 3 4 5

225 Commits