llvm-project

Author	SHA1	Message	Date
Abid Qadeer	62d0b712b7	[OMPIRBuilder] Avoid invalid debug location. (#153190 ) Fixes #153043. This is another case of debug location not getting updated when the insert point is changed by the `restoreIP`. Fixed by using the wrapper function that updates the debug location.	2025-08-12 16:20:52 +01:00
Amit Tiwari	2074e1320f	[Clang][OpenMP] Non-contiguous strided update (#144635 ) This patch handles the strided update in the `#pragma omp target update from(data[a🅱️c])` directive where 'c' represents the strided access leading to non-contiguous update in the `data` array when the offloaded execution returns the control back to host from device using the `from` clause. Issue: Clang CodeGen where info is generated for the particular `MapType` (to, from, etc), it was failing to detect the strided access. Because of this, the `MapType` bits were incorrect when passed to runtime. This led to incorrect execution (contiguous) in the libomptarget runtime code. Added a minimal testcase that verifies the working of the patch.	2025-08-12 19:32:15 +05:30
Matheus Izvekov	91cdd35008	[clang] Improve nested name specifier AST representation (#147835 ) This is a major change on how we represent nested name qualifications in the AST. * The nested name specifier itself and how it's stored is changed. The prefixes for types are handled within the type hierarchy, which makes canonicalization for them super cheap, no memory allocation required. Also translating a type into nested name specifier form becomes a no-op. An identifier is stored as a DependentNameType. The nested name specifier gains a lightweight handle class, to be used instead of passing around pointers, which is similar to what is implemented for TemplateName. There is still one free bit available, and this handle can be used within a PointerUnion and PointerIntPair, which should keep bit-packing aficionados happy. * The ElaboratedType node is removed, all type nodes in which it could previously apply to can now store the elaborated keyword and name qualifier, tail allocating when present. * TagTypes can now point to the exact declaration found when producing these, as opposed to the previous situation of there only existing one TagType per entity. This increases the amount of type sugar retained, and can have several applications, for example in tracking module ownership, and other tools which care about source file origins, such as IWYU. These TagTypes are lazily allocated, in order to limit the increase in AST size. This patch offers a great performance benefit. It greatly improves compilation time for [stdexec](https://github.com/NVIDIA/stdexec). For one datapoint, for `test_on2.cpp` in that project, which is the slowest compiling test, this patch improves `-c` compilation time by about 7.2%, with the `-fsyntax-only` improvement being at ~12%. This has great results on compile-time-tracker as well: ![image](https://github.com/user-attachments/assets/700dce98-2cab-4aa8-97d1-b038c0bee831) This patch also further enables other optimziations in the future, and will reduce the performance impact of template specialization resugaring when that lands. It has some other miscelaneous drive-by fixes. About the review: Yes the patch is huge, sorry about that. Part of the reason is that I started by the nested name specifier part, before the ElaboratedType part, but that had a huge performance downside, as ElaboratedType is a big performance hog. I didn't have the steam to go back and change the patch after the fact. There is also a lot of internal API changes, and it made sense to remove ElaboratedType in one go, versus removing it from one type at a time, as that would present much more churn to the users. Also, the nested name specifier having a different API avoids missing changes related to how prefixes work now, which could make existing code compile but not work. How to review: The important changes are all in `clang/include/clang/AST` and `clang/lib/AST`, with also important changes in `clang/lib/Sema/TreeTransform.h`. The rest and bulk of the changes are mostly consequences of the changes in API. PS: TagType::getDecl is renamed to `getOriginalDecl` in this patch, just for easier to rebasing. I plan to rename it back after this lands. Fixes #136624 Fixes https://github.com/llvm/llvm-project/issues/43179 Fixes https://github.com/llvm/llvm-project/issues/68670 Fixes https://github.com/llvm/llvm-project/issues/92757	2025-08-09 05:06:53 -03:00
Drew Kersnar	90e8c8e718	[InferAlignment] Propagate alignment between loads/stores of the same base pointer (#145733 ) We can derive and upgrade alignment for loads/stores using other well-aligned loads/stores. This optimization does a single forward pass through each basic block and uses loads/stores (the alignment and the offset) to derive the best possible alignment for a base pointer, caching the result. If it encounters another load/store based on that pointer, it tries to upgrade the alignment. The optimization must be a forward pass within a basic block because control flow and exception throwing can impact alignment guarantees. --------- Co-authored-by: Nikita Popov <github@npopov.com>	2025-08-08 12:05:29 -05:00
Nikita Popov	c23b4fbdbb	[IR] Remove size argument from lifetime intrinsics (#150248 ) Now that #149310 has restricted lifetime intrinsics to only work on allocas, we can also drop the explicit size argument. Instead, the size is implied by the alloca. This removes the ability to only mark a prefix of an alloca alive/dead. We never used that capability, so we should remove the need to handle that possibility everywhere (though many key places, including stack coloring, did not actually respect this).	2025-08-08 11:09:34 +02:00
Ritanya-B-Bharadwaj	1d594fdb8d	[Clang][OpenMP] Fixing Clang error for metadirective with multiple when clauses and no otherwise (#148583 ) Fixing - https://github.com/llvm/llvm-project/issues/147336	2025-08-05 20:02:44 +05:30
Julian Brown	889faabe78	[OpenMP] Don't emit redundant zero-sized mapping nodes for overlapped structs (#148947 ) The handling of overlapped structure mapping in CGOpenMPRuntime.cpp can lead to redundant zero-sized mapping nodes at runtime. This patch fixes it using a combination of approaches: trivially adjacent struct members won't have a mapping node created between them, and for more complicated cases (inheritance) the physical layout of the struct/class is used to make sure that elements aren't missed. I've introduced a new class to track the state whilst iterating over the struct. This reduces a bit of redundancy in the code (accumulating CombinedInfo both during and after the loop), which I think is a bit neater. Before: omptarget --> Entry 0: Base=0x00007fff8d483830, Begin=0x00007fff8d483830, Size=48, Type=0x20, Name=unknown omptarget --> Entry 1: Base=0x00007fff8d483830, Begin=0x00007fff8d483830, Size=0, Type=0x1000000000003, Name=unknown omptarget --> Entry 2: Base=0x00007fff8d483830, Begin=0x00007fff8d483834, Size=0, Type=0x1000000000003, Name=unknown omptarget --> Entry 3: Base=0x00007fff8d483830, Begin=0x00007fff8d483838, Size=0, Type=0x1000000000003, Name=unknown omptarget --> Entry 4: Base=0x00007fff8d483830, Begin=0x00007fff8d48383c, Size=20, Type=0x1000000000003, Name=unknown omptarget --> Entry 5: Base=0x00007fff8d483830, Begin=0x00007fff8d483854, Size=0, Type=0x1000000000003, Name=unknown omptarget --> Entry 6: Base=0x00007fff8d483830, Begin=0x00007fff8d483858, Size=0, Type=0x1000000000003, Name=unknown omptarget --> Entry 7: Base=0x00007fff8d483830, Begin=0x00007fff8d48385c, Size=4, Type=0x1000000000003, Name=unknown omptarget --> Entry 8: Base=0x00007fff8d483830, Begin=0x00007fff8d483830, Size=4, Type=0x1000000000003, Name=unknown omptarget --> Entry 9: Base=0x00007fff8d483830, Begin=0x00007fff8d483834, Size=4, Type=0x1000000000003, Name=unknown omptarget --> Entry 10: Base=0x00007fff8d483830, Begin=0x00007fff8d483838, Size=4, Type=0x1000000000003, Name=unknown omptarget --> Entry 11: Base=0x00007fff8d483840, Begin=0x00005e7665275130, Size=32, Type=0x1000000000013, Name=unknown omptarget --> Entry 12: Base=0x00007fff8d483830, Begin=0x00007fff8d483850, Size=4, Type=0x1000000000003, Name=unknown omptarget --> Entry 13: Base=0x00007fff8d483830, Begin=0x00007fff8d483854, Size=4, Type=0x1000000000003, Name=unknown omptarget --> Entry 14: Base=0x00007fff8d483830, Begin=0x00007fff8d483858, Size=4, Type=0x1000000000003, Name=unknown After: omptarget --> Entry 0: Base=0x00007fffd0f562e0, Begin=0x00007fffd0f562e0, Size=48, Type=0x20, Name=unknown omptarget --> Entry 1: Base=0x00007fffd0f562e0, Begin=0x00007fffd0f562ec, Size=20, Type=0x1000000000003, Name=unknown omptarget --> Entry 2: Base=0x00007fffd0f562e0, Begin=0x00007fffd0f5630c, Size=4, Type=0x1000000000003, Name=unknown omptarget --> Entry 3: Base=0x00007fffd0f562e0, Begin=0x00007fffd0f562e0, Size=4, Type=0x1000000000003, Name=unknown omptarget --> Entry 4: Base=0x00007fffd0f562e0, Begin=0x00007fffd0f562e4, Size=4, Type=0x1000000000003, Name=unknown omptarget --> Entry 5: Base=0x00007fffd0f562e0, Begin=0x00007fffd0f562e8, Size=4, Type=0x1000000000003, Name=unknown omptarget --> Entry 6: Base=0x00007fffd0f562f0, Begin=0x000058b6013fb130, Size=32, Type=0x1000000000013, Name=unknown omptarget --> Entry 7: Base=0x00007fffd0f562e0, Begin=0x00007fffd0f56300, Size=4, Type=0x1000000000003, Name=unknown omptarget --> Entry 8: Base=0x00007fffd0f562e0, Begin=0x00007fffd0f56304, Size=4, Type=0x1000000000003, Name=unknown omptarget --> Entry 9: Base=0x00007fffd0f562e0, Begin=0x00007fffd0f56308, Size=4, Type=0x1000000000003, Name=unknown For code: #include <cstdlib> #include <cstdio> struct S { int x; int y; int z; int p1; int p2; }; struct T : public S { int a; int b; int c; }; int main() { T v; v.p1 = (int) calloc(8, sizeof(int)); v.p2 = (int) calloc(8, sizeof(int)); #pragma omp target map(tofrom: v, v.x, v.y, v.z, v.p1[:8], v.a, v.b, v.c) { v.x++; v.y += 2; v.z += 3; v.p1[0] += 4; v.a += 7; v.b += 5; v.c += 6; } return 0; }	2025-07-24 14:45:04 +01:00
David Pagan	45d99c26c3	[clang][OpenMP] In 6.0, can omit length in array section (#148048 ) In OpenMP 6.0 specification, section 5.2.5 Array Sections, page 166, lines 28-28: When the length is absent and the size of the dimension is not known, the array section is an assumed-size array. Testing - Updated LIT test - check-all - OpenMP_VV (formerly sollve) test case tests/6.0/target/test_target_assumed_array_size.c	2025-07-23 10:53:38 -07:00
Fazlay Rabbi	5daaaf8d7d	[OpenMP 6.0] Allow only byref arguments with `need_device_addr` modifier on `adjust_args` clause (#149586 ) If the need_device_addr adjust-op modifier is present, each list item that appears in the clause must refer to an argument in the declaration of the function variant that has a reference type. Reference: OpenMP 6.0 [Sec 9.6.2, page 332, line 31-33, adjust_args clause, Restrictions]	2025-07-23 09:38:13 -07:00
Antonio Frighetto	9e0c06d708	[clang][CodeGen] Set `dead_on_return` when passing arguments indirectly Let Clang emit `dead_on_return` attribute on pointer arguments that are passed indirectly, namely, large aggregates that the ABI mandates be passed by value; thus, the parameter is destroyed within the callee. Writes to such arguments are not observable by the caller after the callee returns. This should desirably enable further MemCpyOpt/DSE optimizations. Previous discussion: https://discourse.llvm.org/t/rfc-add-dead-on-return-attribute/86871.	2025-07-18 11:50:18 +02:00
Robert Imschweiler	77861b3a8f	[OpenMP][clang] 6.0: parsing/sema for message/severity for parallel (#146093 ) Implement parsing and semantic analysis support for the message and severity clauses that have been added to the parallel directive in OpenMP 6.0, 12.1.	2025-07-11 11:22:06 +02:00
Urvi Rav	8055c0f380	[OpenMP-5.2] deprecate delimited form of 'declare target' (#145854 ) According to OpenMP 5.2 (Section 7.8.2), the directive name `declare target` may be used as a synonym for `begin declare target` only when no clauses are specified. This clause-less delimited form is now deprecated and should emit a deprecation warning. ``` // Deprecated usage (should trigger warning): #pragma omp declare target // deprecated in OpenMP 5.2 void foo1() { } #pragma omp end declare target // Valid usage with clause (should not trigger warning): #pragma omp declare target enter(foo2) void foo2() { } ``` ``` // Recommended replacement for deprecated delimited form: #pragma omp begin declare target void foo() { } #pragma omp end declare target ``` --------- Co-authored-by: urvi-rav <urvi.rav@hpe.com>	2025-07-10 13:55:20 +05:30
Abhinav Gaba	ae4a81e849	[NFC][OpenMP] Add tests for mapping pointers and their dereferences. (#146934 ) The output of the compile-and-run tests is incorrect. These will be used for reference in future commits that resolve the issues. Also updated the existing clang LIT test, target_map_both_pointer_pointee_codegen.cpp, with more constructs and fewer CHECKs (through more update_cc_test_checks filters).	2025-07-08 06:52:38 -04:00
Oleksandr T.	2e8e254d18	[Clang] include attribute scope in diagnostics (#144619 ) This patch updates diagnostics to print fully qualified attribute names, including scope when present.	2025-07-08 11:36:52 +03:00
Krzysztof Parzyszek	1a1a11f709	[clang][OpenMP] Issue a warning when parsing future directive spelling (#146933 ) OpenMP 6.0 introduced alternative spelling for some directives, with the previous spellings still allowed. Warn the user when a new spelling is encountered with OpenMP version set to an older value.	2025-07-07 08:47:53 -05:00
Krzysztof Parzyszek	d7b936b633	[OpenMP] Add directive spellings introduced in OpenMP 6.0 (#141772 ) For background information see https://discourse.llvm.org/t/rfc-alternative-spellings-of-openmp-directives/85507	2025-06-25 07:55:06 -05:00
Robert Imschweiler	f624ba2d9d	[OpenMP][clang] 6.0: parsing/sema for num_threads 'strict' modifier (#145490 ) Implement parsing and semantic analysis support for the optional 'strict' modifier of the num_threads clause. This modifier has been introduced in OpenMP 6.0, section 12.1.2. Note: this is basically 1:1 https://reviews.llvm.org/D138328.	2025-06-24 21:12:40 +02:00
Robert Imschweiler	ff295d2f34	[OpenMP][clang] declare mapper: fix handling of nested types (#143504 ) Fix a crash that happened during parsing of a "declare mapper" construct for a struct that contains an element for which we also declared a custom default mapper.	2025-06-14 16:17:08 +02:00
Aaron Ballman	9eef4d1c5f	Remove delayed typo expressions (#143423 ) This removes the delayed typo correction functionality from Clang (regular typo correction still remains) due to fragility of the solution. An RFC was posted here: https://discourse.llvm.org/t/rfc-removing-support-for-delayed-typo-correction/86631 and while that RFC was asking for folks to consider stepping up to be maintainers, and we did have a few new contributors show some interest, experiments show that it's likely worth it to remove this functionality entirely and focus efforts on improving regular typo correction. This removal fixes ~20 open issues (quite possibly more), improves compile time performance by roughly .3-.4% (https://llvm-compile-time-tracker.com/?config=Overview&stat=instructions%3Au&remote=AaronBallman&sortBy=date), and does not appear to regress diagnostic behavior in a way we wouldn't find acceptable. Fixes #142457 Fixes #139913 Fixes #138850 Fixes #137867 Fixes #137860 Fixes #107840 Fixes #93308 Fixes #69470 Fixes #59391 Fixes #58172 Fixes #46215 Fixes #45915 Fixes #45891 Fixes #44490 Fixes #36703 Fixes #32903 Fixes #23312 Fixes #69874	2025-06-13 06:45:40 -04:00
Fazlay Rabbi	02550da932	[OpenMP 60] Initial parsing/sema for `need_device_addr` modifier on `adjust_args` clause (#143442 ) Adds initial parsing and semantic analysis for `need_device_addr` modifier on `adjust_args` clause.	2025-06-11 22:06:11 -07:00
Abhinav Gaba	02b6849cf1	[Clang][OpenMP] Fix mapping of arrays of structs with members with mappers (#142511 ) This builds upon #101101 from @jyu2-git, which used compiler-generated mappers when mapping an array-section of structs with members that have user-defined default mappers. Now we do the same when mapping arrays of structs.	2025-06-11 19:03:55 +00:00
CHANDRA GHALE	afbcf9529a	[OpenMP 6.0 ]Codegen for Reduction over private variables with reduction clause (#134709 ) Codegen support for reduction over private variable with reduction clause. Section 7.6.10 in in OpenMP 6.0 spec. - An internal shared copy is initialized with an initializer value. - The shared copy is updated by combining its value with the values from the private copies created by the clause. - Once an encountering thread verifies that all updates are complete, its original list item is updated by merging its value with that of the shared copy and then broadcast to all threads. Sample Test Case from OpenMP 6.0 Example ``` #include <assert.h> #include <omp.h> #define N 10 void do_red(int n, int *v, int &sum_v) { sum_v = 0; // sum_v is private #pragma omp for reduction(original(private),+: sum_v) for (int i = 0; i < n; i++) { sum_v += v[i]; } } int main(void) { int v[N]; for (int i = 0; i < N; i++) v[i] = i; #pragma omp parallel num_threads(4) { int s_v; // s_v is private do_red(N, v, s_v); assert(s_v == 45); } return 0; } ``` Expected Codegen: ``` // A shared global/static variable is introduced for the reduction result. // This variable is initialized (e.g., using memset or a UDR initializer) // e.g., .omp.reduction.internal_private_var // Barrier before any thread performs combination call void @__kmpc_barrier(...) // Initialization block (executed by thread 0) // e.g., call void @llvm.memset.p0.i64(...) or call @udr_initializer(...) call void @__kmpc_critical(...) // Inside critical section: // Load the current value from the shared variable // Load the thread-local private variable's value // Perform the reduction operation // Store the result back to the shared variable call void @__kmpc_end_critical(...) // Barrier after all threads complete their combinations call void @__kmpc_barrier(...) // Broadcast phase: // Load the final result from the shared variable) // Store the final result to the original private variable in each thread // Final barrier after broadcast call void @__kmpc_barrier(...) ``` --------- Co-authored-by: Chandra Ghale <ghale@pe31.hpc.amslabs.hpecorp.net>	2025-06-11 14:01:31 +05:30
Joseph Huber	539a2ac5f2	[OpenMP] Fix atomic compare handling with overloaded operators (#141142 ) Summary: When there are overloaded C++ operators in the global namespace the AST node for these is not a `BinaryExpr` but a `CXXOperatorCallExpr`. Modify the uses to handle this case, basically just treating it as a binary expression with two arguments. Fixes https://github.com/llvm/llvm-project/issues/141085	2025-05-28 21:44:55 -05:00
Yingwei Zheng	79f0dccc91	[Clang][CodeGen] Add metadata for load from reference (#98746 ) This patch adds `!nonnull/!align` for load from reference to improve codegen.	2025-05-26 14:08:50 +08:00
Krzysztof Parzyszek	9273091502	[clang][OpenMP] Improve handling of non-C/C++ directives (#139961 ) The PR139793 added handling of the Fortran-only "workshare" directive, however there are more such directives, e.g. "allocators". Use the genDirectiveLanguages function to detect non-C/C++ directives instead of enumerating them.	2025-05-15 07:37:41 -05:00
Aaron Ballman	0eb8a92ce9	[OpenMP] Fix tentative parsing crash with metadirective (#139901 ) There were two crashes that have the same root cause: not correctly handling unexpected tokens. In one case, we were failing to return early which caused us to parse a paren as a regular token instead of a special token, causing an assertion. The other case was failing to commit or revert the tentative parse action when not getting a paren when one was expected. Fixes #139665	2025-05-14 10:19:24 -04:00
Krzysztof Parzyszek	3abd77ac15	[clang][OpenMP] Treat "workshare" as unknown OpenMP directive (#139793 ) The "workshare" construct is only present in Fortran. The common OpenMP code does treat it as any other directive, but in clang we need to reject it, and do so gracefully before it encounters an internal assertion. Fixes https://github.com/llvm/llvm-project/issues/139424	2025-05-14 07:06:11 -05:00
Aaron Ballman	131c8f84bb	[OpenMP] Fix crash with invalid size expression (#139745 ) We weren't correctly handling size expressions with errors before trying to get the type of the size expression. No release note needed because support for 'stripe' was added to the current release. Fixes #139433	2025-05-13 12:55:53 -04:00
Amr Hesham	a4186bd04b	[clang][OpenMP] Add error for large expr in collapse clause (#138592 ) Report error when OpenMP collapse clause has an expression that can't be represented in 64-bit Issue #138445	2025-05-12 21:34:35 +02:00
Aaron Ballman	70ca3f41fa	[OpenMP] Fix crash on invalid with cancel directive (#139577 ) If the next token after 'cancel' is a special token, we would trigger an assertion. We should be consuming any token, same as elsewhere in the function. Note, we could check for an unknown directive and do different error recovery, but that caused too many behavioral changes for other tests in the form of "unexpected tokens ignored" diagnostics that didn't seem like an improvement for the test cases. Fixes #139360	2025-05-12 13:44:03 -04:00
Johannes Doerfert	53fe3df0f6	[OpenMP] Allow begin/end declare variant in executable context (#139344 ) We are missing a few declerative directives in the parser for executable and declerative directives causing us to error out if they are inside of functions. This adds support for begin/end declare variant by reusing the logic we used in global scope.	2025-05-12 09:06:03 -07:00
Aaron Ballman	d5974524de	[OpenMP] Fix crash with invalid argument to simd collapse (#139313 ) Same as with other recent crash fixes, this is checking whether the argument expression contains errors or not. Fixes #138493	2025-05-12 08:04:22 -04:00
Johannes Doerfert	73165de4e6	[OpenMP] implementation set controls elision for begin declare variant (#139287 ) The device and implementation set should trigger elision of tokens if they do not match statically in a begin/end declare variant. This simply extends the logic from the device set only and includes the implementation set. Reported by @kkwli.	2025-05-09 16:32:39 -07:00
Aaron Ballman	ab6c4f5085	[OpenMP] Fix a crash on invalid with unroll partial (#139280 ) You cannot get the integer constant expression's value if the expression contains errors. Fixes https://github.com/llvm/llvm-project/issues/139267	2025-05-09 12:32:44 -04:00
Aaron Ballman	51ca3cbb2b	Correct typo in a test I made the change locally but didn't save the file before pushing. :-(	2025-05-09 11:25:41 -04:00
Aaron Ballman	b249b49c13	[OpenMP] Fix crash when diagnosing dist_schedule (#139277 ) We were failing to pass a required argument when emitting the diagnostic, so the source range was being used in place of an index. This caused a failed assertion due to the incorrect index. Fixes #139266	2025-05-09 11:12:19 -04:00
Aaron Ballman	187a83f86c	[OpenMP] No long crash on an invalid sizes argument (#139118 ) We were trying to get type information out of an expression node which contained errors. That causes the type of the expression to be dependent, which the code was not expecting. Now we handle error conditions with an early return. Fixes #139073	2025-05-09 06:52:06 -04:00
Oleksandr T.	61b435ec4d	[Clang] show attribute namespace in diagnostics (#138519 ) This patch enhances Clang's diagnosis of an unknown attribute by printing the attribute's namespace in the diagnostic text. e.g., ```cpp [[foo::nodiscard]] int f(); // warning: unknown attribute 'foo::nodiscard' ignored ```	2025-05-09 00:49:01 +03:00
Aaron Ballman	85c810060e	[C] Diagnose use of C++ keywords in C (#137234 ) This adds a new diagnostic group, -Wc++-keyword, which is off by default and grouped under -Wc++-compat. The diagnostic catches use of C++ keywords in C code. This change additionally fixes an issue with -Wreserved-identifier not diagnosing use of reserved identifiers in function parameter lists in a function declaration which is not a definition. Fixes https://github.com/llvm/llvm-project/issues/21898	2025-05-02 07:20:02 -04:00
Urvi Rav	35e743d4bf	default clause replaced by otherwise clause for metadirective in OpenMP 5.2 (#128640 ) This PR replaces the `default` clause with the `otherwise` clause for the `metadirective` in OpenMP. The `otherwise` clause serves as a fallback condition when no directive from the when clauses is selected. In the `when` clause, context selectors define traits evaluated to determine the directive to be applied. The previous merge was reverted due to a failing test case, which has now been resolved. --------- Co-authored-by: urvi-rav <urvi.rav@hpe.com>	2025-05-02 10:29:57 +05:30
Abid Qadeer	04aa5a88d1	[OMPIRBuilder] Don't discard the debug record from entry block. (#135161 ) When we get a function back from `CodeExtractor`, we discard its entry block after coping its instructions into the entry block we prepared. While copying the instructions, the terminator is discarded for obvious reasons. But if there were some debug values attached to the terminator, those are useful and needs to be copied.	2025-04-30 10:25:35 +01:00
Ernesto Su	89f930a7de	[clang][OpenMP] Fix/enforce order-concurrent-nestable rules (#135463 ) OpenMP has restrictions on directives allowed to be strictly nested inside a construct with the order(concurrent) clause specified. - OpenMP 5.0, 5.1, and 5.2 allows: 'loop', 'parallel', 'simd', and combined directives starting with 'parallel'. - OpenMP 6.0 allows: the above directives plus 'atomic' and all loop-transformation directives. Furthermore, a region that corresponds to a construct with order(concurrent) specified may not contain calls to the OpenMP runtime API. This PR fixes the following issues in the current implementation: With -fopenmp-version=50: none of the nesting restrictions above were enforced With -fopenmp-version=60: 1. Clang did not reject OpenMP runtime APIs encountered in the region. 2. Clang erroneously rejected combined directives starting with parallel. --------- Co-authored-by: Zahira Ammarguellat <zahira.ammarguellat@intel.com>	2025-04-18 13:57:50 -04:00
Matheus Izvekov	fceb9cecdf	[clang] consistently quote expressions in diagnostics (#134769 )	2025-04-15 04:18:23 -03:00
Nick Sarnie	008040482b	[clang] Add SPIR-V to some OpenMP clang tests (#133503 ) Just to get some more coverage. Some of the behavior might be weird and change in the future, but let's lock down what happens today to at least prevent regressions. Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>	2025-04-03 14:36:46 +00:00
Joseph Huber	772173f548	[Clang][AMDGPU] Remove special handling for COV4 libraries (#132870 ) Summary: When we were first porting to COV5, this lead to some ABI issues due to a change in how we looked up the work group size. Bitcode libraries relied on the builtins to emit code, but this was changed between versions. This prevented the bitcode libraries, like OpenMP or libc, from being used for both COV4 and COV5. The solution was to have this 'none' functionality which effectively emitted code that branched off of a global to resolve to either version. This isn't a great solution because it forced every TU to have this variable in it. The patch in https://github.com/llvm/llvm-project/pull/131033 removed support for COV4 from OpenMP, which was the only consumer of this functionality. Other users like HIP and OpenCL did not use this because they linked the ROCm Device Library directly which has its own handling (The name was borrowed from it after all). So, now that we don't need to worry about backward compatibility with COV4, we can remove this special handling. Users can still emit COV4 code, this simply removes the special handling used to make the OpenMP device runtime bitcode version agnostic.	2025-03-28 07:35:16 -05:00
Shilei Tian	f1ac2afe21	Reapply "[AMDGPU] Use COV6 by default (#118515 )" (#130963 ) This reverts commit 68bcba6d7a1cc18996c0bcb7c62267c62d2040d0.	2025-03-21 15:26:45 -04:00
CHANDRA GHALE	6da8f56619	[OpenMP 6.0] Parse/Sema support for reduction over private variable with reduction clause. (#129938 ) Initial Parse/Sema support for reduction over private variable with reduction clause. Section 7.6.10 in in OpenMP 6.0 spec. - list item in a reduction clause can now be private in the enclosing context. - Added support for _original-sharing-modifier_ with reduction clause. --------- Co-authored-by: Chandra Ghale <ghale@pe31.hpc.amslabs.hpecorp.net>	2025-03-21 14:19:08 +05:30
Ritanya-B-Bharadwaj	63635c1746	[clang] [OpenMP] New OpenMP 6.0 self_maps clause (#129888 ) Initial parsing/sema support for self maps in map and requirement clause [Sections 7.9.6 and 10.5.1.6 in OpenMP 6.0 spec]	2025-03-11 16:31:42 +05:30
Tom Eccles	f7daa9d302	[mlir][OpenMP] fix crash outlining infinite loop (#129872 ) Previously an extra block was created by splitting the previous exit block. This produced incorrect results when the outlined region statically never terminated because then there wouldn't be a valid exit block for the outlined region, this caused this newly added block to have an incoming edge from outside of the outlining region, which caused outlining to fail. So far as I can tell this extra block no longer serves any purpose. The comment says it is supposed to collate multiple control flow edges into one place, but the code as it is now does not achieve this. In fact, as can be seen from the changes to lit tests, this block was not actually outlined in the end. This is because there are actually two code extractors: one in the callback for creating a parallel op which is used to find what the input/output variables are (which does have this block added to it), and another one which actually does the outlining (which this block was not added to). Tested with the gfortran and fujitsu test suites. Fixes #112884	2025-03-07 11:02:52 +00:00
Zahira Ammarguellat	26fc3aa983	[OpenMP] Missing implicit otherwise clause in metadirective. (#127113 ) Compiling this: `int main() {` ` #pragma omp metadirective when(use r= {condition(0)}` `: parallel for)` `for (int i=0; i<10; i++)` ; }` is generating an error: `error: expected expression` The compiler is interpreting this as if it's compiling a `#pragma omp metadirective` with no `otherwise` clause. In the OMP5.2 specs chapter 7.4 it's mentioned that: `If no otherwise clause is specified the effect is as if one was specified without an associated directive variant.` This patch fixes the issue.	2025-02-28 08:02:35 -05:00

1 2 3 4 5 ...

2446 Commits