llvm-project

Author	SHA1	Message	Date
Matt Arsenault	3e5d8a1439	Reapply "RuntimeLibcalls: Generate table of libcall name lengths (#153… (#153864 ) This reverts commit 334e9bf2dd01fbbfe785624c0de477b725cde6f2. Check if llvm-nm exists before building the benchmark.	2025-08-16 09:53:50 +09:00
gulfemsavrun	334e9bf2dd	Revert "RuntimeLibcalls: Generate table of libcall name lengths (#153… (#153864 ) …210)" This reverts commit 9a14b1d254a43dc0d4445c3ffa3d393bca007ba3. Revert "RuntimeLibcalls: Return StringRef for libcall names (#153209)" This reverts commit cb1228fbd535b8f9fe78505a15292b0ba23b17de. Revert "TableGen: Emit statically generated hash table for runtime libcalls (#150192)" This reverts commit 769a9058c8d04fc920994f6a5bbb03c8a4fbcd05. Reverted three changes because of a CMake error while building llvm-nm as reported in the following PR: https://github.com/llvm/llvm-project/pull/150192#issuecomment-3192223073	2025-08-15 13:32:27 -07:00
Matt Arsenault	9a14b1d254	RuntimeLibcalls: Generate table of libcall name lengths (#153210 ) Avoids strlen when constructing the returned StringRef. We were emitting these in the libcall name lookup anyway, so split out the offsets for general use. Currently emitted as a separate table, not sure if it would be better to change the string offset table to store pairs of offset and width instead.	2025-08-15 23:29:10 +09:00
Matt Arsenault	769a9058c8	TableGen: Emit statically generated hash table for runtime libcalls (#150192 ) a96121089b9c94e08c6632f91f2dffc73c0ffa28 reverted a change to use a binary search on the string name table because it was too slow. This replaces it with a static string hash table based on the known set of libcall names. Microbenchmarking shows this is similarly fast to using DenseMap. It's possibly slightly slower than using StringSet, though these aren't an exact comparison. This also saves on the one time use construction of the map, so it could be better in practice. This search isn't simple set check, since it does find the range of possible matches with the same name. There's also an additional check for whether the current target supports the name. The runtime constructed set doesn't require this, since it only adds the symbols live for the target. Followed algorithm from this post http://0x80.pl/notesen/2023-04-30-lookup-in-strings.html I'm also thinking the 2 special case global symbols should just be added to RuntimeLibcalls. There are also other global references emitted in the backend that aren't tracked; we probably should just use this as a centralized database for all compiler selected symbols.	2025-08-15 09:02:56 +09:00
Matt Arsenault	bbcac029db	ARM: Move more aeabi libcall config into tablegen (#152109 )	2025-08-14 15:43:15 +09:00
Matt Arsenault	0a9ca5d7da	TableGen: Add failing function to libcall emitter error message (#153390 )	2025-08-13 23:15:47 +09:00
Benjamin Maxwell	9c361cc068	[TableGen] Avoid duplicate variable names in RuntimeLibcallsEmitter (partial reland of #152505 ) (#153398 )	2025-08-13 12:43:21 +00:00
Nikita Popov	48beed5b71	Revert "[AArch64][SME] Port all SME routines to RuntimeLibcalls" (#153392 ) This introduced a 5% compile-time regression on AArch64, see https://llvm-compile-time-tracker.com/compare.php?from=b9138bde3562de5c28a239dbd303caf2406678c6&to=271688b87abe7cf45aceaff8266270a25eb7b436&stat=instructions:u. Reverts llvm/llvm-project#152505.	2025-08-13 11:54:39 +00:00
Benjamin Maxwell	271688b87a	[AArch64][SME] Port all SME routines to RuntimeLibcalls (#152505 ) This updates everywhere we emit/check an SME routines to use RuntimeLibcalls to get the function name and calling convention. Note: RuntimeLibcallEmitter had some issues with emitting non-unique variable names for sets of libcalls, so I tweaked the output to avoid the need for variables.	2025-08-13 08:48:59 +01:00
Rahul Joshi	89ea9df6a2	[NFCI[TableGen] Minor improvements to `Intrinsic::getAttributes` (#152761 ) This change implements several small improvements to `Intrinsic::getAttributes`: 1. Use `SequenceToOffsetTable` to emit `ArgAttrIdTable`. This enables reuse of entries when they share a common prefix. This reduces the size of this table from 546 to 484 entries, which is 248 bytes. 2. Fix `AttributeComparator` to purely compare argument attributes and not look at function attributes. This avoids unnecessary duplicates in the uniqueing process and eliminates 2 entries from `ArgAttributesInfoTable`, saving 8 bytes. 3. Improve `Intrinsic::getAttributes` code to not initialize all entries of `AS` always. Currently, we initialize all entries of the array `AS` even if we may not use all of them. In addition to the runtime cost, for Clang release builds, since the initialization loop is unrolled, it consumes ~330 bytes of code to initialize the `AS` array. Address this by declaring the storage for AS using just a char array with appropriate `alignas` (similar to how `SmallVectorStorage` defines its inline elements).	2025-08-12 07:15:08 -07:00
Rahul Joshi	7f0e4079c8	[NFCI][TableGen] Make `Intrinsic::getAttributes` table driven (#152349 ) This a follow on to https://github.com/llvm/llvm-project/pull/152219 to reduce both code and frame size of `Intrinsic::getAttributes` further. Currently, this function consists of several switch cases (one per unique argument attributes) that populates the local `AS` array with all non-empty argument attributes for that intrinsic by calling `getIntrinsicArgAttributeSet`. This change makes this code table driven and implements `Intrinsic::getAttributes` without any switch cases, which reduces the code size of this function on all platforms and in addition reduces the frame size by a factor of 10 on Windows. This is achieved by: 1. Emitting table `ArgAttrIdTable` containing a concatenated list of `<ArgNo, AttrID>` entries across all unique arguments. 2. Emitting table `ArgAttributesInfoTable` (indexed by unique arguments-ID) to store the starting index and number of non-empty arg attributes. 3. Reserving unique function-ID 255 to indicate that the intrinsic has no function attributes (to replace `HasFnAttr` setup in each switch case). 4. Using a simple table lookup and for loop to build the list of argument and function attributes for a given intrinsic. Experimental data shows that with release builds and assertions disabled, this change reduces the code size for GCC and Clang builds on Linux by ~9KB for a modest (80/152 byte) increase in frame size. For Windows, it reduces the code size by 20KB and frame size from 4736 bytes to 461 bytes which is 10x reduction. Actual data is as follows: ``` Current trunk: Compiler gcc-13.3.0 clang-18.1.3 MSVC 19.43.34810.0 code size 0x35a9 0x370c 0x5581 frame size 0x120 0x118 0x1280 table driven Intrinsic::getAttributes: code size 0xcfb 0xcd0 0x1cf frame size 0x1b8 0x188 0x1A0 Total savings (code + data) 9212 bytes 9790 bytes 20119 bytes ``` Total savings above accounts for the additional data size for the 2 new tables, which in this experiment was: `ArgAttributesInfoTable` = 314 bytes and `ArgAttrIdTable` = 888 bytes. Coupled with the earlier https://github.com/llvm/llvm-project/pull/152219, this achieves a 46x reduction in frame size for this function in Windows release builds.	2025-08-08 06:02:43 -07:00
Rahul Joshi	22af0cd6f9	[LLVM][Intrinsics] Reduce stack size for `Intrinsic::getAttributes` (#152219 ) This change fixes a stack size regression that got introduced in `0de0354aa8`. That change did 2 independent things: 1. Uniquify argument and function attributes separately so that we generate a smaller number of unique sets as opposed to uniquifying them together. This is beneficial for code size. 2. Eliminate the fixed size array `AS` and `NumAttrs` variable and instead build the returned AttribteList in each case using an initializer list. The second part seems to have caused a regression in the stack size usage of this function for Windows. This change essentially undoes part 2 and reinstates the use of the fixed size array `AS` which fixes this stack size regression. The actual measured stack frame size for this function before/after this change is as follows: ``` Current trunk data for release build (x86_64 builds for Linux, x86 build for Windows): Compiler gcc-13.3.0 clang-18.1.3 MSVC 19.43.34810.0 DLLVM_ENABLE_ASSERTIONS=OFF 0x120 0x110 0x54B0 DLLVM_ENABLE_ASSERTIONS=ON 0x2880 0x110 0x5250 After applying the fix: Compiler gcc-13.3.0 clang-18.1.3 MSVC 19.43.34810.0 DLLVM_ENABLE_ASSERTIONS=OFF 0x120 0x118 0x1240h DLLVM_ENABLE_ASSERTIONS=ON 0x120 0x118 0x1240h ``` Note that for Windows builds with assertions disabled, the stack frame size for this function reduces from 21680 to 4672 which is a 4.6x reduction. Stack frame size for GCC build with assertions also improved and clang builds are unimpacted. The speculation is that clang and gcc is able to reuse the stack space across these switch cases better with existing code, but MSVC is not, and re-introducing the `AS` variable forces all cases to use the same local variable, addressing the stack space regression.	2025-08-06 07:09:52 -07:00
Matt Arsenault	b2f0ffd659	RuntimeLibcalls: Really move default libcall handling to tablegen (#148780 ) Hack in the default setting so it's consistently generated like the other cases. Maintain a list of targets where this applies. The alternative would require new infrastructure to sort the system library initialization in some way. I wanted the unhandled target case to be treated as a fatal error, but it turns out there's a hack in IRSymtab using RuntimeLibcalls, which will fail out in many tests that do not have a triple set. Many of the failures are simply running llvm-as with no triple, which probably should not depend on knowing an accurate set of calls.	2025-08-04 08:32:00 +09:00
Matt Arsenault	bd7db75489	TableGen: Sort RuntimeLibcallImpls secondarily by enum names (#150728 ) Extracted from #150192, this hopefully fixes occasional EXPENSIVE_CHECKS failures.	2025-07-26 11:22:59 +09:00
Matt Arsenault	0b7a95a6fd	Partially Reapply "RuntimeLibcalls: Add methods to recognize libcall names (#149001 )" This partially reverts commit a96121089b9c94e08c6632f91f2dffc73c0ffa28. Drop the IRSymtab changes for now	2025-07-18 18:06:26 +09:00
Nikita Popov	a96121089b	Revert "RuntimeLibcalls: Add methods to recognize libcall names (#149001 )" This reverts commit 45477add8dfe9851605697bd908b49f0ec244625. This causes a significant LTO compile-time regression.	2025-07-18 10:15:19 +02:00
Matt Arsenault	45477add8d	RuntimeLibcalls: Add methods to recognize libcall names (#149001 ) Also replace the current static DenseMap of preserved symbol names in the Symtab hack with this. That was broken statefulness across compiles, so this at least fixes that. However this is still broken, llvm-as shouldn't really depend on the triple.	2025-07-18 08:53:32 +09:00
Krzysztof Parzyszek	661cbd5a52	[utils][TableGen] Make some non-bitmask enums iterable (#148647 ) Additionally, add sentinel values <Enum>::First_ and <Enum>::Last_ to each one of those enums. This will allow using `enum_seq_inclusive` to generate the list of enum-typed values of any generated scoped (non-bitmask) enum.	2025-07-17 11:27:47 -05:00
Matt Arsenault	af56fd0c1a	TableGen: Use StringOffsetTable for RuntimeLibcall names (#148839 )	2025-07-15 23:24:46 +09:00
Matt Arsenault	953416ae40	ARM: Start moving runtime libcalls into tablegen (#146084 ) We still need to manually set the calling conventions of some libcalls until the lowering is separated out.	2025-07-10 19:38:31 +09:00
Rahul Joshi	2366573679	[TableGen] Minor cleanup in `StringToOffsetTable` (#147712 ) Make `AppendZero` a class member instead of an argument to `GetOrAddStringOffset` to reflect the intended usage that for a given `StringToOffsetTable`, all strings must use the same value of `AppendZero`. Modify `EmitStringTableDef` to drop the `Indent` argument as its always set to `""`, and to fail if it's called for a table with non-null-terminated strings.	2025-07-09 12:22:29 -07:00
Matt Arsenault	a1ea9e632b	TableGen: Try to fix expensive checks assert on compare Attempt to fix regression after #144978	2025-07-08 18:23:38 +09:00
Matt Arsenault	58ad99606b	TableGen: Handle setting runtime libcall calling conventions (#144980 ) Allow associating a non-default CallingConv with a set of library functions, and applying a default for a SystemLibrary. I also wanted to be able to apply a default calling conv to a RuntimeLibcallImpl, but that turned out to be annoying so leave it for later.	2025-07-08 17:12:48 +09:00
Matt Arsenault	5088231e52	TableGen: Allow defining sets of runtime libraries (#144978 ) Add a way to define a SystemLibrary for a complete set of libcalls, subdivided by a predicate based on the triple. Libraries can be defined using dag set operations, and the prior default set can be subtracted from and added to (though I think eventually all targets should move to explicit opt-ins. We're still doing things like reporting ppcf128 libcalls as available dy default on all targets). Start migrating some of the easier targets to only use the new system. Targets that don't define a SystemLibrary are still manually mutating a table set to the old defaults.	2025-07-08 10:17:45 +09:00
Matt Arsenault	b88e1f6a79	TableGen: Generate enum for runtime libcall implementations (#144973 ) Work towards separating the ABI existence of libcalls vs. the lowering selection. Set libcall selection through enums, rather than through raw string names.	2025-06-27 17:40:43 +09:00
Matt Arsenault	3fdf46ad60	TableGen: Add runtime libcall backend (#144972 ) Replace RuntimeLibcalls.def with a tablegenerated version. This is in preparation for splitting RuntimeLibcalls into two components. For now match the existing functionality.	2025-06-27 17:37:03 +09:00
Kazu Hirata	981f8e1380	[TableGen] Remove redundant control flow statements (NFC) (#145143 )	2025-06-20 21:55:14 -07:00
zhijian lin	bf79d4819e	[Reland] [PowerPC] frontend get target feature from backend with cpu name (#144594 ) 1. The PR proceeds with a backend target hook to allow front-ends to determine what target features are available in a compilation based on the CPU name. 2. Fix a backend target feature bug that supports HTM for Power8/9/10/11. However, HTM is only supported on Power8/9 according to the ISA. 3. All target features that are hardcoded in PPC.cpp can be retrieved from the backend target feature. I have double-checked that the hardcoded logic for inferring target features from the CPU in the frontend(PPC.cpp) is the same as in PPC.td. The reland patch addressed the comment https://github.com/llvm/llvm-project/pull/137670#discussion_r2143541120	2025-06-19 09:22:16 -04:00
Jay Foad	432c5f2c60	[TableGen] Use `emplace` instead of `insert` and similar. NFC. (#143164 )	2025-06-07 09:32:36 +01:00
Jay Foad	bb9dcb27df	[TableGen] Use `contains` instead of `count`. NFC. (#143156 )	2025-06-07 09:31:42 +01:00
Krzysztof Parzyszek	7b2aa02a33	[utils][TableGen] Handle versions on clause/directive spellings (#143021 ) In "get<lang>DirectiveName(Kind, Version)", return the spelling that corresponds to Version, and in "get<lang>DirectiveKindAndVersions(Name)" return the pair {Kind, VersionRange}, where VersionRange contains the minimum and the maximum versions that allow "Name" as a spelling. This applies to clauses as well. In general it applies to classes that have spellings (defined via TableGen class "Spelling"). Given a Kind and a Version, getting the corresponding spelling requires a runtime search (which can fail in a general case). To avoid generating the search function inline, a small additional component of llvm/Frontent was added: LLVMFrontendDirective. The corresponding header file also defines C++ classes "Spelling" and "VersionRange", which are used in TableGen/DirectiveEmitter as well. For background information see https://discourse.llvm.org/t/rfc-alternative-spellings-of-openmp-directives/85507	2025-06-06 17:07:25 -05:00
Krzysztof Parzyszek	4dcc159485	[utils][TableGen] Implement clause aliases as alternative spellings (#141765 ) Use the spellings in the generated clause parser. The functions `get<lang>ClauseKind` and `get<lang>ClauseName` are not yet updated. The definitions of both clauses and directives now take a list of "Spelling"s instead of a single string. For example ``` def ACCC_Copyin : Clause<[Spelling<"copyin">, Spelling<"present_or_copyin">, Spelling<"pcopyin">]> { ... } ``` A "Spelling" is a versioned string, defaulting to "all versions". For background information see https://discourse.llvm.org/t/rfc-alternative-spellings-of-openmp-directives/85507	2025-06-05 12:35:30 -05:00
Krzysztof Parzyszek	463a2bd159	[utils][TableGen] Treat clause aliases equally with names (#141763 ) The code in DirectiveEmitter that generates clause parsers sorted clause names to ensure that longer names were tried before shorter ones, in cases where a shorter name may be a prefix of a longer one. This matters in the strict Fortran source format, since whitespace is ignored there. This sorting did not take into account clause aliases, which are just alternative names. These extra names were not protected in the same way, and were just appended immediately after the primary name. This patch generates a list of pairs Record+Name, where a given record can appear multiple times with different names. Sort that list and use it to generate parsers for each record. What used to be ``` ("fred" \|\| "f") >> construct<SomeClause>{} \|\| "foo" << construct<OtherClause>{} ``` is now ``` "fred" >> construct<SomeClause>{} \|\| "foo" >> construct<OtherClause>{} \|\| "f" >> construct<SomeClause>{} ```	2025-06-05 09:05:16 -05:00
Krzysztof Parzyszek	2b3e07f485	[utils][TableGen] Unify converting names to upper-camel case (#141762 ) There were 3 different functions in DirectiveEmitter.cpp doing essentially the same thing: taking a name separated with _ or whitepace, and converting it to the upper-camel case. Extract that into a single function that can handle different sets of separators.	2025-06-05 07:34:28 -05:00
Krzysztof Parzyszek	57500cd6a0	[utils][TableGen] Clarify usage of ClauseVal, rename to EnumVal (#141761 ) The class "ClauseVal" actually represents a definition of an enumeration value, and in itself it is not bound to any clause. Rename it to EnumVal and add a comment clarifying how it's translated into an actual enum definition in the generated source code. There is no change in functionality.	2025-06-04 08:16:21 -05:00
Kazu Hirata	252bd80871	[TableGen] Remove unused includes (NFC) (#141356 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-05-24 09:37:40 -07:00
Krzysztof Parzyszek	70f9a8163f	[utils][TableGen] Clean up code in DirectiveEmitter (#140772 ) Remove most redundant function calls. Unify enum identifier name generation (via getIdentifierName), and namespace qualification (via getQualifier).	2025-05-21 07:35:05 -05:00
Krzysztof Parzyszek	46ab684bdd	[utils][TableGen] Unify name qualifications in DirectiveEmitter (#140606 ) Remove extraneous qualifications from names when - the name is explicitly enclosed by corresponding namespaces, and - the name is in a body of a function defined in corresponding namespaces. Otherwise add missing qualifications. This applies to individual sections of TableGen output, and makes name lookup independent of the context in which these sections are included.	2025-05-20 07:56:59 -05:00
Krzysztof Parzyszek	90539473ec	[utils][TableGen] Remove unused parameter in DirectiveEmitter, NFC (#140605 )	2025-05-20 07:56:39 -05:00
Rahul Joshi	3932360b14	[LLVM][TableGen] Rename `ListInit::getValues()` to `getElements()` (#140289 ) Rename `ListInit::getValues()` to `getElements()` to better match with other `ListInit` members like `getElement`. Keep `getValues()` for existing downstream code but mark it deprecated.	2025-05-19 12:16:33 -07:00
Krzysztof Parzyszek	8e2ac7d619	[llvm][OpenMP] Add "SourceLanguages" property to Directive (#139960 ) The official languages that OpenMP recognizes are C/C++ and Fortran. Some OpenMP directives are language-specific, some are C/C++-only, some are Fortran-only. Add a property to the TableGen definition of Directive that will be the list of languages that allow the directive. The TableGen backend will then generate a bitmask-like enumeration SourceLanguages, and a function SourceLanguages getDirectiveLanguages(Directive D);	2025-05-15 07:36:53 -05:00
Rahul Joshi	2e8b539e71	[NFC][TableGen] Add {} for `else` when `if` body has {} (#139420 )	2025-05-12 08:34:12 -07:00
Krzysztof Parzyszek	4b29ee407e	[LLVM][OpenMP] Add "version" parameter to getOpenMPDirectiveName (#139114 ) Some OpenMP directives have different spellings in different versions of the OpenMP spec. To use the proper spelling for a given spec version pass "version" as a parameter to getOpenMPDirectiveName. This parameter won't be used at the moment, and will have a default value to allow callers not to pass it, for gradual adoption in various components. RFC: https://discourse.llvm.org/t/rfc-alternative-spellings-of-openmp-directives/85507	2025-05-09 07:41:27 -05:00
Krzysztof Parzyszek	52c62364e7	[OpenMP] Remove "alternativeName" from Clause and Directive classes (#138179 ) The "alternativeName" was introduced ~5 years ago in D82405, and at the moment it has only one use, the one that the original change introduced. OpenMP 6.0 spec has introduced different spellings of some directives, but the "alternativeName" is not an adequate mechanism to implement it. For those reasons remove the "alternativeName" member from both Clause and Directive in DirectiveBase.td. Since the alternativeName text appeared in a number of clang OpenMP tests, implement `getOpenMPClauseNameForDiag` in SemaOpenMP.cpp to preserve the text of the diagnostics.	2025-05-02 13:02:24 -05:00
Tom Eccles	97eb416c65	[flang][Parser][OpenMP] Fix unparser for cancellation_construct_type (#136001 ) Previously the unparser would print like ``` !$OMP CANCEL CANCELLATION_CONSTRUCT_TYPE(SECTIONS) ``` This is not valid Fortran. I have fixed it to print without the clause name.	2025-04-22 10:55:50 +01:00
Erich Keane	d1cce66469	[OpenACC] Switch Clang to use the Flang 'appertainment' rules for cla… (#135372 ) …uses The Flang implemenation of OpenACC uses a .td file in the llvm/Frontend directory to determine appertainment in 4 categories: -Required: If this list has items in it, the directive requires at least 1 of these be present. -AllowedExclusive: Items on this list are all allowed, but only 1 from the list may be here (That is, they are exclusive of eachother). -AllowedOnce: Items on this list are all allowed, but may not be duplicated. Allowed: Items on this list are allowed. Note th at the actual list of 'allowed' is all 4 of these lists together. This is a draft patch to swtich Clang over to use those tables. Surgery to get this to happen in Clang Sema was somewhat reasonable. However, some gaps in the implementations are obvious, the existing clang implementation disagrees with the Flang interpretation of it. SO, we're keeping a task list here based on what gets discovered. Changes to Clang: - [x] Switch 'directive-kind' enum conversions to use tablegen See ff1a7bddd9435b6ae2890c07eae60bb07898bbf5 - [x] Switch 'clause-kind' enum conversions to use tablegen See ff1a7bddd9435b6ae2890c07eae60bb07898bbf5 - [x] Investigate 'parse' test differences to see if any new disagreements arise. - [x] Clang/Flang disagree as to whether 'collapse' can be multiple times on a loop. Further research showed no prose to limit this, and the comment on the clang implementation said "no good reason to allow", so no standards justification. - [x] Clang/Flang disagree whether 'num_gangs' can appear >1 on a compute/combined construct. This ended up being an unjustified restriction. - [x] Clang/Flang disagree as to the list of required clauses on a 'set' construct. My research shows that Clang mistakenly included 'if' in the list, and that it should be just 'default_async', 'device_num', and 'device_type'. - [x] Order of 'at least one of' diagnostic has changed. Tests were updated. - [x] Ensure we are properly 'de-aliasing' clause names in appertainment checks? - [x] What is 'shortloop'? 'shortloop' seems to be an old non-standard extension that isn't supported by flang, but is parsed for backward compat reasons. Clang won't parse, but we at least have a spot for it in the clause list. - [x] Implemented proposed change for 'routine' gang/worker/vector/seq. (see issue 539) - [x] Implement init/shutdown can only have 1 'if' (see issue 540) - [x] Clang/Flang disagree as to whether 'tile' is permitted more than once on a 'loop' or combined constructs (Flang prohibits >1). I see no justification for this in the standard. EDIT: I found a comment in clang that I did this to make SOMETHING around duplicate checks easier. Discussion showed we should actually have a better behavior around 'device_type' and duplicates, so I've since implemented that. - [x] Clang/Flang disagree whether 'gang', 'worker', or 'vector' may appear on the same construct as a 'seq' on a 'loop' or 'combined'. There is prose for this in 2022: (a gang, worker, or vector clause may not appear if a 'seq' clause appears). EDIT: These don't actually disagree, but aren't in the .td file, so I restored the existing code to do this. - [x] Clang/Flang disagree on whether 'bind' can appear >1 on a 'routine'. I believe line 3096 (A bind clause may not bind to a routine name that has a visible bind clause) makes this limitation (Flang permits >1 bind). we discussed and decided this should have the same rules as worker/vector/etc, except without the 'exactly 1 of' rule (so no dupes in individual sections). - [x] Clang/Flang disagree on whether 'init'/'shutdown' can have multiple 'device_num' clauses. I believe there is no supporting prose for this limitation., We decided that `device_num` should only happen 1x. - [x] Clang/Flang disagree whether 'num_gangs' can appear >1 on a 'kernels' construct. Line 1173 (On a kernels construct, the num_gangs clause must have a single argument) justifies limiting on a per-arguement basis, but doesn't do so for multiple num_gangs clauses. WE decided to do this with the '1-per-device-type' region for num_gangs, num_workers, and vector_length, see openacc bug here: https://github.com/OpenACC/openacc-spec/issues/541 Changes to Flang: - [x] Clang/Flang disgree on whether 'atomic' can take an 'if' clause. This was added in OpenACC3.3_Next See #135451 - [x] Clang/Flang disagree on whether 'finalize' can be allowed >1 times on a 'exit_data' construct. see #135415. - [x] Clang/Flang disagree whether 'if_present' should be allowed >1 times on a 'host_data'/'update' construct. see #135422 - [x] Clang/Flang disagree on whether 'init'/'shutdown' can have multiple 'device_type' clauses. I believe there is no supporting prose for this limitation. - [ ] SEE change for num_gangs/etc above. Changes that need discussion/research:	2025-04-18 14:54:21 -07:00
Nikita Popov	c3c0b27f2d	[Intrinsics] Add support for range attributes (#135642 ) Add support for specifying range attributes in Intrinsics.td. Use this to specify the ucmp/scmp range [-1,2). This case is trickier than existing intrinsic attributes, because we need to create the attribute with the correct bitwidth. As such, the attribute construction now needs to be aware of the function type. We also need to be careful to no longer assign attributes on intrinsics with invalid signatures, as we'd make invalid assumptions about the number of arguments etc otherwise. Fixes https://github.com/llvm/llvm-project/issues/130179.	2025-04-17 11:11:00 +02:00
Pengcheng Wang	90c01f4bad	[RISCV] Add missing bitmask for some extensions (#135599 ) According to: https://github.com/riscv-non-isa/riscv-c-api-doc/blob/main/src/c-api.adoc#extension-bitmask-definitions And we sort the bitmask by group id and then bit position.	2025-04-14 19:18:38 +08:00
Reid Kleckner	0a27c4e318	[StrTable] Use string literal emission for intrinsics on non-MSVC platforms (#124856 ) This mainly transitions the LLVM intrinsic string table from character emission to string literal emission, which I confirmed happens for me locally. I moved the guts of StringToOffsetTable to a cpp file so I could move the `EmitLongStrLiterals` cl::opt global to a non-vague linkage home in the `TableGen` library. I had to add missing FormatVariadic.h includes to account for moving other includes to a cpp file.	2025-04-13 17:58:53 +02:00
Nikita Popov	0738f70615	[Intrinsics] Add Intrinsic::getFnAttributes() (NFC) (#132029 ) Most places that call Intrinsic::getAttributes() are only interested in the function attributes, so add a separate function for that. The motivation for this is that I'd like to add the ability to specify range attributes on intrinsics, which requires knowing the function type. This avoids needing to know the type for most attribute queries.	2025-03-20 09:20:39 +01:00

1 2

79 Commits