llvm-project

Author	SHA1	Message	Date
paperchalice	f3df058b03	[Passes] Report error when pass requires target machine (#142550 ) Fixes #142146 Do nullptr check when pass accept `const TargetMachine &` in constructor, but it is still not exhaustive.	2025-10-23 12:57:03 +08:00
Frederik Harwath	46a866ab77	expand-fp: Refactor modification status handling (NFC) (#163542 ) Modify the return value of the runImpl function which indicates whether or not the IR has been changed in a single place instead of doing it separately for each instruction at the insertion into the worklist. Further changes: Replace if-else in worklist processing loop by switch and add test cases which demonstrate that the "scalarize" function does not always add items to the worklist and hence a worklist emptiness check cannot be used for the runImpl return value.	2025-10-20 10:24:12 +02:00
Frederik Harwath	8cc862ce3b	[AMDGPU] expand-fp: always report modifications (#163153 ) The last change to the pass in PR #158588 lost the assignment to the "Modified" variable for one of the pass optimizations. Add it back. This fixes the test failure in `CodeGen/AMDGPU/itofp.i128.bf.ll` (in a `LLVM_ENABLE_EXPENSIVE_CHECKS=ON` build).	2025-10-13 15:21:02 +00:00
Frederik Harwath	7314565281	[AMDGPU] expand-fp: unify scalarization (NFC) (#158588 ) Extend the existing "scalarize" function which is used for the fp-integer conversion instruction expansion to BinaryOperator instructions and reuse it for the frem expansion; a similar function for scalarizing BinaryOperator instructions exists in the ExpandLargeDivRem pass and this change is a step towards merging that pass with ExpandFp. Further refactoring: Scalarize directly instead of using the "ReplaceVector" as a worklist, rename "Replace" vector to "Worklist", and hoist a check for unsupported scalable vectors to the top of the instruction visiting loop.	2025-10-13 09:18:03 +02:00
Frederik Harwath	ffcf82c4a8	[AMDGPU] Change expand-fp opt level argument syntax (#157408 ) Align the syntax used for the optimization level argument of the expand-fp pass in textual descriptions of pass pipelines with the syntax used by other passes taking a similar argument. That is, use e.g. `expand-fp<O1>` instead of `expand-fp<opt-level=1>`.	2025-09-10 10:44:28 +02:00
Frederik Harwath	7f6098ed98	[X86] Fix expand-fp on optnone functions (#156900 ) As observed by @mikaelholmen, PR #130988 "[AMDGPU] Implement IR expansion for frem instruction" introduced a regression on x86. Its changes led to the pass being skipped on functions with the optnone attribute. @bjope also noted that a check concerning the optnone handling is wrong. This patch fixes both issues which together fixes the regression. During the review it was observed that, even before PR #130988, the pass would not run on optnone functions with the new pass manager. This is also fixed.	2025-09-05 16:22:22 +02:00
Frederik Harwath	47793f9a73	[AMDGPU] Implement IR expansion for frem instruction (#130988 ) This patch implements a correctly rounded expansion of the frem instruction in LLVM IR. This is useful for target architectures for which such an expansion is too involved to be implement in ISel Lowering. The expansion is based on the code from the AMD device libs and has been tested successfully against the OpenCL conformance tests on amdgpu. The expansion is implemented in the preexisting "expand-fp" pass. It replaces the expansion of "frem" in ISel for the amdgpu target; it is enabled for targets which do not directly support "frem" and for which no matching "fmod" LibCall is available. --------- Co-authored-by: Matt Arsenault <Matthew.Arsenault@amd.com>	2025-09-03 16:27:15 +02:00
Craig Topper	763f425b08	[ExpandFP] Replace getIntN(Ty) with getInt32/64(Ty). NFC (#150501 )	2025-07-24 13:10:16 -07:00
Frederik Harwath	6962cf1700	Rename ExpandLargeFpConvertPass to ExpandFpPass (#131128 ) This is meant as a preparation for PR #130988 "[AMDGPU] Implement IR expansion for frem instruction" which implements the expansion of another instruction in this pass. The more general name seems more appropriate given this change and quite reasonable even without it.	2025-03-14 13:11:45 +01:00

9 Commits