llvm-project

Author	SHA1	Message	Date
Yingwei Zheng	38a44bdc93	[CodeGenPrepare] Reverse the canonicalization of isInf/isNanOrInf (#81572 ) In commit `2b582440c1`, we canonicalize the isInf/isNanOrInf idiom into fabs+fcmp for better analysis/codegen (See also the discussion in https://github.com/llvm/llvm-project/pull/76338). This patch reverses the fabs+fcmp to `is.fpclass`. If the `is.fpclass` is not supported by the target, it will be expanded by TLI. Fixes the regression introduced by `2b582440c1` and https://github.com/llvm/llvm-project/pull/80414#issuecomment-1936374206.	2024-03-18 18:27:45 +08:00
Stanislav Mekhanoshin	fe8335babb	[AMDGPU] Select 64-bit imm moves if can be encoded as 32 bit operand (#70395 ) This allows folding of 64-bit operands if fit into 32-bit. Fixes https://github.com/llvm/llvm-project/issues/67781	2023-10-30 08:12:28 -07:00
Jay Foad	f2c164c815	[AMDGPU] Do not wait for vscnt on function entry and return SIInsertWaitcnts inserts waitcnt instructions to resolve data dependencies. The GFX10+ vscnt (VMEM store count) counter is never used in this way. It is only used to resolve memory dependencies, and that is handled by SIMemoryLegalizer. Hence there is no need to conservatively wait for vscnt to be 0 on function entry and before returns. Differential Revision: https://reviews.llvm.org/D153537	2023-07-04 12:22:38 +01:00
Matt Arsenault	0d0ed9a355	AMDGPU: Pattern match fract instructions in AMDGPUCodeGenPrepare This will allow eliminating the intrinsic uses in the device libraries, which will remove a subtarget dependency on the f16 version of the intrinsic. We previously had some wrong patterns for this under unsafe math which I've removed. Do it in IR partially to take advantage of the much better isKnownNeverNaN handling, and partially out of laziness to avoid repeating this in the DAG and GlobalISel path. Plus I think this should be done much earlier. Ideally this would be in InstCombine, but you can't introduce target intrinsics from a generic instruction rooted pattern.	2023-05-18 23:29:47 +01:00
Matt Arsenault	9c1bbcd1e6	AMDGPU: Add baseline tests for fract matching	2023-05-18 19:44:56 +01:00

5 Commits