llvm-project

Author	SHA1	Message	Date
Igor Kirillov	59a063d5c6	[ExpandMemCmp] Improve memcmp optimisation for boolean results (#71221 ) This patch enhances the optimization of memcmp calls when only two outcomes are needed and comparison fits into one block, for example: bool result = memcmp(a, b, 6) > 0; Previously, LLVM would generate unnecessary operations even when the user of memcmp was only interested in a binary outcome.	2023-11-09 11:52:04 +00:00
Igor Kirillov	849f963e31	[CodeGen] Improve ExpandMemCmp for more efficient non-register aligned sizes handling (#70469 ) * Enhanced the logic of ExpandMemCmp pass to merge contiguous subsequences in LoadSequence, based on sizes allowed in `AllowedTailExpansions`. * This enhancement seeks to minimize the number of basic blocks and produce optimized code when using memcmp with non-register aligned sizes. * Enable this feature for AArch64 with memcmp sizes modulo 8 equal to 3, 5, and 6. Reapplication of #69942 after fixing a bug	2023-10-30 18:40:48 +00:00
Igor Kirillov	deb429e5b0	Revert "[CodeGen] Improve ExpandMemCmp for more efficient non-register aligned sizes handling (#69942 )" This reverts commit 9bcb30d31813bbdea6b65789f64aed3f0e58d507.	2023-10-27 14:12:45 +00:00
Igor Kirillov	9bcb30d318	[CodeGen] Improve ExpandMemCmp for more efficient non-register aligned sizes handling (#69942 ) * Enhanced the logic of ExpandMemCmp pass to merge contiguous subsequences in LoadSequence, based on sizes allowed in `AllowedTailExpansions`. * This enhancement seeks to minimize the number of basic blocks and produce optimized code when using memcmp with non-register aligned sizes. * Enable this feature for AArch64 with memcmp sizes modulo 8 equal to 3, 5, and 6.	2023-10-27 12:41:08 +01:00

4 Commits