llvm-project

Author	SHA1	Message	Date
Simon Pilgrim	448fe73428	[X86] Add X86::getVectorRegisterWidth helper. NFC. Replaces internal helper used by addConstantComments to allow reuse in a future patch.	2024-02-08 12:42:33 +00:00
Simon Pilgrim	2096e57905	[X86] addConstantComments - add FP16 MOVSH asm comments support	2024-02-05 18:02:03 +00:00
Simon Pilgrim	f958ad3b89	[X86] printZeroUpperMove - add support for mask predicated instructions Handle masked predicated movss/movsd in addConstantComments now that we can generically handle the destination + mask register This will more significantly help improve 'fixup constant' comments from #73509	2024-02-05 16:23:16 +00:00
Simon Pilgrim	47dcf5d5dc	[X86] printBroadcast - add support for mask predicated instructions Handle masked predicated load/broadcasts in addConstantComments now that we can generically handle the destination + mask register This will more significantly help improve 'fixup constant' comments from #73509	2024-02-05 16:23:15 +00:00
Simon Pilgrim	f4714204d0	[X86] printExtend - add support for mask predicated instructions Remove handling from EmitAnyX86InstComments and handle all VPMOVSX/VPMOVZX comments in addConstantComments now that we can generically handle the destination + mask register and shuffle mask comment	2024-02-05 16:23:15 +00:00
Simon Pilgrim	de9a87301a	[X86] Split up getShuffleComment into printShuffleMask and printDstRegisterName helpers. NFC. This will allow us to easily use printDstRegisterName for other mask predicate destination registers, and printout shuffle masks from other instruction types.	2024-02-05 16:23:15 +00:00
Simon Pilgrim	1af05363d6	[X86] getShuffleComment - use MI description to determine AVX512 masked predicates instead of src index offsets.	2024-02-05 14:22:46 +00:00
Simon Pilgrim	bc6370abd3	[X86] addConstantComments - split VPERMILPS/VPERMILPD handling to reduce repeated switch cases etc. NFC.	2024-02-05 13:48:15 +00:00
Simon Pilgrim	66397435ed	[X86] Add common getSrcIdx helper to determine source index after AVX512 masked predicates. NFC.	2024-02-05 13:48:15 +00:00
Simon Pilgrim	69ffa7be3b	[X86] X86FixupVectorConstants - load+zero vector constants that can be stored in a truncated form (#80428 ) Further develops the vsextload support added in #79815 / b5d35feacb7246573c6a4ab2bddc4919a4228ed5 - reduces the size of the vector constant by storing it in the constant pool in a truncated form, and zero-extend it as part of the load.	2024-02-05 12:17:58 +00:00
Jie Fu	4cf2ed4396	[X86] Fix -Wsign-compare in X86MCInstLower.cpp (NFC) llvm-project/llvm/lib/Target/X86/X86MCInstLower.cpp:1588:48: error: comparison of integers of different signs: 'unsigned int' and 'int' [-Werror,-Wsign-compare] if (C && C->getType()->getScalarSizeInBits() == SrcEltBits) { ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ^ ~~~~~~~~~~ 1 error generated.	2024-02-02 19:41:06 +08:00
Simon Pilgrim	b5d35feacb	[X86] X86FixupVectorConstants - load+sign-extend vector constants that can be stored in a truncated form (#79815 ) Reduce the size of the vector constant by storing it in the constant pool in a truncated form, and sign-extend it as part of the load. I've extended the existing FixupConstant functionality to support these sext constant rebuilds - we still select the smallest stored constant entry and prefer vzload/broadcast/vextload for same bitwidth to avoid domain flips. I intend to add the matching load+zero-extend handling in a future PR, but that requires some alterations to the existing MC shuffle comments handling first.	2024-02-02 11:28:58 +00:00
Simon Pilgrim	4318b033bd	[MC][X86] Merge lane/element broadcast comment printers. (#79020 ) This is /almost/ NFC - the only annoyance is that for some reason we were using "<C1,C2,..>" for ConstantVector types unlike all other cases - these now use the same "[C1,C2,..]" format as the other constant printers.	2024-01-23 12:33:52 +00:00
Alexandre Ganea	bb28442c0b	[CodeGen][X86] Fix lowering of tailcalls when `-ms-hotpatch` is used (#77245 ) Previously, tail jump pseudo-opcodes were skipped by the `encodeInstruction()` call inside `X86AsmPrinter::LowerPATCHABLE_OP`. This caused emission of a 2-byte NOP and dropping of the tail jump. With this PR, we change `PATCHABLE_OP` to not wrap the first `MachineInstr` anymore, but inserting itself before, leaving the instruction unaltered. At lowering time in `X86AsmPrinter`, we now "look ahead" for the next non-pseudo `MachineInstr` and lower+encode it, to inspect its size. If the size is below what `PATCHABLE_OP` expects, it inserts NOPs; otherwise it does nothing. That way, now the first `MachineInstr` is always lowered as usual even if `"patchable-function"="prologue-short-redirect"` is used. Fixes https://github.com/llvm/llvm-project/issues/76879, https://github.com/llvm/llvm-project/issues/76958 and https://github.com/llvm/llvm-project/issues/59039	2024-01-22 14:19:08 -05:00
Simon Pilgrim	27eb8d53ae	[X86] printConstant - add ConstantVector handling	2024-01-22 15:59:55 +00:00
Simon Pilgrim	74ab7958bd	[X86] printZeroUpperMove - add support for constant vectors. Allows cases where movss/movsd etc. are loading constant (ConstantDataSequential) sub-vectors, ensuring we pad with the correct number of zero upper elements by making repeated printConstant calls to print zeroes in a matching int/fp format.	2024-01-22 15:40:46 +00:00
Simon Pilgrim	4e64ed9780	[X86] Update X86::getConstantFromPool to take base OperandNo instead of Displacement MachineOperand This allows us to check the entire constant address calculation, and ensure we're not performing any runtime address math into the constant pool (noticed in an upcoming patch).	2024-01-22 15:40:45 +00:00
Simon Pilgrim	60963272c5	[X86] Add printElementBroadcast constant comments helper. NFC. Pull out helper instead of repeating switch cases.	2024-01-22 12:16:19 +00:00
Simon Pilgrim	09bd2cb70f	[X86] Add printLaneBroadcast constant comments helper. NFC. Pull out helper instead of repeating switch cases.	2024-01-22 12:16:18 +00:00
Simon Pilgrim	1a5eeade16	[X86] Add printZeroUpperMove constant/shuffle comments helper. NFC. Pull out helper instead of repeating switch cases.	2024-01-22 11:44:51 +00:00
Jie Fu	0d51c8704c	[X86] Fix -Wsign-compare in X86MCInstLower.cpp (NFC) llvm-project/llvm/lib/Target/X86/X86MCInstLower.cpp:1867:20: error: comparison of integers of different signs: 'int' and 'unsigned int' [-Werror,-Wsign-compare] if (SclWidth == C->getType()->getScalarSizeInBits()) { ~~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1 error generated.	2024-01-19 22:38:47 +08:00
Simon Pilgrim	a2a0089ac3	[X86] movsd/movss/movd/movq - add support for constant comments (#78601 ) If we're loading a constant value, print the constant (and the zero upper elements) instead of just the shuffle mask. This did require me to move the shuffle mask handling into addConstantComments as we can't handle this in the MC layer.	2024-01-19 14:21:26 +00:00
Simon Pilgrim	110e1717b3	[X86] X86MCInstLower.cpp - fix spelling mistake	2024-01-18 15:44:27 +00:00
Simon Pilgrim	33287e35f2	[X86] Emit verbose (constant) comments before EVEX compression tag (#78585 ) This helps ensure the encoding details are next to the EVEX tag Noticed while preparing to add more constant commenting as part of #73783 and #71078	2024-01-18 15:13:42 +00:00
Simon Pilgrim	d12dffacaa	[X86] Add X86::getConstantFromPool helper function to replace duplicate implementations. We had the same helper function in shuffle decode / vector constant code - move this to X86InstrInfo to avoid duplication.	2024-01-18 11:59:46 +00:00
Shengchen Kan	4f71068b72	[X86] Correct the asm comment for compression NF_ND -> NF	2024-01-12 12:55:11 +08:00
Shengchen Kan	1c674666fa	[X86] Support EVEX compression for EGPR (#77202 ) Compress promoted instruction (EVEX) to pre-promotion instruction (legacy/VEX) when R16-R31 is not used. Alternative of #77065	2024-01-08 16:50:23 +08:00
Simon Pilgrim	d1deeae094	[X86] Rename VBROADCASTF128/VBROADCASTI128 to VBROADCASTF128rm/VBROADCASTI128rm (#75040 ) Add missing rm postfix to show these are load instructions	2023-12-11 11:52:53 +00:00
Simon Pilgrim	2ed15877e7	[X86] Ensure asm comments only print the constant values for the vector load's register width We were printing the entire Constant, which if we were loading from a wider constant pool entry meant that we were confusing the asm comment with upper bits that aren't actually part of the load result	2023-11-17 14:30:30 +00:00
Kazu Hirata	9c5a5a421d	[llvm] Stop including llvm/ADT/iterator_range.h (NFC) Identified with misc-include-cleaner.	2023-10-22 15:41:18 -07:00
Fangrui Song	111fcb0df0	[llvm] Fix duplicate word typos. NFC Those fixes were taken from https://reviews.llvm.org/D137338	2023-09-01 18:25:16 -07:00
Elliot Goodrich	b0abd4893f	[llvm] Add missing StringExtras.h includes In preparation for removing the `#include "llvm/ADT/StringExtras.h"` from the header to source file of `llvm/Support/Error.h`, first add in all the missing includes that were previously included transitively through this header.	2023-06-25 15:42:22 +01:00
prabhukr	30198bd788	[Triple] Add triple for UEFI Target triple to support "x86_64-unknown-uefi" Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D131594	2023-06-06 08:42:28 -07:00
Simon Pilgrim	f1a42300aa	[X86] printConstant - fix asm comment issue when broadcasting from a wider constant pool entry In cases where a broadcast op is loading from a constant entry wider than the broadcast element, we were incorrectly printing the entire entry and not just the lower bits referenced by the broadcast.	2023-05-31 12:28:17 +01:00
Simon Pilgrim	0f8e0f4228	[X86] lowerBuildVectorAsBroadcast - broadcast Constant of original (BuildVector) element size Noticed in D150143/D150526 - we currently create scalar Constant values using the broadcast instruction width, which might be wider than the original build vector width, making it tricky to recognise the original constant bits data. If we have widened the broadcast value, its much more useful for asm comments if we create a ConstantVector with the original element data, add that to the constant-pool and load that with the same (wider) broadcast instruction.	2023-05-27 14:05:44 +01:00
Kyle Huey	3be667ae5a	[X86] Use the CFA when appropriate for better variable locations around calls. Without frame pointers, the locations of variables on the stack are emitted relative to the stack pointer (via the stack pointer being the value of DW_AT_frame_base on the subprogram). If a call modifies the stack pointer this results in the locations being wrong and the debugger displaying the wrong values for variables. By using DW_OP_call_frame_cfa in these situations the emitted location for the variable will automatically handle changes in the stack pointer (provided LLVM is emitting the correct CFI directives elsewhere, of course). The CFA needs to be adjusted for the size of the stack frame (including the return address) to allow the variable locations themselves to remain unchanged by this patch. Certain LLDB features cannot cope with DW_OP_call_frame_cfa, so this change is heuristically limited to the cases where it's necessary for correctness to minimize the fallout there. Reviewed By: #debug-info, scott.linder, jryans, jmorse Differential Revision: https://reviews.llvm.org/D143463	2023-05-23 20:24:55 +00:00
Shengchen Kan	c81a121f3f	Revert "Revert "[X86] Remove patterns for ADC/SBB with immediate 8 and optimize during MC lowering, NFCI"" This reverts commit cb16b33a03aff70b2499c3452f2f817f3f92d20d. In fact, the test https://bugs.chromium.org/p/chromium/issues/detail?id=1446973#c2 already passed after 5586bc539acb26cb94e461438de01a5080513401	2023-05-19 22:21:56 +08:00
Hans Wennborg	cb16b33a03	Revert "[X86] Remove patterns for ADC/SBB with immediate 8 and optimize during MC lowering, NFCI" This caused compiler assertions, see comment on https://reviews.llvm.org/D150107. This also reverts the dependent follow-up change: > [X86] Remove patterns for ADD/AND/OR/SUB/XOR/CMP with immediate 8 and optimize during MC lowering, NFCI > > This is follow-up of D150107. > > In addition, the function `X86::optimizeToFixedRegisterOrShortImmediateForm` can be > shared with project bolt and eliminates the code in X86InstrRelaxTables.cpp. > > Differential Revision: https://reviews.llvm.org/D150949 This reverts commit 2ef8ae134828876ab3ebda4a81bb2df7b095d030 and 5586bc539acb26cb94e461438de01a5080513401.	2023-05-19 14:43:33 +02:00
Shengchen Kan	5586bc539a	[X86] Remove patterns for ADD/AND/OR/SUB/XOR/CMP with immediate 8 and optimize during MC lowering, NFCI This is follow-up of D150107. In addition, the function `X86::optimizeToFixedRegisterOrShortImmediateForm` can be shared with project bolt and eliminates the code in X86InstrRelaxTables.cpp. Differential Revision: https://reviews.llvm.org/D150949	2023-05-19 18:22:30 +08:00
Shengchen Kan	2ef8ae1348	[X86] Remove patterns for ADC/SBB with immediate 8 and optimize during MC lowering, NFCI This is follow-up of D150107.	2023-05-19 10:33:52 +08:00
Shengchen Kan	a3c5952827	[X86][MC] Add labels for BSF in the switch-cases of X86MCInstLower::Lower, NFCI BSF is not special here and leaving it in default label looked weird.	2023-05-18 20:02:41 +08:00
Shengchen Kan	b87bb4e0ff	[X86][MC] Move the code about fixed register encoding optimization to X86EncodingOptimization.cpp, NFCI	2023-05-18 19:42:00 +08:00
Shengchen Kan	025c1587a2	[X86][MC] Move encoding optimization for VCMP to X86::optimizeInstFromVEX3ToVEX2, NFCI This is a follow-up for c13ed1cc7578	2023-05-18 18:30:54 +08:00
Shengchen Kan	15100a2db8	[X86][MC] Move the code about MOV encoding optimization to X86EncodingOptimization.cpp, NFCI	2023-05-18 17:26:54 +08:00
Shengchen Kan	4a92d69a16	[X86][MC] Move the code about INC/DEC encoding optimization to X86EncodingOptimization.cpp, NFCI	2023-05-18 11:11:32 +08:00
Shengchen Kan	d9610b4a56	[X86][MC] Move the code about MOVSX encoding optimization to X86EncodingOptimization.cpp, NFCI	2023-05-17 22:46:28 +08:00
Shengchen Kan	2fb7506f58	[X86][MC] Move the code about VPCMP encoding optimization to X86EncodingOptimization.cpp, NFCI	2023-05-17 22:11:02 +08:00
Shengchen Kan	77589e945f	[X86] Remove patterns for shift/rotate with immediate 1 and optimize during MC lowering It's first suggested by @craig.topper in D150068. I think there are at least three pros 1. This can reduce the patterns during ISEL, as a result, reducing the bytes in X86GenDAGISel.inc 2. The patterns for shift/rotate with immediate 1 look quite similar to shift/rotate with immediate 8. So this can be seen as eliminating "duplicate" code. 3. Delay the optimization from imm8 to imm1, so that the previous optimization passes do not need to handle the version of imm1 It improves fast isel code and makes X86DomainReassignment work for shifts by 1, but regressed global isel, though no one should care. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D150107	2023-05-17 19:55:44 +08:00
Shengchen Kan	db39d47928	[X86][AsmParser] Reapply "Refactor code and optimize more instructions from VEX3 to VEX2" This was reverted in d4994d0e7922 b/c a bolt test failed after the encoding changed. Relanded the patch with the updated test.	2023-05-13 09:26:29 +08:00
Alan Zhao	d4994d0e79	Revert "[X86][AsmParser] Refactor code and optimize more instructions from VEX3 to VEX2" This reverts commit c13ed1cc75781fccc7cec91017e84341c3047229. Reason: causes bolt tests to fail: https://crbug.com/1445243	2023-05-12 16:08:35 -07:00

1 2 3 4 5 ...

438 Commits