llvm-project

Author	SHA1	Message	Date
Arthur Eubanks	0a1aa6cda2	[NFC][CodeGen] Change CodeGenOpt::Level/CodeGenFileType into enum classes (#66295 ) This will make it easy for callers to see issues with and fix up calls to createTargetMachine after a future change to the params of TargetMachine. This matches other nearby enums. For downstream users, this should be a fairly straightforward replacement, e.g. s/CodeGenOpt::Aggressive/CodeGenOptLevel::Aggressive or s/CGFT_/CodeGenFileType::	2023-09-14 14:10:14 -07:00
Sander de Smalen	b8ec2832c3	[AArch64][SME] Various tests should work with +sme, just as they do for +sve (#65260 )	2023-09-13 11:11:51 +01:00
Daniel Hoekwater	866ae69cfa	[AArch64] [BranchRelaxation] Optimize for hot code size in AArch64 branch relaxation On AArch64, it is safe to let the linker handle relaxation of unconditional branches; in most cases, the destination is within range, and the linker doesn't need to do anything. If the linker does insert fixup code, it clobbers the x16 inter-procedural register, so x16 must be available across the branch before linking. If x16 isn't available, but some other register is, we can relax the branch either by spilling x16 OR using the free register for a manually-inserted indirect branch. This patch builds on D145211. While that patch is for correctness, this one is for performance of the common case. As noted in https://reviews.llvm.org/D145211#4537173, we can trust the linker to relax cross-section unconditional branches across which x16 is available. Programs that use machine function splitting care most about the performance of hot code at the expense of the performance of cold code, so we prioritize minimizing hot code size. Here's a breakdown of the cases: Hot -> Cold [x16 is free across the branch] Do nothing; let the linker relax the branch. Cold -> Hot [x16 is free across the branch] Do nothing; let the linker relax the branch. Hot -> Cold [x16 used across the branch, but there is a free register] Spill x16; let the linker relax the branch. Spilling requires fewer instructions than manually inserting an indirect branch. Cold -> Hot [x16 used across the branch, but there is a free register] Manually insert an indirect branch. Spilling would require adding a restore block in the hot section. Hot -> Cold [No free regs] Spill x16; let the linker relax the branch. Cold -> Hot [No free regs] Spill x16 and put the restore block at the end of the hot function; let the linker relax the branch. Ex: [Hot section] func.hot: ... hot code... func.restore: ... restore x16 ... B func.hot [Cold section] func.cold: ... spill x16 ... B func.restore Putting the restore block at the end of the function instead of just before the destination increases the cost of executing the store, but it avoids putting cold code in the middle of hot code. Since the restore is very rarely taken, this is a worthwhile tradeoff. Differential Revision: https://reviews.llvm.org/D156767	2023-09-06 20:44:40 +00:00
Sander de Smalen	3c8bb18162	[AArch64][SME] Don't use OBSCURE_COPY to avoid rematerialization. This is intended to be a non-functional change. This patch removes OBSCURE_COPY in favour of using `forceDisableTriviallyReMaterializable`. Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D159194	2023-09-01 13:36:03 +00:00
Sander de Smalen	9e9be99c97	[AArch64][SME] Disable remat of VL-dependent ops when function changes streaming mode. This is a way to prevent the register allocator from inserting instructions which behave differently for different runtime vector-lengths, inside a call-sequence which changes the streaming-SVE mode before/after the call. I've considered using BUNDLEs in Machine IR, but found that using this is not possible for a few reasons: * Most passes don't look inside BUNDLEs, but some passes would need to look inside these call-sequence bundles, for example the PrologEpilog pass (to remove the CALLSEQSTART/END), a PostRA pass to remove COPY instructions, or the AArch64PseudoExpand pass. * Within the streaming-mode-changing call sequence, one of the instructions is a CALLSEQEND. The corresponding CALLSEQBEGIN (AArch64::ADJCALLSTACKUP) is outside this sequence. This means we'd end up with a BUNDLE that has [SMSTART, COPY, BL, ADJCALLSTACKUP, COPY, SMSTOP]. The MachineVerifier doesn't accept this, and we also can't move the CALLSEQSTART into the call sequence. Maybe in the future we could model this differently by modelling the runtime vector-length as a value that's used by certain operations (similar to e.g. NCZV flags) and clobbered by SMSTART/MMSTOP, such that the register allocator can consider these as actual dependences and avoid rematerialization. For now we just want to address the immediate problem. Reviewed By: paulwalker-arm, aemerson Differential Revision: https://reviews.llvm.org/D159193	2023-09-01 12:13:27 +00:00
Sander de Smalen	a6293228fd	Reland "[AArch64][SME] Add support for Copy/Spill/Fill of strided ZPR2/ZPR4 registers." This patch contains a few changes: * It changes the alignment of the strided/contiguous ZPR2/ZPR4 registers to 128-bits. This is important, because when we spill these registers to the stack, the address doesn't need to be 256/512 bits aligned because we split the single-store/reload pseudo instruction up into multiple STR_ZXI/LDR_ZXI (single vector store/load) instructions, which only require a 128-bit alignment. Additionally, an alignment larger than the stack-alignment is not supported for scalable vectors. * It adds support for these register classes in storeRegToStackSlot, loadRegFromStackSlot and copyPhysReg. * It adds tests only for the strided forms. There is no need to also test the contiguous forms, because a register such as z2_z3 or z4_z5_z6_z7 are also part of the regular ZPR2 and ZPR4 register classes, respectively, which are already covered and tested. Reviewed By: dtemirbulatov Differential Revision: https://reviews.llvm.org/D159189	2023-08-31 15:03:19 +00:00
Sander de Smalen	d6bd6f244e	Revert "[AArch64][SME] Add support for Copy/Spill/Fill of strided ZPR2/ZPR4 registers." This reverts commit 64da981b8b259c18313560bf629e1a8b3b7c1d52.	2023-08-31 14:14:56 +00:00
Sander de Smalen	64da981b8b	[AArch64][SME] Add support for Copy/Spill/Fill of strided ZPR2/ZPR4 registers. This patch contains a few changes: * It changes the alignment of the strided/contiguous ZPR2/ZPR4 registers to 128-bits. This is important, because when we spill these registers to the stack, the address doesn't need to be 256/512 bits aligned because we split the single-store/reload pseudo instruction up into multiple STR_ZXI/LDR_ZXI (single vector store/load) instructions, which only require a 128-bit alignment. Additionally, an alignment larger than the stack-alignment is not supported for scalable vectors. * It adds support for these register classes in storeRegToStackSlot, loadRegFromStackSlot and copyPhysReg. * It adds tests only for the strided forms. There is no need to also test the contiguous forms, because a register such as z2_z3 or z4_z5_z6_z7 are also part of the regular ZPR2 and ZPR4 register classes, respectively, which are already covered and tested. Reviewed By: dtemirbulatov Differential Revision: https://reviews.llvm.org/D159189	2023-08-31 13:47:46 +00:00
Daniel Hoekwater	0982d96186	[CodeGen][AArch64] Don't split inline asm goto blocks or their targets Machine function splitting + branch relaxation currently don't properly handle inline asm goto blocks that conditional branch to cold goto labels. While such inline asm is technically invalid, machine function splitting is the only thing that exposes it as such. Since machine function splitting doesn't help too much in these circumstances anyway, disable it for asm goto blocks and their targets. Differential Revision: https://reviews.llvm.org/D158647	2023-08-29 20:24:38 +00:00
Daniel Hoekwater	ef1c25eb50	[CodeGen][AArch64] Don't split jump table basic blocks Jump tables on AArch64 are label-relative rather than table-relative, so having jump table destinations that are in different sections causes problems with relocation. Jump table lookups have a max range of 1MB, so all destinations must be in the same section as the lookup code. Both of these restrictions can be mitigated with some careful and complex logic, but doing so doesn't gain a huge performance benefit. Efficiently ensuring jump tables are correct and can be compressed on AArch64 is a TODO item. In the meantime, don't split blocks that can cause problems. Differential Revision: https://reviews.llvm.org/D157124	2023-08-28 21:47:57 +00:00
Daniel Hoekwater	8c249c44d4	[CodeGen][AArch64] Don't split functions with a red zone on AArch64 Because unconditional branch relaxation on AArch64 grows the stack to spill a register, splitting a function would cause the red zone to be overwritten. Explicitly disable MFS for such functions. Differential Revision: https://reviews.llvm.org/D157127	2023-08-24 21:57:35 +00:00
David Tellenbach	979e8ae4fc	[AArch64] Check opcode before trying to extract register from operand When matching FNEG patterns for the MachineCombiner we need to check for opcodes first, before trying to extract a register from an operand. Otherwise handling of instructions with non-register operands causes the compiler to crash. Differential Revision: https://reviews.llvm.org/D158473	2023-08-23 14:46:31 -07:00
Daniel Hoekwater	90ab85a1b2	Reland "[CodeGen][AArch64] Make MFS testable on AArch64" Reverted by 3d22dac6c3b97d7bb92f243886dfb0d32a5c42e9 because it depended on b9d079d6188b50730e0a67267b7fee36008435ce, which broke some tests.	2023-08-22 20:21:33 +00:00
Daniel Hoekwater	d7bca8e494	[AArch64] Relax cross-section branches Because the code layout is not known during compilation, the distance of cross-section jumps is not knowable at compile-time. Because of this, we should assume that any cross-sectional jumps are out of range. This assumption is necessary for machine function splitting on AArch64, which introduces cross-section branches in the middle of functions. The linker relaxes out-of-range unconditional branches, but it clobbers X16 to do so; it doesn't relax conditional branches, which must be manually relaxed by the compiler. Differential Revision: https://reviews.llvm.org/D145211	2023-08-16 01:43:07 +00:00
Eric Christopher	b7c7b1e530	Remove elses after return.	2023-08-15 19:10:22 +00:00
Eric Christopher	7f7ef2f309	Remove unused include of Compiler.h	2023-08-15 19:10:22 +00:00
Eric Christopher	ec203c4db9	Sink AArch64ExpandImm.h include from header file to use.	2023-08-15 19:10:21 +00:00
Anatoly Trosinenko	81300f75f4	[AArch64][PAC] Remove the duplication of LR sign/auth implementations In the machine outliner implementation for AArch64, `signOutlinedFunction()` reimplements signing the LR value in prologue and authenticating it in epilogue of the outlined function. This patch factors out `signLR()` and `authenticateLR()` functions from AArch64FrameLowering code and reuses them in `signOutlinedFunction()`. The `mergeOutliningCandidateAttributes()` outliner callback is introduced as well to further unify signing and authentication of the LR value. Reviewed By: tmatheson Differential Revision: https://reviews.llvm.org/D157320	2023-08-11 14:39:18 +03:00
Jon Roelofs	fceabd0de6	Remove a reference to rdar://11522048 The context provided by the radar is mostly "administrative" in nature, and doesn't appear relevant compared to the beautiful comment preceeding it.	2023-08-09 11:26:55 -07:00
Daniel Hoekwater	3435a6a0bb	[AArch64] [XRay] Account for XRay event instrs in Branch Relaxation PATCHABLE_TYPED_EVENT_CALL and PATCHABLE_EVENT_CALL are pseudo instructions that expand to XRay sleds, so getInstSizeInBytes should reflect the size of the sleds, not the pseudo-instructions. Differential Revision: https://reviews.llvm.org/D156272	2023-07-27 17:10:58 +00:00
Alexander Kornienko	0def4e6b0f	Revert "[AArch64] Merge LDRSWpre-LD[U]RSW pair into LDPSWpre" This reverts commit b0093e13fcfdd4eea5bbd7ae57d3d1b82f4135c3 due to a miscompile under MSan. See https://reviews.llvm.org/D152407#4533478 for more details. Reviewed By: asmok-g Differential Revision: https://reviews.llvm.org/D156328	2023-07-26 16:22:24 +02:00
Zhuojia Shen	b0093e13fc	[AArch64] Merge LDRSWpre-LD[U]RSW pair into LDPSWpre This patch optimizes a pair of LDRSWpre and LDRSWui (or LDURSWi) instructions into a single LDPSWpre instruction. This is a missing case in D99272. MIR test cases in D152564 are updated to verify the optimization. Differential Revision: https://reviews.llvm.org/D152407	2023-07-18 09:46:47 -07:00
Sander de Smalen	ec6af93d02	[AArch64] NFC: Replace 'forceStreamingCompatibleSVE' with 'isNeonAvailable'. The AArch64Subtarget interface 'isNeonAvailable' is more appropriate going forward, as we may also want to generate 'streaming SVE' code (not just 'streaming-compatible SVE' code), but here we must still make sure not to use NEON instructions which are invalid in streaming SVE mode.	2023-07-17 08:24:10 +00:00
Fangrui Song	665ccc19d3	[MC] Add SMLoc to MCCFIInstruction to help debug and report better diagnostics for functions like relaxDwarfCallFrameFragment (D153167). In MCStreamer, some emitCFI* functions already take a SMLoc argument. Add a SMLoc argument to the remaining functions that generate a MCCFIInstruction.	2023-06-26 17:58:29 -07:00
Ricardo Jesus	887362ddb5	[AArch64] Neoverse V2 scheduling model This adds a scheduling model for the Neoverse V2. All information is taken from the Neoverse V2 Software Optimisation Guide: https://developer.arm.com/documentation/PJDOC-466751330-593177/r0p2 Differential Revision: https://reviews.llvm.org/D151894	2023-06-14 15:19:42 +00:00
Dávid Bolvanský	09515f2c20	[SDAG] Preserve unpredictable metadata, teach X86CmovConversion to respect this metadata Sometimes an developer would like to have more control over cmov vs branch. We have unpredictable metadata in LLVM IR, but currently it is ignored by X86 backend. Propagate this metadata and avoid cmov->branch conversion in X86CmovConversion for cmov with this metadata. Example: ``` int MaxIndex(int n, int a) { int t = 0; for (int i = 1; i < n; i++) { // cmov is converted to branch by X86CmovConversion if (a[i] > a[t]) t = i; } return t; } int MaxIndex2(int n, int a) { int t = 0; for (int i = 1; i < n; i++) { // cmov is preserved if (__builtin_unpredictable(a[i] > a[t])) t = i; } return t; } ``` Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D118118	2023-06-01 20:56:44 +02:00
Matt Devereau	004bf170c6	[AArch64] Emit FNMADD instead of FNEG(FMADD) Emit FNMADD instead of FNEG(FMADD) for optimization levels above Oz when fast-math flags (nsz+contract) permit it. Differential Revision: https://reviews.llvm.org/D149260	2023-05-10 12:45:54 +00:00
Manoj Gupta	4157625cea	Revert "[AArch64] Emit FNMADD instead of FNEG(FMADD)" This reverts commit ea228bd0bd0173ffd4aac497a312a852e8f7ffad. Cuases a crash on AArch64. Testcase provided at D149260.	2023-05-07 16:38:08 -07:00
Matt Devereau	ea228bd0bd	[AArch64] Emit FNMADD instead of FNEG(FMADD) Emit FNMADD instead of FNEG(FMADD) for optimization levels above Oz when fast-math flags (nsz+contract) permit it. Differential Revision: https://reviews.llvm.org/D149260	2023-05-05 13:35:51 +00:00
Matt Devereau	f9ff2468af	Revert "[AArch64] Emit FNMADD instead of FNEG(FMADD)" This reverts commit caa95c2408677d7af8c7be4da203ea9271854f46.	2023-05-05 10:50:23 +00:00
Matt Devereau	caa95c2408	[AArch64] Emit FNMADD instead of FNEG(FMADD) Emit FNMADD instead of FNEG(FMADD) for optimization levels above Oz when fast-math flags (nsz+contract) permit it. Differential Revision: https://reviews.llvm.org/D149260	2023-05-05 08:14:17 +00:00
Kazu Hirata	972983539b	[llvm] Apply fixes from readability-redundant-control-flow (NFC)	2023-04-16 00:13:46 -07:00
Daniel Hoekwater	6b62166b4c	Account for PATCHABLE instrs in Branch Relaxation PATCHABLE_* instructions expand to up to 36-byte sleds. Updating the size of PATCHABLE instructions causes them to be outlined, so we need to add a check to prevent the outliner from considering basic blocks that contain PATCHABLE instructions. Differential Revision: https://reviews.llvm.org/D147982	2023-04-14 16:14:50 -07:00
Mingming Liu	ec864a5371	[AArch64][PeepholeOpt]Optimize ALU + compare to flag-setting ALU The motivating example is in https://godbolt.org/z/45nbdYMK9 - For this example, `subs` is generated for the good case; `sub` followed by `cmp` is generated for the bad case. Since signed overflow is undefined behavior in C/C++ (indicated as `nsw` flag in LLVM IR), `subs` should be generated for the good case as well. This patch relaxes one restriction from "quit optimization when V is used" to "continue if MI produces poison value when signed overflow occurs". This is not meant to be C/C++ specific since it looks at 'NoSWrap' since it looks at MachineInstr flags. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D146820	2023-03-27 10:55:45 -07:00
Amara Emerson	41e9c4b88c	[NFC][Outliner] Delete default ctors for Candidate & OutlinedFunction. I think it's good practice to avoid having default ctors unless they're really valid/useful. For OutlinedFunction the default ctor was used to represent a bail-out value for getOutliningCandidateInfo(), so I changed the API to return an optional<getOutliningCandidateInfo> instead which seems a tad cleaner. Differential Revision: https://reviews.llvm.org/D146375	2023-03-20 11:17:10 -07:00
Hsiangkai Wang	0847cc06a6	[NFC][AArch64] Use 'i' to encode the offset form of load/store. STG, STZG, ST2G, STZ2G are the exceptions to append 'Offset' to name the offset format of load/store instructions. All other load/store instructions use 'i' as the appendix. If there is no special reason to do so, we should make the naming consistent. Differential Revision: https://reviews.llvm.org/D141819	2023-03-06 12:34:19 +00:00
duk	d61d591411	[MachineOutliner] Make getOutliningType partially target-independent The motivation behind this patch is to unify some of the outliner logic across architectures. This looks nicer in general and makes fixing [issues like this](https://reviews.llvm.org/D124707#3483805) easier. There are some notable changes here: 1. `isMetaInstruction()` is used directly instead of checking for specific meta-instructions like `IMPLICIT_DEF` or `KILL`. This was already done in the RISC-V implementation, but other architectures still did hardcoded checks. - As an exception to this, CFI instructions are explicitly delegated to the target because RISC-V has different handling for those. 2. `isTargetIndex()` checks are replaced with an assert; none of the architectures supported actually use `MO_TargetIndex` at this point in time. 3. `isCFIIndex()` and `isFI()` checks are also replaced with asserts, since these operands should not exist in [any context](https://reviews.llvm.org/D122635#3447214) at this stage in the pipeline. Reviewed by: paquette Differential Revision: https://reviews.llvm.org/D125072	2023-02-09 14:35:00 -05:00
Jessica Paquette	4de8521bc5	[MachineOutliner][AArch64] NFC: Split MBBs into "outlinable ranges" Recommit with bug fixes + added testcases to the outliner. Also adds some debug output. We found a case in the Swift benchmarks where the MachineOutliner introduces about a 20% compile time overhead in comparison to building without the MachineOutliner. The origin of this slowdown is that the benchmark has long blocks which incur lots of LRU checks for lots of candidates. Imagine a case like this: ``` bb: i1 i2 i3 ... i123456 ``` Now imagine that all of the outlining candidates appear early in the block, and that something like, say, NZCV is defined at the end of the block. The outliner has to check liveness for certain registers across all candidates, because outlining from areas where those registers are used is unsafe at call boundaries. This is fairly wasteful because in the previously-described case, the outlining candidates will never appear in an area where those registers are live. To avoid this, precalculate areas where we will consider outlining from. Anything outside of these areas is mapped to illegal and not included in the outlining search space. This allows us to reduce the size of the outliner's suffix tree as well, giving us a potential memory win. By precalculating areas, we can also optimize other checks too, like whether or not LR is live across an outlining candidate. Doing all of this is about a 16% compile time improvement on the case. This is likely useful for other targets (e.g. ARM + RISCV) as well, but for now, this only implements the AArch64 path. The original "is the MBB safe" method still works as before.	2023-02-03 15:33:37 -08:00
Evgenii Stepanov	bd3ee371e9	Revert "[AArch64][v8.3A] Avoid inserting implicit landing pads (PACI*SP)" Linux kernel sets SCTRL_EL1.BT0 and BT1 to 1 unconditionally, which makes PACIASP equivalent to BTI C + PACIA LR,SP. Use the shorter instruction sequence by default. I'm not aware of anyone who needs the opposite. They are welcome to revert to the current behavior under a subtarget feature or an environment check. This reverts commit 571c8c5263a79293aaadae07b11feb36726eaf53. Differential Revision: https://reviews.llvm.org/D141978	2023-01-19 14:09:22 -08:00
Craig Topper	79858d1908	[CodeGen][Target] Remove uses of Register::isPhysicalRegister/isVirtualRegister. NFC Use isPhysical/isVirtual methods.	2023-01-13 23:12:48 -08:00
Guillaume Chatelet	8fd5558b29	[NFC] Use TypeSize::geFixedValue() instead of TypeSize::getFixedSize() This change is one of a series to implement the discussion from https://reviews.llvm.org/D141134.	2023-01-11 16:49:38 +00:00
Guillaume Chatelet	48f5d77eee	[NFC] Use TypeSize::getKnownMinValue() instead of TypeSize::getKnownMinSize() This change is one of a series to implement the discussion from https://reviews.llvm.org/D141134.	2023-01-11 16:36:39 +00:00
KAWASHIMA Takahiro	6d0c3eb49d	[AArch64] Add SVE int instructions to isAssociativeAndCommutative Differential Revision: https://reviews.llvm.org/D140398	2023-01-10 10:39:49 +09:00
KAWASHIMA Takahiro	1088ef5cb0	[AArch64] Add SVE FP instructions to isAssociativeAndCommutative Differential Revision: https://reviews.llvm.org/D140396	2023-01-10 10:39:48 +09:00
serge-sans-paille	38818b60c5	Move from llvm::makeArrayRef to ArrayRef deduction guides - llvm/ part Use deduction guides instead of helper functions. The only non-automatic changes have been: 1. ArrayRef(some_uint8_pointer, 0) needs to be changed into ArrayRef(some_uint8_pointer, (size_t)0) to avoid an ambiguous call with ArrayRef((uint8_t), (uint8_t)) 2. CVSymbol sym(makeArrayRef(symStorage)); needed to be rewritten as CVSymbol sym{ArrayRef(symStorage)}; otherwise the compiler is confused and thinks we have a (bad) function prototype. There was a few similar situation across the codebase. 3. ADL doesn't seem to work the same for deduction-guides and functions, so at some point the llvm namespace must be explicitly stated. 4. The "reference mode" of makeArrayRef(ArrayRef<T> &) that acts as no-op is not supported (a constructor cannot achieve that). Per reviewers' comment, some useless makeArrayRef have been removed in the process. This is a follow-up to https://reviews.llvm.org/D140896 that introduced the deduction guides. Differential Revision: https://reviews.llvm.org/D140955	2023-01-05 14:11:08 +01:00
KAWASHIMA Takahiro	347d2be7be	[AArch64] Add Neon int instructions to isAssociativeAndCommutative Differential Revision: https://reviews.llvm.org/D139810	2022-12-20 23:47:51 +09:00
KAWASHIMA Takahiro	673b4ad645	[AArch64] Add FP16 instructions to isAssociativeAndCommutative `-mcpu=` in `llvm/test/CodeGen/AArch64/machine-combiner.ll` is changed to `neoverse-n2` to use FP16 and SVE/SVE2 instructions. By this, the register allocation and/or instruction scheduling are slightly changed and some existing `CHECK` lines need to be updated. Differential Revision: https://reviews.llvm.org/D139809	2022-12-20 23:47:51 +09:00
Christudasan Devadasan	b5efec4b27	[CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot With D134950, targets get notified when a virtual register is created and/or cloned. Targets can do the needful with the delegate callback. AMDGPU propagates the virtual register flags maintained in the target file itself. They are useful to identify a certain type of machine operands while inserting spill stores and reloads. Since RegAllocFast spills the physical register itself, there is no way its virtual register can be mapped back to retrieve the flags. It can be solved by passing the virtual register as an additional argument. This argument has no use when the spill interfaces are called during the greedy allocator or even the PrologEpilogInserter and can pass a null register in such cases. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D138656	2022-12-17 11:55:34 +05:30
Matt Arsenault	588ecc11b8	AArch64: Stop storing MachineFunction in MachineFunctionInfo The constructor should not depend on the MachineFunction	2022-12-16 12:30:03 -05:00
KAWASHIMA Takahiro	a008b892a9	[AArch64][NFC] Change order of instructions in isAssociativeAndCommutative Before this change, the order of instructions in `case` labels was inconsistent. It is alphabetical order for FP instructions but another order for integer instructions. This commit changes the order to 1) instruction set (base/FP/SIMD), 2) mnemonic, 3) element type. I believe this change makes it consistent, improves understandability, and makes it easy to add/remove a group of instructions. Differential Revision: https://reviews.llvm.org/D139607	2022-12-12 09:52:03 +09:00

1 2 3 4 5 ...

544 Commits