llvm-project

Author	SHA1	Message	Date
Kai Luo	56414220df	[PowerPC] Use 'sync; ld; cmp; bc; isync' for atomic load seq-cst on 32-bit platform (#75905 ) `cmp; bc; isync` is more performant than `lwsync` theoretically. 64-bit platform already features it, now implement it for 32-bit platform.	2023-12-20 10:01:02 +08:00
Kai Luo	2f82662ce9	[PowerPC] Let base implementation decide if MI is rematerizable by default (#75772 ) If MI is not PPC specific instructions, let base implementation decide if MI is rematerizable. This can fix failure in #75570 after #75271 .	2023-12-18 17:39:22 +08:00
Vitaly Buka	36477f7562	[XCOFF] Fix UB 'left shift of negative value' after #72532	2023-12-14 11:28:48 -08:00
stephenpeckham	2fd7657b66	[XCOFF] Display branch-absolute targets in hex. (#72532 ) Branch-absolute instructions are currently printed in decimal, and negative addresses are printed as positive numbers. With this change, addresses are printed in hex and negative addresses are converted to an unsigned 32- or 64-bit address.	2023-12-13 12:55:47 -06:00
Kazu Hirata	586ecdf205	[llvm] Use StringRef::{starts,ends}_with (NFC) (#74956 ) This patch replaces uses of StringRef::{starts,ends}with with StringRef::{starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. I'm planning to deprecate and eventually remove StringRef::{starts,ends}with.	2023-12-11 21:01:36 -08:00
Kazu Hirata	286ef12b47	[Target] Remove unnecessary includes (NFC)	2023-12-07 21:03:56 -08:00
Craig Topper	e87f33d9ce	[RISCV][MC] Pass MCSubtargetInfo down to shouldForceRelocation and evaluateTargetFixup. (#73721 ) Instead of using the STI stored in RISCVAsmBackend, try to get it from the MCFragment. This addresses the issue raised here https://discourse.llvm.org/t/possible-problem-related-to-subtarget-usage/75283	2023-12-07 13:17:58 -08:00
Stefan Pintilie	ea8b95d0d5	[PowerPC] Add a set of extended mnemonics that are missing from Power 10. (#73003 ) This patch adds the majority of the missing extended mnemonics that were introduced in Power 10. The only extended mnemonics that were not added are related to the plq and pstq instructions. These will be added in a separate patch as the instructions themselves would also have to be added.	2023-12-07 13:40:00 -05:00
Chen Zheng	4b932d84f4	[PowerPC] redesign the target flags (#69695 ) 12 bit is not enough for PPC's target specific flags. If 8 bit for the bitmask flags, 4 bit for the direct mask, PPC can total have 16 direct mask and 8 bitmask. Not enough for PPC, see this issue in https://github.com/llvm/llvm-project/pull/66316 Redesign how PPC target set the target specific flags. With this patch, all ppc target flags are direct flags. No bitmask flag in PPC anymore. This patch aligns with some targets like X86 which also has many target specific flags. The patch also fixes a bug related to flag `MO_TLSGDM_FLAG` and `MO_LO`. They are the same value and the test case changes in this PR shows the bug.	2023-12-07 12:47:25 +08:00
Alex Bradbury	b717365216	[MachineScheduler][NFCI] Add Offset and OffsetIsScalable args to shouldClusterMemOps (#73778 ) These are picked up from getMemOperandsWithOffsetWidth but weren't then being passed through to shouldClusterMemOps, which forces backends to collect the information again if they want to use the kind of heuristics typically used for the similar shouldScheduleLoadsNear function (e.g. checking the offset is within 1 cache line). This patch just adds the parameters, but doesn't attempt to use them. There is potential to use them in the current PPC and AArch64 shouldClusterMemOps implementation, and I intend to use the offset in the heuristic for RISC-V. I've left these for future patches in the interest of being as incremental as possible. As noted in the review and in an inline FIXME, an ElementCount-style abstraction may later be used to condense these two parameters to one argument. ElementCount isn't quite suitable as it doesn't support negative offsets.	2023-12-06 15:30:48 +00:00
Kazu Hirata	92c2529ccd	[llvm] Stop including vector (NFC) Identified with clangd.	2023-12-03 22:32:21 -08:00
Stefan Pintilie	4069299d71	[PowerPC] Fix the register class for the instruction paddi. (#73995 ) The register class for the PADDI definition is incorrect as register zero for RA is treated as an actual zero.	2023-12-01 15:53:08 -05:00
Ramkumar Ramachandra	9468de48fc	TargetInstrInfo: make getOperandLatency return optional (NFC) (#73769 ) getOperandLatency has the following behavior: it returns -1 as a special value, negative numbers other than -1 on some target-specific overrides, or a valid non-negative latency. This behavior can be surprising, as some callers do arithmetic on these negative values. Change the interface of getOperandLatency to return a std::optional<unsigned> to prevent surprises in callers. While at it, change the interface of getInstrLatency to return unsigned instead of int. This change was inspired by a refactoring in TargetSchedModel::computeOperandLatency.	2023-12-01 11:29:19 +00:00
David Spickett	da1aff2b2a	[llvm][PowerPC] Correct handling of spill slots for SPE when EXPENSIVE_CHECKS is enabled (#73940 ) This was modifying a container as it iterated it, which tripped a check in libstdc++'s debug checks. Instead, just assign to the item via the reference we already have. This fixes the following expensive checks failures on my machine: LLVM :: CodeGen/PowerPC/fp-strict.ll LLVM :: CodeGen/PowerPC/pr55463.ll LLVM :: CodeGen/PowerPC/register-pressure.ll LLVM :: CodeGen/PowerPC/spe.ll Which are some of the tests noted by #68594.	2023-12-01 10:40:24 +00:00
Alex Bradbury	6cf3566850	[NFC][MachineScheduler] Rename NumLoads parameter of shouldClusterMemOps to ClusterSize (#73757 ) As the same hook is called for both load and store clustering, NumLoads is a misleading name. Use ClusterSize instead.	2023-11-29 09:47:03 +00:00
Stefan Pintilie	d896b1f5a6	[PowerPC] Do not string pool globals that are part of llvm used. (#66848 ) The string pooling pass was incorrectly pooling global varables that were part of llvm.used or llvm.compiler.used. This patch fixes the pass to prevent that by checking each candidate to make sure that it is not in either of those lists.	2023-11-24 12:21:28 -05:00
Craig Topper	74dcc2da62	[PowerPC] Silence -Woverloaded-virtual warning. NFC	2023-11-22 15:45:20 -08:00
Craig Topper	a845061935	[AArch64] Use the same fast math preservation for MachineCombiner reassociation as X86/PowerPC/RISCV. (#72820 ) Don't blindly copy the original flags from the pre-reassociated instrutions. This copied the integer poison flags which are not safe to preserve after reassociation. For the FP flags, I think we should only keep the intersection of the flags. Override setSpecialOperandAttr to do this. Fixes #72777.	2023-11-22 14:17:45 -08:00
Sander de Smalen	81b7f115fb	[llvm][TypeSize] Fix addition/subtraction in TypeSize. (#72979 ) It seems TypeSize is currently broken in the sense that: TypeSize::Fixed(4) + TypeSize::Scalable(4) => TypeSize::Fixed(8) without failing its assert that explicitly tests for this case: assert(LHS.Scalable == RHS.Scalable && ...); The reason this fails is that `Scalable` is a static method of class TypeSize, and LHS and RHS are both objects of class TypeSize. So this is evaluating if the pointer to the function Scalable == the pointer to the function Scalable, which is always true because LHS and RHS have the same class. This patch fixes the issue by renaming `TypeSize::Scalable` -> `TypeSize::getScalable`, as well as `TypeSize::Fixed` to `TypeSize::getFixed`, so that it no longer clashes with the variable in FixedOrScalableQuantity. The new methods now also better match the coding standard, which specifies that: * Variable names should be nouns (as they represent state) * Function names should be verb phrases (as they represent actions)	2023-11-22 08:52:53 +00:00
Simon Pilgrim	cfee7152d4	[DAG] clang-format createBranchMacroFusionDAGMutation calls. NFC. Reduces diff in #72227	2023-11-20 12:13:09 +00:00
Kai Luo	eb7698254a	[PowerPC][EarlyIfConversion] Do not insert `isel` if subtarget doesn't support `isel` (#72211 ) Some subtargets of PPC don't support `isel` instruction, early-ifcvt should not insert this instruction.	2023-11-20 09:17:04 +08:00
Qiu Chaofan	426ad99bb2	[PowerPC] Forbid f128 SELECT_CC optimized into fsel (#71497 )	2023-11-15 12:20:06 +08:00
Kazu Hirata	8842d59c9f	[llvm] Stop including llvm/ADT/BitVector.h (NFC) Identified with clangd.	2023-11-11 13:24:01 -08:00
Kazu Hirata	01702c3f7f	[llvm] Stop including llvm/ADT/SmallSet.h (NFC) Identified with clangd.	2023-11-11 12:32:15 -08:00
Kazu Hirata	d4360e428f	[llvm] Stop including llvm/ADT/DenseMap.h (NFC) Ientified with clangd.	2023-11-11 10:07:19 -08:00
Kazu Hirata	ac4a272913	[llvm] Stop including llvm/ADT/DenseSet.h (NFC) Identified with clangd.	2023-11-11 09:48:29 -08:00
Kazu Hirata	84a48ee9fb	[llvm] Stop including llvm/ADT/SetVector.h (NFC) Identified with clangd.	2023-11-10 23:50:23 -08:00
Craig Topper	c4821073cd	[GISel] Make target's PartMapping, ValueMapping, and BankIDToCopyMapIdx arrays const. (#71079 ) AMDGPU arrays were already const.	2023-11-09 17:03:56 -08:00
Jay Foad	d5f3b3b3b1	[RegScavenger] Simplify state tracking for backwards scavenging (#71202 ) Track the live register state immediately before, instead of after, MBBI. This makes it simple to track the state at the start or end of a basic block without a separate (and poorly named) Tracking flag. This changes the API of the backward(MachineBasicBlock::iterator I) method, which now recedes to the state just before, instead of just after, *I. Some clients are simplified by this change. There is one small functional change shown in the lit tests where multiple spilled registers all need to be reloaded before the same instruction. The reloads will now be inserted in the opposite order. This should not affect correctness.	2023-11-08 09:49:07 +00:00
Qiu Chaofan	5f295552f1	[PowerPC] Fix incorrect symbol name of frexp libcall (#71626 ) frexpl is for ppc_fp128. The correct symbol name for f128 is frexpf128.	2023-11-08 14:41:19 +08:00
Paulo Matos	7b9d73c2f9	[NFC] Remove Type::getInt8PtrTy (#71029 ) Replace this with PointerType::getUnqual(). Followup to the opaque pointer transition. Fixes an in-code TODO item.	2023-11-07 17:26:26 +01:00
Stefan Pintilie	423ad04c67	[PowerPC] Add an alias for -mregnames so that full register names used in assembly. (#70255 ) This option already exists on GCC and so it is being added to LLVM so that we use the same option as them.	2023-11-06 12:30:19 -05:00
Youngsuk Kim	e69e066bfe	[llvm][PowerPC] Remove no-op ptr-to-ptr bitcasts (NFC) Opaque ptr cleanup effort.	2023-11-01 16:40:32 -05:00
Fangrui Song	8e247b8f47	Replace TypeSize::{getFixed,getScalable} with canonical TypeSize::{Fixed,Scalable}. NFC	2023-10-27 00:30:41 -07:00
Craig Topper	2f4328e697	[GISel] Make assignValueToReg take CCValAssign by const reference. (#70086 ) This was previously passed by value. It used to be passed by non-const reference, but it was changed to value in D110610. I'm not sure why.	2023-10-24 15:47:04 -07:00
Craig Topper	9f592cbc18	[GISel] Pass MPO and VA to assignValueToAddress by const reference. NFC (#69810 ) Previously they were passed by non-const reference. No in tree target modifies the values. This makes it possible to call assignValueToAddress from assignCustomValue without a const_cast. For example in this patch https://github.com/llvm/llvm-project/pull/69138.	2023-10-24 09:58:22 -07:00
Qiu Chaofan	53edf45233	[PowerPC] Remove HTM instruction from P10 SchedModel (#69579 ) Power10 does not support Hardware Transactional Memory instructions. Remove to keep consistency.	2023-10-20 17:17:01 +08:00
Arthur Eubanks	5e4ec53b8e	[llc][PPC] Move PIC check into TargetMachine (#66727 ) Matches other code like the code model checking.	2023-10-16 10:41:20 -07:00
Kazu Hirata	4a0ccfa865	Use llvm::endianness::{big,little,native} (NFC) Note that llvm::support::endianness has been renamed to llvm::endianness while becoming an enum class as opposed to an enum. This patch replaces support::{big,little,native} with llvm::endianness::{big,little,native}.	2023-10-12 21:21:45 -07:00
Kai Luo	3104681686	[PowerPC][Atomics] Remove redundant block to clear reservation (#68430 ) This PR is following what https://reviews.llvm.org/D134783 does for quardword CAS.	2023-10-13 10:59:27 +08:00
Nikita Popov	127ed9ae26	[PowerPC] Use zext instead of anyext in custom and combine (#68784 ) This custom combine currently converts `and(anyext(x),c)` into `anyext(and(x,c))`. This is not correct, because the original expression guaranteed that the high bits are zero, while the new one sets them to undef. Emit `zext(and(x,c))` instead. Fixes https://github.com/llvm/llvm-project/issues/68783.	2023-10-12 09:32:17 +02:00
Kai Luo	dcf9d946f5	[PowerPC][Atomics] Simplify atomicrmw i128 patterns. NFC. (#68779 ) Most fragments of these patterns are the same, we can simplify them by defining a common pattern.	2023-10-12 12:00:35 +08:00
Kazu Hirata	a9d5056862	Use llvm::endianness (NFC) Now that llvm::support::endianness has been renamed to llvm::endianness, we can use the shorter form. This patch replaces support::endianness with llvm::endianness.	2023-10-10 21:54:15 -07:00
Lei	529ad40e05	[PowerPC] Fix missing kill flag update for XVCVDPSP transformations (#67997 ) Add transformed register to kill flag work list for XVCVDPSP tranformations. Ref: reviews.llvm.org/D133103	2023-10-06 10:24:54 -04:00
Matthias Braun	5181156b37	Use BlockFrequency type in more places (NFC) (#68266 ) The `BlockFrequency` class abstracts `uint64_t` frequency values. Use it more consistently in various APIs and disable implicit conversion to make usage more consistent and explicit. - Use `BlockFrequency Freq` parameter for `setBlockFreq`, `getProfileCountFromFreq` and `setBlockFreqAndScale` functions. - Return `BlockFrequency` in `getEntryFreq()` functions. - While on it change some `const BlockFrequency& Freq` parameters to plain `BlockFreqency Freq`. - Mark `BlockFrequency(uint64_t)` constructor as explicit. - Add missing `BlockFrequency::operator!=`. - Remove `uint64_t BlockFreqency::getMaxFrequency()`. - Add `BlockFrequency BlockFrequency::max()` function.	2023-10-05 11:40:17 -07:00
Stefan Pintilie	23e7da8f10	[PowerPC] Add the SCV instruction. (#68063 ) The SCV instruciton was added on PowerPC on Power 9. This patch adds the SCV so that it may be used as part of inline asm but does not provide patterns for it or scheduling information. Co-authored-by: Stefan Pintilie <stefanp@ca.ibm.com>	2023-10-05 12:51:04 -04:00
Nikita Popov	5e81d6742a	[PPCBoolRetToInt] Avoid use of ConstantExpr::getZExt() (NFC) Use IRBuilder instead, which will either insert an instruction or constant fold.	2023-09-28 16:01:51 +02:00
Kishan Parmar	696ea67f19	Disable call to fma for soft-float PowerPC backend generate calls to libc function calls for soft-float, regardless of the -nostdlib /-ffreestanding flag. fma is not a function provided by compiler-rt builtins and thus should not be generated here. PR : [[ https://github.com/llvm/llvm-project/issues/55230 \| #55230 ]] Below is patch given by @nemanjai Reviewed By: jhibbits Differential Revision: https://reviews.llvm.org/D156344	2023-09-28 14:06:54 +05:30
Wael Yehia	da55b1b52f	[XCOFF] Do not generate the special .ref for zero-length sections (#66805 ) Co-authored-by: Wael Yehia <wyehia@ca.ibm.com>	2023-09-28 01:33:41 -04:00
esmeyi	f2f61a99f7	[PowerPC] A fix for D159073. Do not optimize when register classes are different in src and dst. For example: ``` %298:g8rc = RLDICL %297:g8rc, 0, 48 %299:gprc = COPY killed %298.sub_32:g8rc dead %498:gprc = ANDI_rec killed %299:gprc, 1, implicit-def dead $cr0, implicit-def $cr0gt ```	2023-09-26 09:28:26 -04:00

1 2 3 4 5 ...

7234 Commits