llvm-project

Author	SHA1	Message	Date
aankit-ca	f9d3278901	[Hexagon] Add saturating add instructions (#148132 ) Generate the saturating add instructions for sadd.sat for scalar and vector instructions Co-authored-by: aankit-quic <aankit@quicinc.com> Co-authored-by: Jyotsna Verma <jverma@quicinc.com>	2025-07-11 15:00:05 -07:00
Simon Pilgrim	e608e3f023	Fix MSVC "result of 32-bit shift implicitly converted to 64 bits" warning. NFC.	2025-07-11 10:12:32 +01:00
pkarveti	1f39b92a16	[Hexagon] Handle bitcast of i32/v2i16/v4i8 -> v32i1 when Hvx is enabled (#147466 )	2025-07-11 01:26:53 -07:00
Kazu Hirata	6433030f69	[Hexagon] Use range-based for loops (NFC) (#147157 )	2025-07-05 12:26:42 -07:00
YunQiang Su	5a7e72d575	Hexagon: sfmax/sfmin instructions are IEEE754-2019 (#139056 ) The min/max instructions of Hexagon follow IEEE754-2019 minimumNumber/maximumNumber, aka FMINIMUMNUM and FMAXIMUMNUM instead of FMAXNUM and FMINNUM	2025-05-14 11:55:11 +08:00
Abinaya Saravanan	9c65e6ac11	[HEXAGON] Add support to lower "FREEZE a half(f16)" instruction on Hexagon and fix the isel-buildvector-v2f16.ll assertion (#130977 )	2025-03-12 16:58:26 -05:00
aankit-ca	29d3fc3f11	[HEXAGON] Fix hvx-isel for extract_subvector op (#129672 ) Fixes a crash with extract_subvectors in Hexagon backend seen when the source vector is a vector-pair and result vector is not hvx vector size. LLVM Issue: https://github.com/llvm/llvm-project/issues/128775 Fixes #128775 --------- Co-authored-by: aankit-quic <aankit@quicinc.com>	2025-03-06 17:02:10 -06:00
Sudharsan Veeravalli	27c034a9c6	[Hexagon] Fix typos discovered by codespell (NFC) (#126233 ) Found using https://github.com/codespell-project/codespell ``` codespell Hexagon --write-changes \ --ignore-words-list=CarryIn,CreateOr,ORE,COPYs,ArchVer,predicable,UE,MIs,isNT,Vor,CountR,DUM,GEs,AddD,ToI, \ CopyIn,TheI,TotalIn,vor,MOne,contigious,Contigious ```	2025-02-11 11:31:31 +05:30
Craig Topper	bc282605df	[SelectionDAG] Require last operand of (STRICT_)FP_ROUND to be a TargetConstant. (#117639 ) Fix all the places I could find that did't do this. We were already mostly correct for FP_ROUND after 9a976f36615dbe15e76c12b22f711b2e597a8e51, but not STRICT_FP_ROUND.	2024-11-25 21:36:33 -08:00
Nikita Popov	22fdc57140	[Hexagon] Avoid implicit truncation in getConstant() Use getSignedConstant() or change variable type as appropriate. This will avoid assertion failures when implicit truncation is disabled.	2024-11-22 15:07:31 +01:00
Kazu Hirata	7df9da7d78	[llvm] Construct SmallVector with ArrayRef (NFC) (#101872 )	2024-08-04 08:54:23 -07:00
Santanu Das	2771ea4ea4	[Hexagon] Fix concat lowering for HVX for 64B vector length (#98318 ) When concatenation of vector instructions is formed, as a part of it vector rotation is performed. The direction of the shift was not correctly calculated. This fixes the rotation factor.	2024-08-01 11:12:47 -05:00
Nikita Popov	f2f18459d4	Revert "Intrinsic: introduce minimumnum and maximumnum (#93841 )" As far as I can tell, this pull request was not approved, and did not go through an RFC on discourse. This reverts commit 89881480030f48f83af668175b70a9798edca2fb. This reverts commit 225d8fc8eb24fb797154c1ef6dcbe5ba033142da.	2024-06-21 08:34:04 +02:00
YunQiang Su	8988148003	Intrinsic: introduce minimumnum and maximumnum (#93841 ) Currently, on different platform, the behaivor of llvm.minnum is different if one operand is sNaN: When we compare sNaN vs NUM: ARM/AArch64/PowerPC: follow the IEEE754-2008's minNUM: return qNaN. RISC-V/Hexagon follow the IEEE754-2019's minimumNumber: return NUM. X86: Returns NUM but not same with IEEE754-2019's minimumNumber as +0.0 is not always greater than -0.0. MIPS/LoongArch/Generic: return NUM. LIBCALL: returns qNaN. So, let's introduce llvm.minmumnum/llvm.maximumnum, which always follow IEEE754-2019's minimumNumber/maximumNumber. Half-fix: #93033	2024-06-21 11:53:08 +08:00
Alex Bradbury	197214e39b	[RFC][SelectionDAG] Add and use SDNode::getAsZExtVal() helper (#76710 ) This follows on from #76708, allowing `cast<ConstantSDNode>(N)->getZExtValue()` to be replaced with just `N->getAsZextVal();` Introduced via `git grep -l "cast<ConstantSDNode>\(.\).getZExtValue" \| xargs sed -E -i 's/cast<ConstantSDNode>\((.*)\)->getZExtValue/\1->getAsZExtVal/'` and then using `git clang-format` on the result.	2024-01-09 12:25:17 +00:00
Alex Bradbury	80aeb62211	[llvm][NFC] Use SDValue::getConstantOperandVal(i) where possible (#76708 ) This helper function shortens examples like `cast<ConstantSDNode>(Node->getOperand(1))->getZExtValue();` to `Node->getConstantOperandVal(1);`. Implemented with: `git grep -l "cast<ConstantSDNode>\(.->getOperand\(.\)\)->getZExtValue\(\)" \| xargs sed -E -i 's/cast<ConstantSDNode>\((.)->getOperand\((.)\)\)->getZExtValue\(\)/\1->getConstantOperandVal(\2)/` and `git grep -l "cast<ConstantSDNode>\(.\.getOperand\(.\)\)->getZExtValue\(\)" \| xargs sed -E -i 's/cast<ConstantSDNode>\((.)\.getOperand\((.)\)\)->getZExtValue\(\)/\1.getConstantOperandVal(\2)/'`. With a couple of simple manual fixes needed. Result then processed by `git clang-format`.	2024-01-02 13:14:28 +00:00
Sander de Smalen	81b7f115fb	[llvm][TypeSize] Fix addition/subtraction in TypeSize. (#72979 ) It seems TypeSize is currently broken in the sense that: TypeSize::Fixed(4) + TypeSize::Scalable(4) => TypeSize::Fixed(8) without failing its assert that explicitly tests for this case: assert(LHS.Scalable == RHS.Scalable && ...); The reason this fails is that `Scalable` is a static method of class TypeSize, and LHS and RHS are both objects of class TypeSize. So this is evaluating if the pointer to the function Scalable == the pointer to the function Scalable, which is always true because LHS and RHS have the same class. This patch fixes the issue by renaming `TypeSize::Scalable` -> `TypeSize::getScalable`, as well as `TypeSize::Fixed` to `TypeSize::getFixed`, so that it no longer clashes with the variable in FixedOrScalableQuantity. The new methods now also better match the coding standard, which specifies that: * Variable names should be nouns (as they represent state) * Function names should be verb phrases (as they represent actions)	2023-11-22 08:52:53 +00:00
Kazu Hirata	384b81539e	[Hexagon] Use isNullConstant (NFC)	2023-09-02 12:12:14 -07:00
David Green	2802739dfd	[NFC] Replace ;; with ;	2023-06-11 10:25:24 +01:00
Krzysztof Parzyszek	b9e5dacc0d	[Hexagon] Split SETCC on float16/float32 HVX pairs	2023-05-17 10:42:58 -07:00
Krzysztof Parzyszek	35742743d2	[Hexagon] Fix number of arguments in call to DAG.getNode for VINSERTW0 HexagonISD::VINSERTW0 takes two inputs, but only one was provided.	2023-02-16 09:57:58 -08:00
Krzysztof Parzyszek	073d5e5945	[Hexagon] Further improve code generation for shuffles * Concatenate partial shuffles into longer ones whenever possible: In selection DAG, shuffle's operands and return type must all agree. This is not the case in LLVM IR, and non-conforming IR-level shuffles will be rewritten to match DAG's requirements. This can also make a shuffle that can be matched to a single HVX instruction become shuffles that require more complex handling. Example: anything that takes two single vectors and returns a pair (e.g. V6_vshuffvdd). This is avoided by concatenating such shuffles into ones that take a vector pair, and an undef pair, and produce a vector pair. * Recognize perfect shuffles when masks contain `undef` values. * Use funnel shifts for contracting shuffles. * Recognize rotations as a separate step. These changes go into a single commit, because each one on their own introduced some regressions.	2022-11-29 14:19:29 -08:00
Krzysztof Parzyszek	d9ed93da86	[Hexagon] Don't lower legal EXTRACT_SUBVECTOR to EXTRACT_SUBREG EXTRACT_SUBREG is a machine opcode and cannot be a part of an input selection pattern.	2022-11-24 18:10:42 -08:00
Krzysztof Parzyszek	8407c9916d	[Hexagon] Use BUILD_PAIR instead of HexagonISD::COMBINE in lowering	2022-11-17 12:31:48 -08:00
Stanislav Mekhanoshin	bcaf31ec3f	[AMDGPU] Allow finer grain control of an unaligned access speed A target can return if a misaligned access is 'fast' as defined by the target or not. In reality there can be different levels of 'fast' and 'slow'. This patch changes the boolean 'Fast' argument of the allowsMisalignedMemoryAccesses family of functions to an unsigned representing its speed. A target can still define it as it wants and the direct translation of the current code uses 0 and 1 for current false and true. This makes the change an NFC. Subsequent patch will start using an actual value of speed in the load/store vectorizer to compare if a vectorized access going to be not just fast, but not slower than before. Differential Revision: https://reviews.llvm.org/D124217	2022-11-17 09:23:53 -08:00
Krzysztof Parzyszek	9422a8d94c	[Hexagon] Break up vectors into HVX-sized chunks in HvxIdioms This will allow recognizing Q.31 multiplications on vectors that are multiplies of HVX vectors. At the moment this comes at the expense of Q.15 multiplications, which now are handled as 32-bit multiplications with shifts. In the longer term this will likely be replaced by a different scheme of "legalizing" vectors, which is necessary for idiom recognition, at least where using direct HVX instrinsics is desired.	2022-10-29 11:13:28 -07:00
Krzysztof Parzyszek	2ec380b23f	[Hexagon] Improve handling of 32-bit mulh/mul_lohi on HVX Handle MULH[US] by normalizing them into newly invented nodes HexagonISD::(S\|U\|US)MUL_LOHI. On HVX v60, if only the high part of SMUL_LOHI is used, use the original MULHS expansion. In all other cases, expand the full product. On HVX v62, always expand the full product. Introduce Hexagon-specific LLVM IR intrinsics for 32x32 multiplication returning low/high parts.	2022-10-22 15:08:27 -07:00
Krzysztof Parzyszek	6a8cfe9a72	[Hexagon] Use shifts by scalar for funnel shifts by scalar HVX has vector shifts by a scalar register. Use those in the expansions of funnel shifts where profitable.	2022-10-18 09:49:17 -07:00
Krzysztof Parzyszek	9fde8e907b	[Hexagon] Fix MULHS lowering for HVX v60 The carry bit from an intermediate addition was not properly propagated. For example mulhs(7fffffff, 7fffffff) was evaluated as 3ffeffff, while the correct result is 3fffffff.	2022-10-18 07:54:38 -07:00
Krzysztof Parzyszek	705e77abed	[Hexagon] Lower funnel shifts for HVX HVX v62+ has bidirectional shifts, which do not mask the shift amount to the bit width. Instead, the shift amount is sign-extended from the log(BW) bit value, and a negative value causes a shift in the other direction. For the shift amount being -log(BW), this reversed shift will shift all bits out, inserting 0s or sign bits depending on the type and direction.	2022-10-14 14:13:18 -07:00
Krzysztof Parzyszek	7f4ce3f1eb	[Hexagon] Introduce PS_vsplat[ir][bhw] pseudo instructions HVX v60 only has splats that take a 32-bit word as input, while v62+ has splats that take 8- or 16-bit value. This makes writing output patterns that need to use a splat annoying, because the entire output pattern needs to be replicated for various versions of HVX. To avoid this, the patterns will always use the pseudos, and then the pseudos will be handled using a post-ISel hook.	2022-10-14 12:03:13 -07:00
Fangrui Song	8805e5d1b7	[Hexagon] Fix -Wunused-variable in non-assertion builds after f6e7ad56040ee8531be3375e04312d30700a58b2	2022-09-21 14:14:45 -07:00
Krzysztof Parzyszek	f6e7ad5604	[Hexagon] Revamp type legalization of ext/trunc/sat in HVX Resizing operations (e.g. sign extension) in DAG can go from any width to any other width, e.g. i8 -> i32. If the input and the result differ by a factor larger than 2, the operation cannot be legal in HVX, since the only two legal vector sizes in HVX are a single vector and a pair of vectors. To simplify the legalization, such operations are expanded into steps that only double/halve the type size, so that each such step can be fully legalized on its own. The complication is that DAG will automatically fold these steps back into one, e.g. sext(sext) -> sext. To prevent that new HexagonISD nodes are introduced: TL_EXTEND and TL_TRUNCATE. Once legalized, these nodes are replaced with the original opcodes. The type legalization is now common to aext/sext/zext/trunc and Hexagon- specific ssat/usat nodes.	2022-09-21 11:25:27 -07:00
Simon Pilgrim	0015edeefd	Fix MSVC "result of 32-bit shift implicitly converted to 64 bits" warning. NFCI.	2022-09-20 14:24:07 +01:00
Simon Pilgrim	70582bc4d3	Fix MSVC "result of 32-bit shift implicitly converted to 64 bits" warnings. NFCI.	2022-09-20 10:35:32 +01:00
Haojian Wu	eec19987c0	Fix one more unused warning in release build, NFC	2022-09-19 20:56:39 +02:00
Haojian Wu	20822e2d42	Fix an unused warning in release build, NFC	2022-09-19 20:45:51 +02:00
Krzysztof Parzyszek	94a71361d6	[Hexagon] Implement [SU]INT_TO_FP and FP_TO_[SU]INT for HVX	2022-09-19 11:11:20 -07:00
Krzysztof Parzyszek	ec51e38062	[Hexagon] Add HVX patterns for ISD::ABS	2022-09-19 10:12:15 -07:00
Krzysztof Parzyszek	3eee45cdc8	[Hexagon] Rework SplitHvxPairOp to be a general vector splitting utiity Enable creating an idiom: V -> opJoin(SplitVectorOp(V))	2022-09-19 09:42:13 -07:00
Krzysztof Parzyszek	e5844462f6	[Hexagon] Use proper output chain when widening HVX loads	2022-09-19 09:04:13 -07:00
Shao-Ce SUN	662b9fa02c	[NFC][CodeGen] Add a setTargetDAGCombine use ArrayRef Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D122557	2022-03-29 09:53:24 +08:00
Krzysztof Parzyszek	0792161c00	[Hexagon] Fix operation actions for v128f16 There were more cases of operations that should have been "Custom" for v128f16, but ended up "Legal" (e.g. load and store).	2022-02-08 15:28:37 -08:00
Krzysztof Parzyszek	7403c02f06	[Hexagon] Fix crash with shuffle_vector of v128f16	2022-02-08 13:05:22 -08:00
Mikael Holmen	e2078d56ed	[Hexagon] Fix warning about unused variable when compiling without asserts	2022-01-05 08:35:16 +01:00
Krzysztof Parzyszek	78f5014fea	[Hexagon] Conversions to/from FP types, HVX and scalar Co-authored-by: Anirudh Sundar Subramaniam <quic_sanirudh@quicinc.com> Co-authored-by: Sumanth Gundapaneni <sgundapa@quicinc.com>	2022-01-04 11:03:51 -08:00
Krzysztof Parzyszek	db83e3e507	[Hexagon] Generate HVX/FP arithmetic instructions Co-authored-by: Anirudh Sundar Subramaniam <quic_sanirudh@quicinc.com> Co-authored-by: Sumanth Gundapaneni <sgundapa@quicinc.com> Co-authored-by: Joshua Herrera <joshherr@quicinc.com>	2021-12-30 12:47:30 -08:00
Krzysztof Parzyszek	9e6afbedb0	[Hexagon] Generate HVX/FP compare instructions Co-authored-by: Anirudh Sundar Subramaniam <quic_sanirudh@quicinc.com>	2021-12-30 12:17:22 -08:00
Krzysztof Parzyszek	eb574259b6	[Hexagon] Handle HVX/FP {masked,wide} loads/stores Co-authored-by: Rahul Utkoor <quic_rutkoor@quicinc.com> Co-authored-by: Anirudh Sundar Subramaniam <quic_sanirudh@quicinc.com>	2021-12-30 10:14:00 -08:00
Krzysztof Parzyszek	23423638cc	[Hexagon] Handle HVX/FP shuffles, insertion and extraction Co-authored-by: Anirudh Sundar Subramaniam <quic_sanirudh@quicinc.com>	2021-12-30 08:44:10 -08:00

1 2 3

143 Commits