llvm-project

Author	SHA1	Message	Date
Craig Topper	fd50cdfb94	[AMDGPU] Use MCRegister. NFC	2024-09-28 11:40:25 -07:00
Ivan Kosarev	f0fe6c66cb	[AMDGPU][NFC] Rename isHi() to isHi16Reg() for clarity. (#103888 ) And declare it to take an MCRegister. Also rename related entities and remove a comment for the function that depending on its purpose is either irrelevant or misleading.	2024-08-14 17:04:15 +01:00
Ivan Kosarev	c7107ca7f8	[AMDGPU][NFCI] Mark AGPRs and VGPRs with different flags in HWEncoding. (#102650 ) Simplifies checks for AGPRs and VGPRs and makes them more explicit and less fragile.	2024-08-12 13:06:19 +01:00
Ivan Kosarev	162386693f	[AMDGPU][MC] Support UC_VERSION_* constants. (#95618 ) Our other tools support them, so we want them in LLVM assembler/disassembler too.	2024-06-18 15:44:14 +01:00
Shilei Tian	e963d0740e	[AMDGPU] Replace `isInlinableLiteral16` with specific version (#84402 ) The current implementation of `isInlinableLiteral16` assumes, a 16-bit inlinable literal is either an `i16` or a `fp16`. This is not always true because of `bf16`. However, we can't tell `fp16` and `bf16` apart by just looking at the value. This patch splits `isInlinableLiteral16` into three versions, `i16`, `fp16`, `bf16` respectively, and call the corresponding version.	2024-03-08 14:49:52 -05:00
Shilei Tian	46734aa1e5	[AMDGPU] Use `bf16` instead of `i16` for bfloat (#80908 ) Currently we generally use `i16` to represent `bf16` in those tablegen files. This patch is trying to use `bf16` directly. Fix #79369.	2024-02-16 15:58:30 -05:00
Matt Arsenault	9bbde62b52	AMDGPU: Simplify else if to else in AMDGPUMCCodeEmitter Fixes #79737	2024-01-30 08:17:04 +05:30
Ivan Kosarev	5a458767dd	[AMDGPU][True16] Support source DPP operands. (#79025 )	2024-01-23 09:52:49 +00:00
Nicolai Hähnle	49b492048a	AMDGPU: Fix packed 16-bit inline constants (#76522 ) Consistently treat packed 16-bit operands as 32-bit values, because that's really what they are. The attempt to treat them differently was ultimately incorrect and lead to miscompiles, e.g. when using non-splat constants such as (1, 0) as operands. Recognize 32-bit float constants for i/u16 instructions. This is a bit odd conceptually, but it matches HW behavior and SP3. Remove isFoldableLiteralV216; there was too much magic in the dependency between it and its use in SIFoldOperands. Instead, we now simply rely on checking whether a constant is an inline constant, and trying a bunch of permutations of the low and high halves. This is more obviously correct and leads to some new cases where inline constants are used as shown by tests. Move the logic for switching packed add vs. sub into SIFoldOperands. This has two benefits: all logic that optimizes for inline constants in packed math is now in one place; and it applies to both SelectionDAG and GISel paths. Disable the use of opsel with v_dot* instructions on gfx11. They are documented to ignore opsel on src0 and src1. It may be interesting to re-enable to use of opsel on src2 as a future optimization. A similar "proper" fix of what inline constants mean could potentially be applied to unpacked 16-bit ops. However, it's less clear what the benefit would be, and there are surely places where we'd have to carefully audit whether values are properly sign- or zero-extended. It is best to keep such a change separate. Fixes: Corruption in FSR 2.0 (latent bug exposed by an LLPC change)	2024-01-04 00:10:15 +01:00
Mariusz Sikora	7f55d7de1a	[AMDGPU] GFX12: Add Split Workgroup Barrier (#74836 ) Co-authored-by: Vang Thao <Vang.Thao@amd.com>	2023-12-13 15:01:13 +01:00
Ivan Kosarev	d1e3d32088	[AMDGPU][NFCI] Decouple actual register encodings from HWEncoding values. (#69452 ) The HWEncoding values currently form a strange mix of actual register codes for some subtargets and types of operands and informational flags. This patch removes the dependency allowing arbitrary changes in the structure of HWEncoding values without breaking register encodings. Such changes, in turn, would make it possible to speed up and simplify getAVOperandEncoding() testing for AGPRs as well as other functions dealing with register codes downstream. They would also allow to maintain the same format of HWEncoding values across our downstream code bases, thus simplifying merging in mainline changes.	2023-10-25 13:24:50 +01:00
Kazu Hirata	c40902c41c	[AMDGPU] Use llvm::endianness::little (NFC) Note that llvm::support::endianness has been renamed to llvm::endianness. This patch replaces support::endianness::little with llvm::endianness::little.	2023-10-12 21:02:02 -07:00
Stanislav Mekhanoshin	ab6c3d5034	[AMDGPU] Change the representation of double literals in operands (#68740 ) A 64-bit literal can be used as a 32-bit zero or sign extended operand. In case of double zeroes are added to the low 32 bits. Currently asm parser stores only high 32 bits of a double into an operand. To support codegen as requested by the https://github.com/llvm/llvm-project/issues/67781 we need to change the representation to store a full 64-bit value so that codegen can simply add immediates to an instruction. There is some code to support compatibility with existing tests and asm kernels. We allow to use short hex strings to represent only a high 32 bit of a double value as a valid literal.	2023-10-12 14:45:45 -07:00
Kazu Hirata	b05dbc4d5f	[llvm] Use llvm::endianness::{big,little,native} (NFC) Now that llvm::support::endianness has been renamed to llvm::endianness, we can use the shorter form. This patch replaces support::endianness::{big,little,native} with llvm::endianness::{big,little,native}.	2023-10-10 20:14:20 -07:00
Ivan Kosarev	9310baa596	[AMDGPU][NFC] Add True16 operand definitions. Reviewed By: Joe_Nash Differential Revision: https://reviews.llvm.org/D156103	2023-09-25 16:48:46 +01:00
Ivan Kosarev	5ae7fde004	[AMDGPU][NFC] Rename SIMCCodeEmitter.cpp to match the new emitter class name. The class was renamed in <https://reviews.llvm.org/D154337>. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D154426	2023-07-05 10:20:26 +01:00
Ivan Kosarev	ee165cdb1b	[AMDGPU] Eliminate SIMCCodeEmitter and de-virtualise encoding methods. Simplifies some future changes needed for <https://github.com/llvm/llvm-project/issues/62629>. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D154337	2023-07-05 10:13:33 +01:00
Daniil Fukalov	48958d02d2	[NFC][AMDGPU] Reduce includes dependencies. 1. Splitted out some parts of R600 target to separate modules/headers. 2. Reduced some include lists in headers. 3. Found and fixed issue with override `GCNTargetMachine::getSubtargetImpl()` and `R600TargetMachine::getSubtargetImpl()` had different return value type than base class. 4. Minor forward declarations cleanup. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D108596	2021-08-25 12:01:55 +03:00
Chandler Carruth	2946cd7010	Update the file headers across all of the LLVM projects in the monorepo to reflect the new license. We understand that people may be surprised that we're moving the header entirely to discuss the new license. We checked this carefully with the Foundation's lawyer and we believe this is the correct approach. Essentially, all code in the project is now made available by the LLVM project under our new license, so you will see that the license headers include that license only. Some of our contributors have contributed code under our old license, and accordingly, we have retained a copy of our old license notice in the top-level files in each project and repository. llvm-svn: 351636	2019-01-19 08:50:56 +00:00
Adrian Prantl	5f8f34e459	Remove \brief commands from doxygen comments. We've been running doxygen with the autobrief option for a couple of years now. This makes the \brief markers into our comments redundant. Since they are a visual distraction and we don't want to encourage more \brief markers in new code either, this patch removes them all. Patch produced by for i in $(git grep -l '\\brief'); do perl -pi -e 's/\\brief //g' $i & done Differential Revision: https://reviews.llvm.org/D46290 llvm-svn: 331272	2018-05-01 15:54:18 +00:00
Tom Stellard	45bb48ea19	R600 -> AMDGPU rename llvm-svn: 239657	2015-06-13 03:28:10 +00:00

21 Commits