llvm-project

Author	SHA1	Message	Date
Matt Arsenault	47685633a7	AMDGPU: Make v4bf16 a legal type (#76217 ) Gets a few code quality improvements. A few cases are worse from losing load narrowing. Depends #76213 #76214 #76215	2024-01-05 08:35:07 +07:00
Matt Arsenault	460ffcddd9	AMDGPU: Make bf16/v2bf16 legal types (#76215 ) There are some intrinsics are using i16 vectors in place of bfloat vectors. Move towards making bf16 vectors legal so these can migrate. Leave the larger vectors for a later change. Depends #76213 #76214	2024-01-04 22:31:18 +07:00
Matt Arsenault	b01adc6bed	AMDGPU: Strengthen some bfloat tests Fix bitcast test, which was splitting apart phis intended to force bitcasts that survive all the way to selection. Disable the amdgpu-codegenprepare phi splitting, which defeats the technique of using a phi to ensure a bitcast reaches all the way to selection. Also add a variety of bfloat tests. These probably need revisiting to avoid the cast folding into argument loads. Also round out set of bfloat bitcast and ABI tests. Add codegen tests for more bf16 operations The promotion of these works contrary to the comment.	2023-12-20 19:33:45 +07:00
Nikita Popov	bdf2fbba9c	[AMDGPU] Convert some tests to opaque pointers (NFC)	2022-12-19 12:41:13 +01:00
Matt Arsenault	ada6aa3f5c	AMDGPU: Fold undef rcp to qnan This matches the behavior in instcombine, and for fdiv.	2022-11-04 15:49:37 -07:00
Matt Arsenault	7a84624079	AMDGPU: Make various vector undefs legal Surprisingly these were getting legalized to something zero initialized. This fixes an infinite loop when combining some vector types. Also fixes zero initializing some undef values. SimplifyDemandedVectorElts / SimplifyDemandedBits are not checking for the legality of the output undefs they are replacing unused operations with. This resulted in turning vectors into undefs that were later re-legalized back into zero vectors.	2022-09-28 10:48:52 -04:00
Stanislav Mekhanoshin	c12d64ab16	Moved dag-combine-select-undef.ll into amdgpu. NFC. Tests really needs target arch to be specified. llvm-svn: 347115	2018-11-17 00:17:15 +00:00
Stanislav Mekhanoshin	0ff7c8309d	DAG combiner: fold (select, C, X, undef) -> X Differential Revision: https://reviews.llvm.org/D54646 llvm-svn: 347110	2018-11-16 23:13:38 +00:00

8 Commits