llvm-project

Author	SHA1	Message	Date
Stanislav Mekhanoshin	57c1e01e48	[AMDGPU] Don't allow wgp mode on gfx1250 (#153680 ) - gfx1250 only supports cu mode	2025-08-14 15:16:56 -07:00
Stanislav Mekhanoshin	80d430df5d	[AMDGPU] Add MSG_SAVEWAVE_HAS_TDM on gfx1250 (#153483 )	2025-08-13 23:01:50 -07:00
Stanislav Mekhanoshin	fc911fe928	[AMDGPU] Add HW_REG_IB_STS2 on gfx1250 (#153479 )	2025-08-13 23:01:28 -07:00
Stanislav Mekhanoshin	cc0d227154	[AMDGPU] Disable s_setkill on gfx1250 (#153471 )	2025-08-13 23:01:04 -07:00
Stanislav Mekhanoshin	d0ee82040c	[AMDGPU] Add s_barrier_init\|join\|leave instructions (#153296 )	2025-08-12 15:07:07 -07:00
Stanislav Mekhanoshin	dddeb07c2e	[AMDGPU] Restrict packed math FP32 instructions to read only one SGPR per operand on gfx12+ (#152465 ) Sec. 4.6.7.1 of the gfx1250 SPG states that if an SGPR is used as an operand, only one SGPR will be read for both the low and high operations. As a result, the corresponding bits in `op_sel` and `op_sel_hi` must be the same when the operand is an SGPR. Co-authored-by: Tian, Shilei <Shilei.Tian@amd.com> Co-authored-by: Tian, Shilei <Shilei.Tian@amd.com>	2025-08-07 16:13:34 -07:00
Stanislav Mekhanoshin	b296ea9c14	[AMDGPU] s_get_shader_cycles_u64 gfx1250 instruction (#152390 ) It is the same as reading SHADER_CYCLES_LO and SHADER_CYCLES_HI but with a single instruction.	2025-08-06 15:32:28 -07:00
Stanislav Mekhanoshin	66392a8d8d	[AMDGPU] Add XNACK_STATE_PRIV and _MASK gfx1250 registers (#152374 ) Co-authored-by: Pierre Vanhoutryve <pierre.vanhoutryve@amd.com> Co-authored-by: Pierre Vanhoutryve <pierre.vanhoutryve@amd.com>	2025-08-06 14:44:17 -07:00
Stanislav Mekhanoshin	c3103068b7	[AMDGPU] Add more gfx1250 MC tests. NFC. (#152388 ) These are already working, but left downstream.	2025-08-06 14:38:28 -07:00
Stanislav Mekhanoshin	184821b63d	[AMDGPU] Add gfx1250 DS MC tests. NFC. (#152378 )	2025-08-06 14:15:35 -07:00
Stanislav Mekhanoshin	34aed0ed56	[AMDGPU] Add gfx1250 wmma_scale[16]_f32_32x16x128_f4 instructions (#152194 )	2025-08-05 15:15:21 -07:00
Stanislav Mekhanoshin	d08c2977e8	[AMDGPU] Add MC support for new gfx1250 src_flat_scratch_base_lo/hi (#152203 )	2025-08-05 14:35:48 -07:00
Stanislav Mekhanoshin	37fe9f6382	[AMDGPU] Add gfx1250 v_wmma_scale[16]_f32_16x16x128_f8f6f4 MC support (#152014 ) This adds new VOP3PX2e encoding	2025-08-04 14:20:12 -07:00
Stanislav Mekhanoshin	dd0737bd99	[AMDGPU] gfx1250 v_wmma_ld_scale instructions (#152010 )	2025-08-04 11:36:48 -07:00
Stanislav Mekhanoshin	d18511e10a	[AMDGPU] v_cvt_scalef32_sr_pk16_* gfx1250 instructions (#151810 )	2025-08-02 15:21:59 -07:00
Stanislav Mekhanoshin	bc463c059c	[AMDGPU] v_cvt_scalef32_pk16_* gfx1250 instructions (#151807 )	2025-08-02 12:42:12 -07:00
Stanislav Mekhanoshin	7598c25b5a	[AMDGPU] v_cvt_scale_pk16 gfx1250 instructions (#151804 )	2025-08-02 10:45:02 -07:00
Stanislav Mekhanoshin	0988510ad4	[AMDGPU] gfx1250 v_perm_pk16_* instructions (#151773 )	2025-08-01 20:12:35 -07:00
Stanislav Mekhanoshin	cc3932bf29	[AMDGPU] gfx1250 v_cvt_scalef32_sr_pk8_* instructions (#151765 )	2025-08-01 19:25:57 -07:00
Stanislav Mekhanoshin	962ee7a568	[AMDGPU] gfx1250 v_cvt_scalef32_pk8_* instructions (#151758 )	2025-08-01 18:29:45 -07:00
Stanislav Mekhanoshin	33abf05af4	[AMDGPU] gfx1250 v_permlane_* instructions (#151749 )	2025-08-01 16:14:19 -07:00
Stanislav Mekhanoshin	c7bb105e97	[AMDGPU] Add v_cvt_scale_pk8_* gfx1250 instructions (#151616 )	2025-07-31 18:55:59 -07:00
Stanislav Mekhanoshin	49d89bc9f4	[AMDGPU] Add gfx1250 cvt_pk\|sr_fp8\|bf8_f32 instructions (#151595 )	2025-07-31 16:04:46 -07:00
Stanislav Mekhanoshin	e46d938ddf	[AMDGPU] v_cvt_sr_pk_f16_f32 gfx1250 instruction (#151482 )	2025-07-31 12:25:55 -07:00
Stanislav Mekhanoshin	7f93487862	[AMDGPU] Add v_cvt_pk_f16_f32 instruction for gfx1250 (#151469 )	2025-07-31 10:45:06 -07:00
Stanislav Mekhanoshin	ce40863209	[AMDGPU] Add v_cvt_sr\|pk_bf8\|fp8_f16 gfx1250 instructions (#151415 )	2025-07-30 17:24:45 -07:00
Stanislav Mekhanoshin	b3b36d3590	[AMDGPU] Add V_ASHR_PK_I8_I32 and V_ASHR_PK_U8_I32 on gfx1250 (#151389 )	2025-07-30 16:30:47 -07:00
Stanislav Mekhanoshin	62187a60e6	[AMDGPU] Add gfx1250 v_cvt_sr_pk_bf16_f32 instruction (#151385 )	2025-07-30 14:02:03 -07:00
Jun Wang	82f00ea40a	[AMDGPU][MC] In GFX11+ v_pk_fmac_f16 should not allow DPP (#148751 ) In GFX11+ the instruction v_pk_fmac_f16 should not allow DPP.	2025-07-30 13:44:24 -07:00
Stanislav Mekhanoshin	d70f228e83	[AMDGPU] Add gfx1250 V_ADD_{MIN\|MAX}_{U\|I}32 instructions (#151379 )	2025-07-30 13:12:14 -07:00
Stanislav Mekhanoshin	3dfd939a16	[AMDGPU] gfx1250 V_{MIN\|MAX}_{I\|U}64 opcodes (#151256 )	2025-07-29 19:13:51 -07:00
Changpeng Fang	9b4a44d63d	[AMDGPU] Update MC tests for vflat instructions on GFX1250 (#151232 ) These instructions have already been supported (at MC layer) with current upstream code base.	2025-07-29 15:39:14 -07:00
Stanislav Mekhanoshin	7eaf1f2b2d	[AMDGPU] Bitop3 opcodes for gfx1250 (#151235 )	2025-07-29 15:36:56 -07:00
Stanislav Mekhanoshin	d99238263c	[AMDGPU] Implement v_mad_u32/v_mad_nc_u\|i64_u32 on gfx1250 (#151226 )	2025-07-29 15:06:35 -07:00
Changpeng Fang	6184ef1c2f	[AMDGPU] Support f64 atomics on gfx1250 (#151172 ) - BUF/FLAT/GLOBAL_ADD/MIN/MAX_F64 - DS_ADD_F64 Co-authored-by: Konstantin Zhuravlyov <Konstantin Zhuravlyov@amd.com>	2025-07-29 09:41:00 -07:00
Changpeng Fang	67e2faa50c	[AMDGPU] MC support for async load and store on gfx1250 (#151030 )	2025-07-28 13:45:37 -07:00
Changpeng Fang	ecd793cbb1	[AMDGPU] Add v_fma_mix_f32_f16 as an alias of v_fma_mix_f32 on gfx1250 (#150502 ) Co-authored-by: Jay Foad <Jay.Foad@amd.com>	2025-07-24 12:42:30 -07:00
Changpeng Fang	34b6587249	[AMDGPU] MC support for load monitor instructions on gfx1250 (#150496 )	2025-07-24 12:16:47 -07:00
Stanislav Mekhanoshin	a70f7dafc1	[AMDGPU] gfx1250 flat and global prefetch MC support (#150455 )	2025-07-24 11:00:56 -07:00
Changpeng Fang	473bc0d188	[AMDGPU] Support V_FMA_MIX*_BF16 instructions on gfx1250 (#150381 ) Co-authored-by: Stanislav Mekhanoshin <Stanislav.Mekhanoshin@amd.com>	2025-07-24 09:43:49 -07:00
Changpeng Fang	9a563b08e2	[AMDGPU] Support V_PK_MIN3/MAX3_NUM_F16 on gfx1250 (#150326 ) Co-authored-by: Stanislav Mekhanoshin <Stanislav.Mekhanoshin@amd.com>	2025-07-23 15:15:19 -07:00
Changpeng Fang	203ea0a97e	AMDGPU: Support V_PK_MAXIMUM3_F16 and V_PK_MINIMUM3_F16 on gfx1250 (#150307 ) Co-authored-by: Stanislav Mekhanoshin <Stanislav.Mekhanoshin@amd.com>	2025-07-23 13:45:01 -07:00
Stanislav Mekhanoshin	2346968807	[AMDGPU] Add V_ADD\|SUB\|MUL_U64 gfx1250 opcodes (#150291 )	2025-07-23 13:17:56 -07:00
Changpeng Fang	bc1f85d234	AMDGPU: Support packed bf16 instructions on gfx1250 (#150283 ) Co-authored-by: Stanislav Mekhanoshin <Stanislav.Mekhanoshin@amd.com>	2025-07-23 12:01:23 -07:00
Changpeng Fang	81185f7a2b	AMDGPU: Add packed fp32 instructions for gfx1250 (#150253 ) Co-authored-by: Stanislav Mekhanoshin <Stanislav.Mekhanoshin@amd.com>	2025-07-23 10:26:50 -07:00
Changpeng Fang	d385e9d86b	AMDGPU: Support V_PK_ADD_{MIN\|MAX}_{I\|U}16 and V_{MIN\|MAX}3_{I\|U}16 on gfx1250 (#150155 ) Co-authored-by: Stanislav Mekhanoshin <Stanislav.Mekhanoshin@amd.com>	2025-07-23 00:17:22 -07:00
Shilei Tian	fbeb801a71	[AMDGPU] Add support for `v_cvt_pk_bf16_f32` on gfx1250 (#150053 ) Co-authored-by: Mekhanoshin, Stanislav <Stanislav.Mekhanoshin@amd.com>	2025-07-22 20:00:50 -04:00
Stanislav Mekhanoshin	a0b854d576	[AMDGPU] MC support for gfx1250 scale_offset modifier (#149881 )	2025-07-21 15:04:59 -07:00
Stanislav Mekhanoshin	b66084acd9	[AMDGPU] Verify asm VGPR alignment on gfx1250 (#149880 ) Co-authored-by: Shilei Tian <Shilei.Tian@amd.com>	2025-07-21 14:23:27 -07:00
Jun Wang	234338852c	Reapply "[AMDGPU][MC] Allow op_sel in v_alignbit_b32 etc in GFX9 and … (#149262 ) Fixed the problem in ce7851f6b7d59e50f92cb4e8dbfd801576c8b641. This reverts commit ba271cc07334c74df55741701e5b22032c0cddbb.	2025-07-21 10:14:45 -07:00

1 2 3 4 5 ...

1144 Commits