### Problem
PR #142944 introduced a new canonicalization pattern which caused
failures in the following GPU-related integration tests:
- mlir/test/Integration/GPU/CUDA/TensorCore/sm80/transform-mma-sync-matmul-f16-f16-accum.mlir
- mlir/test/Integration/GPU/CUDA/TensorCore/sm80/transform-mma-sync-matmul-f32.mlir
The issue occurs because the new canonicalization pattern can generate
multi-dimensional `vector.from_elements` operations (rank > 1), but the
GPU lowering pipelines were not equipped to handle these during the
conversion to LLVM.
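
For illustration, this is the kind of rank-2 `vector.from_elements` the canonicalization can now produce (operand names and element type are hypothetical); the GPU-to-LLVM path only handled the rank-1 form directly, so such ops failed to legalize:

```mlir
// Hypothetical rank-2 from_elements; prior to this fix, only the rank-1
// form was lowered on the GPU-to-LLVM path.
%m = vector.from_elements %a, %b, %c, %d : vector<2x2xf16>
```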
### Fix
This PR adds `vector::populateVectorFromElementsLoweringPatterns` to the
GPU lowering passes used by `gpu-lower-to-nvvm-pipeline` (a sketch of the
registration follows the list):
- `GpuToLLVMConversionPass`: the general GPU-to-LLVM conversion pass.
- `LowerGpuOpsToNVVMOpsPass`: the NVVM-specific lowering pass.
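
A minimal sketch of the registration, assuming the usual populate-function convention (a `RewritePatternSet &` parameter) and header location; the helper below is hypothetical and only shows where the call slots in, not the exact upstream diff:

```cpp
#include "mlir/Dialect/Vector/Transforms/LoweringPatterns.h" // assumed header location
#include "mlir/IR/PatternMatch.h"

// Hypothetical helper mirroring what each pass does when it builds its
// pattern set: the from_elements lowering patterns join the set that already
// carries the GPU-to-LLVM conversion patterns, so multi-dimensional
// vector.from_elements ops are decomposed before the LLVM conversion runs.
static void addFromElementsLowering(mlir::RewritePatternSet &patterns) {
  mlir::vector::populateVectorFromElementsLoweringPatterns(patterns);
}
```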
Co-authored-by: Yang Bai <yangb@nvidia.com>