llvm-project/Dialect at 11e5a0d290bac87f3290547fa2b0aaffc98a798e - llvm-project - shylie's gitea

shylie/llvm-project

History

Andrzej Warzynski 678360fd9d [mlir][linalg] Add scalar broadcast load case to the vectoriser

This patch extends the Linalg vectoriser so that scalar loads are
correctly identified as scalar rather than gather loads. Below is an
example of a scalar load (note that both indices are loop invariant):
```
func.func @example(%arg0: tensor<80x16xf32>, %arg2: tensor<1x4xf32>) -> tensor<1x4xf32> {
%c8 = arith.constant 8 : index
%c16 = arith.constant 16 : index
%1 = linalg.generic {
    indexing_maps = [affine_map<(d0, d1) -> (d0, d1)>],
    iterator_types = ["parallel", "parallel"]
  } outs(%arg2 : tensor<1x4xf32>) {
  ^bb0(%out: f32):
    %2 = linalg.index 0 : index
    %extracted = tensor.extract %arg0[%2, %c16] : tensor<80x16xf32>
    linalg.yield %extracted : f32
  } -> tensor<1x4xf32>
  return %1 : tensor<1x4xf32>
}
```

This patch also makes sure that these scalar loads are indeed lowered to
a scalar load followed by a broadcast:
```
    %extracted = tensor.extract %arg0[%1, %c16] : tensor<80x16xf32>
    %2 = vector.broadcast %extracted : f32 to vector<1x4xf32>
```

Differential Revision: https://reviews.llvm.org/D149678

2023-06-12 15:18:42 +01:00

..

Fix fold of 0-result 0-trip-count affine.for

2023-05-29 01:53:37 +05:30

[mlir][AMDGPU] Add emulation pass for atomics on AMDGPU targets

2023-05-03 21:18:48 +00:00

[mlir][NFC] Update textual references of func to func.func in AMX/Arithmetic/ArmSVE/Async tests

2022-04-20 22:17:28 -07:00

[mlir][arith] Disallow zero ranked tensors for select's condition

2023-06-01 12:12:46 +05:30

[mlir][NFC] Update textual references of func to func.func in AMX/Arithmetic/ArmSVE/Async tests

2022-04-20 22:17:28 -07:00

[mlir] Add pass to enable Armv9 Streaming SVE mode

2023-05-25 09:20:36 +00:00

[mlir][NFC] Update textual references of func to func.func in AMX/Arithmetic/ArmSVE/Async tests

2022-04-20 22:17:28 -07:00

[mlir][async] Allow to call async.execute inside async.func

2023-01-13 16:04:24 -08:00

[mlir][bufferization] Fix bug in findValueInReverseUseDefChain

2023-05-23 15:30:08 +02:00

[mlir] Add test-convergence option to Canonicalizer tests

2023-01-04 12:02:21 +01:00

[mlir][complex] Canonicalize re/im(neg(create))

2023-05-29 17:52:48 -07:00

[mlir] Add test-convergence option to Canonicalizer tests

2023-01-04 12:02:21 +01:00

[mlir] Cleanup lingering problems surrounding attribute/type aliases

2022-11-30 17:02:54 -08:00

[MLIR][EmitC] Add empty emitc.constant check

2023-04-27 16:03:53 +00:00

[MLIR] Add pass to deduplicate functions

2023-02-27 10:59:53 -05:00

[mlir][sparse][gpu] unify dnmat and dnvec handle and ops

2023-06-09 17:16:48 +00:00

[mlir] Avoid folding index.remu and index.rems for 0 rhs

2023-05-31 10:45:26 -07:00

[mlir][irdl] Add irdl.c_pred

2023-06-08 11:40:48 +01:00

[mlir][linalg] Add scalar broadcast load case to the vectoriser

2023-06-12 15:18:42 +01:00

[mlir][vector][transform] Drop redundant "apply_" from op names

2023-06-08 09:00:12 +02:00

[mlir][llvm] Ensure immediate usage in intrinsics

2023-06-12 06:57:42 +00:00

Revert "Revert "Fix handling of special and large vals in expand pattern for round" and "Add pattern that expands math.roundeven into math.round + arith""

2023-04-22 07:15:40 -07:00

[mlir][transform] Use separate ops instead of PatternRegistry

2023-06-06 11:53:03 +02:00

[mlir] Split MLProgram global load and store to Graph variants

2022-06-16 20:01:54 -07:00

[mlir][gpu][nvvm] refined sparsity selector test and verification of mma.sp

2023-03-17 15:50:36 -07:00

[mlir][openacc] Use new reduction design in acc.loop

2023-05-24 10:51:39 -07:00

[OpenMP][Flang][MLIR] Add MLIR support for OpenMP requires directive

2023-06-12 12:38:04 +01:00

[mlir] Add test-convergence option to Canonicalizer tests

2023-01-04 12:02:21 +01:00

[mlir:PDL] Add support for creating ranges in rewrites

2022-11-08 01:57:57 -08:00

[mlir] Add test-convergence option to Canonicalizer tests

2023-01-04 12:02:21 +01:00

[mlir] NFC: use !transform.any_op in relevant tests

2023-05-22 08:19:46 +00:00

Introduce MLIR Op Properties

2023-05-01 23:16:34 -07:00

Brings back "[mlir][sparse] moving inbound check for slice driven loop into before block of the WhileOp"

2023-06-09 17:45:46 +00:00

[mlir][spirv] Add a canonicalization pattern for UModOp

2023-06-08 10:32:01 -04:00

[mlir][tensor] Add pattern to rewrite tensor.generate as a constant

2023-06-09 12:56:07 +02:00

[MLIR][Tosa] Fix fp canonicalization for clamp

2023-06-07 10:09:24 -07:00

[mlir][transform] add a check for nested consumption in ApplyEachOpTrait

2023-06-09 10:44:24 +00:00

[mlir][vector][transform] Expose tensor slice -> transfer folding patterns

2023-06-09 16:23:25 +02:00

[mlir][NFC] Update textual references of func to func.func in Tensor/Tosa/Vector tests

2022-04-20 22:17:29 -07:00

traits.mlir

[mlir][NFC] Update textual references of func to func.func in IR/Interface tests

2022-04-20 22:17:30 -07:00