llvm-project

Author	SHA1	Message	Date
Boian Petkantchin	dc3258c617	[mlir][mesh] Add all-slice operation (#81218) This op is the inverse of all-gather. It is useful to have an explicit, concise representation instead of a blob of slicing logic. Add lowering for the op that slices from the tensor based on the in-group process index. Make resharding generate an all-slice instead of inserting the slicing logic directly.	2024-02-15 13:03:58 -08:00
Boian Petkantchin	31fc0a12e1	[mlir][mesh] Refactoring code organization, tests and docs (#79606) * Split out `MeshDialect.h`, which defines the dialect class, from `MeshOps.h`. This reduces include clutter if you care only about the dialect and not the ops. * Expose functions `getMesh` and `collectiveProcessGroupSize`. These functions are useful for outside users of the dialect. * Remove unused code. * Remove examples and tests of the mesh.shard attribute in tensor encoding, per the decision that Spmdization would be performed on sharding annotations and there will be no tensors with sharding specified in the type. For more info see this RFC comment: https://discourse.llvm.org/t/rfc-sharding-framework-design-for-device-mesh/73533/81	2024-01-31 07:20:14 -08:00
Boian Petkantchin	9a8437f504	[mlir][mesh] Rename cluster to mesh (#79484) Rename * Op mesh.cluster -> mesh.mesh * Op mesh.cluster_shape -> mesh.mesh_shape * variables and attributes. The name `mesh` is more specific to what it really represents: a mesh of devices. The name `cluster` implies a broader possibility of device configurations. When just the word `mesh` is used, it can often be inferred from the context whether it refers to the mesh dialect or a device mesh. The full name can be used when needed.	2024-01-26 07:03:29 -08:00
Emilio Cota	a1dc813f75	[mlir][mesh] fix unused variable error	2024-01-10 14:32:57 -05:00
Boian Petkantchin	79aa776267	[mlir][mesh] Add lowering of process multi-index op (#77490) * Rename mesh.process_index -> mesh.process_multi_index. * Add mesh.process_linear_index op. * Add lowering of mesh.process_multi_index into an expression using mesh.process_linear_index, mesh.cluster_shape and affine.delinearize_index. This is useful to lower mesh ops and prepare them for further lowering where the runtime may have only the linear index of a device/process. For example, in MPI we have a rank (linear index) in a communicator.	2024-01-10 07:01:16 -08:00
Jie Fu	046dffce23	Fix -Wunused-variable in TestSimplifications.cpp (NFC) llvm-project/mlir/test/lib/Dialect/Mesh/TestSimplifications.cpp:36:17: error: unused variable 'status' [-Werror,-Wunused-variable] LogicalResult status = ^ 1 error generated.	2024-01-10 07:59:19 +08:00
Boian Petkantchin	ab590377a3	[mlir][mesh] Add folding of ClusterShapeOp (#77033) If the mesh has static size on some of the requested axes, the result is substituted with a constant.	2024-01-09 13:42:56 -08:00
Boian Petkantchin	1a8fb88719	[mlir][mesh] Add resharding spmdization on a 1D device mesh (#76179) The current implementation supports only sharding of tensor axes that have size divisible by the mesh axis size.	2024-01-02 15:50:07 -08:00
Boian Petkantchin	4b3446771f	[mlir][mesh] Add endomorphism simplification for all-reduce (#73150) Performs transformations such as `all_reduce(x) + all_reduce(y) -> all_reduce(x + y)` and `max(all_reduce(x), all_reduce(y)) -> all_reduce(max(x, y))`, the latter when the all_reduce element-wise op is max. Added general rewrite patterns HomomorphismSimplification and EndomorphismSimplification that encapsulate the general algorithm. Made specializations for all-reduce with respect to addf, addi, minsi, maxsi, minimumf and maximumf in the Arithmetic dialect.	2023-12-12 10:21:52 -08:00
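
The algebraic identity behind the all-reduce simplification in commit 4b3446771f can be checked with a small simulation. This is only a sketch in plain Python, not the MLIR rewrite pattern itself: the helpers `all_reduce` and `elementwise` are hypothetical names introduced here, and a process group is modeled as a list holding one integer per process.

```python
from functools import reduce
import operator


def all_reduce(per_process_values, op):
    """Simulate an all-reduce: every process receives the reduction of all inputs."""
    total = reduce(op, per_process_values)
    return [total] * len(per_process_values)


def elementwise(op, xs, ys):
    """Apply a binary op locally on each process (no communication)."""
    return [op(x, y) for x, y in zip(xs, ys)]


# One value per process, for a hypothetical group of four processes.
x = [1, 5, -2, 7]
y = [4, 0, 9, -3]

# Two collectives followed by a local add ...
two_sum = elementwise(operator.add,
                      all_reduce(x, operator.add),
                      all_reduce(y, operator.add))
# ... versus a local add followed by a single collective.
one_sum = all_reduce(elementwise(operator.add, x, y), operator.add)

# The same comparison when both the reduction and the element-wise op are max.
two_max = elementwise(max, all_reduce(x, max), all_reduce(y, max))
one_max = all_reduce(elementwise(max, x, y), max)

print(two_sum == one_sum, two_max == one_max)
```

Both forms agree on every process, and the rewritten form needs one collective instead of two, which is presumably the motivation for the simplification.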

9 Commits
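
The lowering described in commit 79aa776267 turns a linear process index (for example an MPI rank) into a multi-index over the mesh shape. The arithmetic can be sketched as follows. This is an illustration in Python, not the MLIR lowering: the function names are hypothetical, and a row-major layout (last mesh axis varies fastest) is assumed here.

```python
def delinearize_index(linear_index, shape):
    """Convert a linear index into a multi-index, assuming row-major order."""
    multi_index = []
    # Peel off the fastest-varying (last) axis first.
    for extent in reversed(shape):
        multi_index.append(linear_index % extent)
        linear_index //= extent
    return tuple(reversed(multi_index))


def linearize_index(multi_index, shape):
    """Inverse of delinearize_index under the same row-major assumption."""
    linear_index = 0
    for index, extent in zip(multi_index, shape):
        linear_index = linear_index * extent + index
    return linear_index


# A hypothetical 2x3 device mesh: rank 5 sits at row 1, column 2.
mesh_shape = (2, 3)
print(delinearize_index(5, mesh_shape))
```

Round-tripping every rank through `delinearize_index` and `linearize_index` returns the original rank, which is a quick sanity check on the chosen axis order.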