The PR modifies the subgroup distribution pass to only sink insert_strided_slice operation if it becomes the last op before yield. It avoids sinking insert_strided_slice multiple times and cause potential issue in worst case.
The PR modifies the subgroup distribution pass to only sink insert_strided_slice operation if it becomes the last op before yield. It avoids sinking insert_strided_slice multiple times and cause potential issue in worst case.