llvm-project/offload/test/offloading/fortran/target-generic-loops.f90
Sergio Afonso 38a38bb056 [OpenMPOpt] Make parallel regions reachable from new DeviceRTL loop functions
This patch updates the OpenMP optimization pass to know about the new DeviceRTL
functions for loop constructs.

This change marks these functions as potentially containing parallel regions,
which fixes a current bug with the state machine rewrite optimization. It
previously failed to identify parallel regions located inside of the callbacks
passed to these new DeviceRTL functions, causing the resulting code to skip
executing these parallel regions.

As a result, Generic kernels produced by Flang that contain parallel regions
now work properly.

One known related issue not fixed by this patch is that the presence of calls
to these functions will prevent the SPMD-ization of Generic kernels by
OpenMPOpt. Previously, this was due to assuming there was no parallel region.
This is changed by this patch, but instead we now mark it temporarily as
unsupported in an SPMD context. The reason is that, without additional changes,
code intended for the main thread of the team located outside of the parallel
region would not be guarded properly, resulting in race conditions and
generally invalid behavior.
2025-08-15 15:50:06 +01:00

131 lines
2.7 KiB
Fortran

! Offloading test for generic target regions containing different kinds of
! loop constructs inside.
! REQUIRES: flang, amdgpu
! RUN: %libomptarget-compile-fortran-run-and-check-generic
program main
integer :: i1, i2, n1, n2, counter
n1 = 100
n2 = 50
counter = 0
!$omp target map(tofrom:counter)
!$omp teams distribute reduction(+:counter)
do i1=1, n1
counter = counter + 1
end do
!$omp end target
! CHECK: 1 100
print '(I2" "I0)', 1, counter
counter = 0
!$omp target map(tofrom:counter)
!$omp parallel do reduction(+:counter)
do i1=1, n1
counter = counter + 1
end do
!$omp parallel do reduction(+:counter)
do i1=1, n1
counter = counter + 1
end do
!$omp end target
! CHECK: 2 200
print '(I2" "I0)', 2, counter
counter = 0
!$omp target map(tofrom:counter)
counter = counter + 1
!$omp parallel do reduction(+:counter)
do i1=1, n1
counter = counter + 1
end do
counter = counter + 1
!$omp parallel do reduction(+:counter)
do i1=1, n1
counter = counter + 1
end do
counter = counter + 1
!$omp end target
! CHECK: 3 203
print '(I2" "I0)', 3, counter
counter = 0
!$omp target map(tofrom: counter)
counter = counter + 1
!$omp parallel do reduction(+:counter)
do i1=1, n1
counter = counter + 1
end do
counter = counter + 1
!$omp end target
! CHECK: 4 102
print '(I2" "I0)', 4, counter
counter = 0
!$omp target teams distribute reduction(+:counter)
do i1=1, n1
!$omp parallel do reduction(+:counter)
do i2=1, n2
counter = counter + 1
end do
end do
! CHECK: 5 5000
print '(I2" "I0)', 5, counter
counter = 0
!$omp target teams distribute reduction(+:counter)
do i1=1, n1
counter = counter + 1
!$omp parallel do reduction(+:counter)
do i2=1, n2
counter = counter + 1
end do
counter = counter + 1
end do
! CHECK: 6 5200
print '(I2" "I0)', 6, counter
counter = 0
!$omp target teams distribute reduction(+:counter)
do i1=1, n1
!$omp parallel do reduction(+:counter)
do i2=1, n2
counter = counter + 1
end do
!$omp parallel do reduction(+:counter)
do i2=1, n2
counter = counter + 1
end do
end do
! CHECK: 7 10000
print '(I2" "I0)', 7, counter
counter = 0
!$omp target teams distribute reduction(+:counter)
do i1=1, n1
counter = counter + 1
!$omp parallel do reduction(+:counter)
do i2=1, n2
counter = counter + 1
end do
counter = counter + 1
!$omp parallel do reduction(+:counter)
do i2=1, n2
counter = counter + 1
end do
counter = counter + 1
end do
! CHECK: 8 10300
print '(I2" "I0)', 8, counter
end program