Vyacheslav Levytskyy 0a443f13b4
[SPIR-V] Add implementation of G_SPLAT_VECTOR opcode and fix invalid types processing (#84766)
This PR:
* adds support for G_SPLAT_VECTOR generic opcode that may be legally
generated instead of G_BUILD_VECTOR by previous passes of the translator
(see https://github.com/llvm/llvm-project/pull/80378 for the source of
breaking changes);
* improves deduction of types for opaque pointers.

This PR also fixes the following issues:
* if a function has ptr argument(s), two functions that have different
SPIR-V type definitions may get identical LLVM function types and break
agreements of global register and duplicate checker;
* checks for pointer types do not account for TypedPointerType.

Update of tests:
* A test case is added to cover the issue with function ptr parameters.
* The first case, that is support for G_SPLAT_VECTOR generic opcode, is
covered by existing test cases.
* Multiple additional checks by `spirv-val` is added to cover more
possibilities of generation of invalid code.
2024-03-13 08:32:01 +01:00

32 lines
1.0 KiB
LLVM

; RUN: llc -O0 -mtriple=spirv32-unknown-unknown %s -o - | FileCheck %s --check-prefix=CHECK-SPIRV
; RUN: %if spirv-tools %{ llc -O0 -mtriple=spirv32-unknown-unknown %s -o - -filetype=obj | spirv-val %}
;; kernel void testConvertPtrToU(global int *a, global unsigned long *res) {
;; res[0] = (unsigned long)&a[0];
;; }
; CHECK-SPIRV: OpConvertPtrToU
define dso_local spir_kernel void @testConvertPtrToU(i32 addrspace(1)* noundef %a, i64 addrspace(1)* nocapture noundef writeonly %res) local_unnamed_addr {
entry:
%0 = ptrtoint i32 addrspace(1)* %a to i32
%1 = zext i32 %0 to i64
store i64 %1, i64 addrspace(1)* %res, align 8
ret void
}
;; kernel void testConvertUToPtr(unsigned long a) {
;; global unsigned int *res = (global unsigned int *)a;
;; res[0] = 0;
;; }
; CHECK-SPIRV: OpConvertUToPtr
define dso_local spir_kernel void @testConvertUToPtr(i64 noundef %a) local_unnamed_addr {
entry:
%conv = trunc i64 %a to i32
%0 = inttoptr i32 %conv to i32 addrspace(1)*
store i32 0, i32 addrspace(1)* %0, align 4
ret void
}