[SYCL][libclc] Add generic addrspace overloads of math builtins #13015

frasercrmck · 2024-03-13T18:52:49Z

The generic implementations of the math builtins that take pointer arguments were using unqualified address spaces. An unqualified pointer resolves to either the generic address space or the private address space, depending on whether the target supports the generic address space.

The newer unified OpenCL C specification is clearer in mandating that all targets must provide overloads on the explicitly qualified 'private' address space, as well as optionally defining ones on the (unqualified) generic address space. This meant that most of these math builtins were lacking one overload: either the private or generic one, depending on which target was compiling the builtins.

One notable exception here is NVIDIA, which maps the private and generic
address spaces to the same target address space. Thus declaring builtins
overloaded on these two address spaces results in a mangling clash,
which we can't have. Therefore we now say that NVIDIA targets don't
support the generic address space for the purposes of these builtins. In
reality, the builtins with the private address space are functionally
equivalent to the generic ones, so users won't notice.

For the sake of code clarity, although the 'generic' keyword is technically reserved, we know that clang defines it to be the corresponding unqualified generic address space, so we use that to be explicit. We always compile with clang so it shouldn't be a problem with portability.
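
A minimal sketch of the resulting overload set, written as OpenCL C declarations. The names `sketch_fract` and `SKETCH_DISTINCT_GENERIC_AS` are hypothetical and are not taken from the patch; the guard stands in for whatever mechanism the build uses to mark a target (such as NVIDIA) as lacking a distinct generic address space.

```c
// OpenCL C sketch, assuming clang as the compiler.
// sketch_fract and SKETCH_DISTINCT_GENERIC_AS are hypothetical names.

// Required on every target: pointer explicitly in the private address space.
__attribute__((overloadable))
float sketch_fract(float x, private float *iptr);

#ifdef SKETCH_DISTINCT_GENERIC_AS
// Only where generic and private are distinct target address spaces.
// On a target that maps both to the same address space, this declaration
// would mangle identically to the one above and clash with it.
__attribute__((overloadable))
float sketch_fract(float x, generic float *iptr);
#endif
```

With both overloads declared, a call that passes the address of a function-local variable should select the private overload, while a call that passes an unqualified pointer should select the generic one.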

With this we can also enable a LIT test for HIP, which was previously failing as it couldn't find the generic address space overloads of the fract and lgamma_r builtins.

Other builtins, such as the vload and vstore variants, may need the same treatment. Those will be handled in a subsequent patch.


npmiller · 2024-03-19T14:45:22Z

Friendly ping @intel/llvm-reviewers-runtime @bso-intel

…ported on Native CPU (#13109) Similarly to what is done for `nvptx` in #13015, Native CPU maps `private` and `generic` to the same address space, so we need to avoid getting multiple definitions for the libclc builtins that use `generic`.

frasercrmck requested review from a team as code owners March 13, 2024 18:52

frasercrmck requested review from bso-intel and sergey-semenov March 13, 2024 18:52

frasercrmck had a problem deploying to WindowsCILock March 13, 2024 23:54 — with GitHub Actions Failure

frasercrmck force-pushed the libclc-generic branch 2 times, most recently from 6f32ac4 to 739cc92 Compare March 19, 2024 11:00

frasercrmck had a problem deploying to WindowsCILock March 19, 2024 11:06 — with GitHub Actions Failure

frasercrmck force-pushed the libclc-generic branch from 739cc92 to b6068f0 Compare March 19, 2024 11:27

frasercrmck temporarily deployed to WindowsCILock March 19, 2024 11:51 — with GitHub Actions Inactive

frasercrmck temporarily deployed to WindowsCILock March 19, 2024 12:11 — with GitHub Actions Inactive

npmiller approved these changes Mar 19, 2024


bso-intel approved these changes Mar 20, 2024


frasercrmck added 7 commits March 21, 2024 10:30

add missing clc decls

690ddeb

add spirv fract decl

369d13e

add spirv frexp decl

94f8d29

add spirv modf decl

0699362

add spirv remquo decl

0e0070c

add spirv lgamma_r decl

209f701

add spirv sincos decl

d3cc3d5

frasercrmck temporarily deployed to WindowsCILock March 21, 2024 10:59 — with GitHub Actions Inactive

frasercrmck temporarily deployed to WindowsCILock March 21, 2024 11:37 — with GitHub Actions Inactive

ldrumm merged commit 9e4768c into intel:sycl Mar 21, 2024
12 checks passed

frasercrmck deleted the libclc-generic branch March 21, 2024 14:15

PietroGhg mentioned this pull request Mar 22, 2024

[SYCL][NATIVECPU][libclc]Mark opencl_c_generic_address_space as unsupported on Native CPU #13109

Merged
