simd directives #6

lucaparisi91 · 2024-06-26T16:37:44Z

What does #pragma omp simd actually do on the GPU?

lucaparisi91 · 2024-07-15T07:33:37Z

NVIDIA GPUs have some vectorized instructions of width 2 or 4. This is different from SIMT parallelism: such an instruction operates on 2 (or 4) elements in each of the 32 threads of a warp, i.e. 2 (or 4) x 32 elements per warp.
It looks like some older versions of CCE interpret simd as SIMT, so the directive is required in order to use all threads in a warp.
That is not the case for newer versions of CCE. At the moment, most compilers seem to ignore the simd directive.
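
For reference, a minimal sketch of where the directive usually shows up in offloaded code. The function name `saxpy_offload` and the loop itself are hypothetical, chosen only for illustration. The combined construct below is standard OpenMP, but whether the trailing `simd` changes the generated GPU code depends on the compiler and version, as described above. Without OpenMP enabled, or without an available device, the loop simply runs on the host.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical example: y[i] = a * x[i] + y[i], offloaded with OpenMP.
 * The trailing "simd" is the part under discussion: depending on the
 * compiler it may be ignored, or (reportedly, in some older CCE versions)
 * be what maps iterations onto the threads of a warp. */
void saxpy_offload(int n, float a, const float *x, float *y)
{
    assert(n >= 0 && x != NULL && y != NULL);

    /* x is only read on the device; y is read and written back. */
    #pragma omp target teams distribute parallel for simd \
        map(to: x[0:n]) map(tofrom: y[0:n])
    for (int i = 0; i < n; ++i) {
        y[i] = a * x[i] + y[i];
    }
}
```

One way to check what a given compiler does with the directive is to build the same loop with and without `simd` and compare runtimes or the generated device code.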

lucaparisi91 added the openmp-topics label Jun 26, 2024
