Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

simd directives #6

Open
lucaparisi91 opened this issue Jun 26, 2024 · 1 comment
Open

simd directives #6

lucaparisi91 opened this issue Jun 26, 2024 · 1 comment

Comments

@lucaparisi91
Copy link
Collaborator

What does #pragma omp simd actually do on the GPU ?

@lucaparisi91
Copy link
Collaborator Author

lucaparisi91 commented Jul 15, 2024

NVidia GPUs can have vectorized instructions of length 2 or 4 . This is different from SIMT parallelism. The instruction will operate on 2(4) x 32 threads in a warp .
It looks like some old versions of CCE interprets simd as simt , and the value is required to use all threads in a warp.
That is not the case for new versions of cce . At the moment most compilers seem to ignore the simd directive.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant