AMDGPU v0.8.1
Merged pull requests:
- Implement device-side RNG (#380) (@utkarsh530)
- Fix path detection in ubuntu like systems (#545) (@gbaraldi)
- Simplify ROCm discovery (#548) (@pxl-th)
- [rocSPARSE] Add new constructors (#550) (@amontoison)
- Check context is valid before freeing streams, arrays. (#552) (@pxl-th)
- [rocSPARSE] Update helpers.jl (#554) (@amontoison)
- Use Atomix.jl for atomics (#555) (@pxl-th)
- Reset exception holder immediately after exception (#556) (@pxl-th)
- Fix exception reporting (#557) (@pxl-th)
- Cleanup (#559) (@pxl-th)
Closed issues:
- Implement sparse BLAS routines (#15)
- Implement iterative solvers (#13)
- Create a Docker image for AMDGPU.jl (#33)
- Implement batched off-thread HSA signal waiting (#128)
- HSA_STATUS_ERROR_INVALID_CODE_OBJECT on gfx803 (#192)
- hsa_executable_freeze can hang during high GPU load (#208)
- Implement copy!() (#218)
- ROCM/Hip not downloading (?) when ]added (#230)
- mapreducedim! is not implemented for AnyROCArray Types (#234)
- Test of AMDGPU fails on 5900HX - hipErrorNoBinaryForGpu (#244)
- Don't disable ROCm external library type definitions when non-functional (#350)
- AMDGPU.jl doesn't seem to work with 7900 series GPUs (#371)
- Support for rand from Julia Base on device code (#378)
- Detect hardware queue limit and use to limit queue pool size (#403)
- AMDGPU on windows (#465)
- Rely on Atomix.jl for atomics (#547)