Skip to content
Change the repository type filter

All

    Repositories list

    • MIOpen

      Public
      AMD's Machine Intelligence Library
      Assembly
      Other
      2351.1k24860Updated Jan 11, 2025Jan 11, 2025
    • RCCL Performance Benchmark Tests
      Cuda
      Other
      4153313Updated Jan 11, 2025Jan 11, 2025
    • rccl

      Public
      ROCm Communication Collectives Library (RCCL)
      C++
      Other
      1282881217Updated Jan 11, 2025Jan 11, 2025
    • Tensile

      Public
      Stretching GPU performance for GEMMs and tensor contractions.
      Python
      MIT License
      15422958Updated Jan 11, 2025Jan 11, 2025
    • jax

      Public
      Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
      Python
      Apache License 2.0
      2.8k19014Updated Jan 11, 2025Jan 11, 2025
    • This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.
      LLVM
      Other
      12k1272411Updated Jan 11, 2025Jan 11, 2025
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.4k1512412Updated Jan 11, 2025Jan 11, 2025
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      23k2207639Updated Jan 11, 2025Jan 11, 2025
    • rocDecode

      Public
      rocDecode is a high performance video decode SDK for AMD hardware
      C++
      Other
      172130Updated Jan 11, 2025Jan 11, 2025
    • rocJPEG

      Public
      rocJPEG is a high-performance jpeg decode SDK for decoding jpeg images using a hardware-accelerated jpeg decoder on AMD’s GPUs.
      C++
      MIT License
      8310Updated Jan 11, 2025Jan 11, 2025
    • aomp

      Public
      AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
      Fortran
      Apache License 2.0
      48210145Updated Jan 11, 2025Jan 11, 2025
    • hipCUB

      Public
      Reusable software components for ROCm developers
      C++
      Other
      418127Updated Jan 11, 2025Jan 11, 2025
    • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
      C++
      Other
      1393312452Updated Jan 11, 2025Jan 11, 2025
    • triton

      Public
      Development repository for the Triton language and compiler
      C++
      MIT License
      1.7k102945Updated Jan 11, 2025Jan 11, 2025
    • hipSOLVER

      Public
      ROCm SOLVER marshalling library
      C++
      MIT License
      252402Updated Jan 11, 2025Jan 11, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      5.1k53116Updated Jan 11, 2025Jan 11, 2025
    • xformers

      Public
      Hackable and optimized Transformers building blocks, supporting a composable construction.
      Python
      Other
      6342283Updated Jan 11, 2025Jan 11, 2025
    • Libraries integrating migraphx with pytorch
      Python
      BSD 3-Clause "New" or "Revised" License
      26133Updated Jan 11, 2025Jan 11, 2025
    • rocBLAS

      Public
      Next generation BLAS implementation for ROCm platform
      C++
      Other
      17035451Updated Jan 11, 2025Jan 11, 2025
    • ROCm

      Public
      AMD ROCm™ Software - GitHub Home
      Shell
      MIT License
      3954.8k11214Updated Jan 11, 2025Jan 11, 2025
    • hipBLASLt

      Public
      hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
      Assembly
      MIT License
      9771762Updated Jan 11, 2025Jan 11, 2025
    • rocThrust

      Public
      ROCm Thrust - run Thrust dependent software on AMD GPUs
      C++
      Apache License 2.0
      4710435Updated Jan 11, 2025Jan 11, 2025
    • rocPRIM

      Public
      ROCm Parallel Primitives
      C++
      MIT License
      7216717Updated Jan 11, 2025Jan 11, 2025
    • Advanced Profiling and Analytics for AMD Hardware
      Python
      MIT License
      51139499Updated Jan 11, 2025Jan 11, 2025
    • rocRAND

      Public
      RAND library for HIP programming language
      C++
      MIT License
      7011416Updated Jan 11, 2025Jan 11, 2025
    • A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high-performance computing environments
      C++
      MIT License
      396608Updated Jan 11, 2025Jan 11, 2025
    • hipRAND

      Public
      Random number library that generate pseudo-random and quasi-random numbers.
      C++
      MIT License
      242505Updated Jan 11, 2025Jan 11, 2025
    • hipfort

      Public
      Fortran interfaces for ROCm libraries
      Fortran
      Other
      3972157Updated Jan 11, 2025Jan 11, 2025
    • TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)
      C++
      MIT License
      143802Updated Jan 11, 2025Jan 11, 2025
    • C++
      MIT License
      101786Updated Jan 10, 2025Jan 10, 2025