Skip to content

AMD ROCm™ Software

AMD ROCm software is AMD's Open Source stack for GPU computation.

To learn more about ROCm, check out our Documentation, Examples, and Developer Hub.

If you have questions or need help, reach out to us on GitHub.

Popular repositories Loading

  1. ROCm ROCm Public

    AMD ROCm™ Software - GitHub Home

    Shell 4.8k 395

  2. HIP HIP Public

    HIP: C++ Heterogeneous-Compute Interface for Portability

    C++ 3.8k 544

  3. MIOpen MIOpen Public

    AMD's Machine Intelligence Library

    Assembly 1.1k 235

  4. tensorflow-upstream tensorflow-upstream Public

    Forked from tensorflow/tensorflow

    TensorFlow ROCm port

    C++ 689 96

  5. HIPIFY HIPIFY Public

    HIPIFY: Convert CUDA to Portable C++ Code

    C++ 537 77

  6. ROCm-docker ROCm-docker Public

    Dockerfiles for the various software layers defined in the ROCm software platform

    Shell 439 68

Repositories

Showing 10 of 296 repositories
  • hipBLASLt Public

    hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library

    ROCm/hipBLASLt’s past year of commit activity
    Assembly 71 MIT 97 7 63 Updated Jan 13, 2025
  • llvm-project Public Forked from llvm/llvm-project

    This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.

    ROCm/llvm-project’s past year of commit activity
    LLVM 127 12,630 24 11 Updated Jan 13, 2025
  • aomp Public

    AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.

    ROCm/aomp’s past year of commit activity
    Fortran 210 Apache-2.0 48 1 45 Updated Jan 12, 2025
  • rocPRIM Public

    ROCm Parallel Primitives

    ROCm/rocPRIM’s past year of commit activity
    C++ 167 MIT 71 1 7 Updated Jan 12, 2025
  • flash-attention Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    ROCm/flash-attention’s past year of commit activity
    Python 151 BSD-3-Clause 1,422 24 12 Updated Jan 12, 2025
  • composable_kernel Public

    Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

    ROCm/composable_kernel’s past year of commit activity
    C++ 331 139 24 (1 issue needs help) 54 Updated Jan 12, 2025
  • MIOpen Public

    AMD's Machine Intelligence Library

    ROCm/MIOpen’s past year of commit activity
    Assembly 1,092 235 248 (4 issues need help) 58 Updated Jan 13, 2025
  • HIP Public

    HIP: C++ Heterogeneous-Compute Interface for Portability

    ROCm/HIP’s past year of commit activity
    C++ 3,829 MIT 544 24 37 Updated Jan 12, 2025
  • Tensile Public

    Stretching GPU performance for GEMMs and tensor contractions.

    ROCm/Tensile’s past year of commit activity
    Python 230 MIT 154 5 6 Updated Jan 12, 2025
  • hipCUB Public

    Reusable software components for ROCm developers

    ROCm/hipCUB’s past year of commit activity
    C++ 81 41 2 7 Updated Jan 12, 2025