Skip to content
Change the repository type filter

All

    Repositories list

    • HIP

      Public
      HIP: C++ Heterogeneous-Compute Interface for Portability
      C++
      MIT License
      5483.9k2139Updated Jan 27, 2025Jan 27, 2025
    • AMD's graph optimization engine.
      C++
      MIT License
      9019634745Updated Jan 27, 2025Jan 27, 2025
    • rocMLIR

      Public
      MLIR
      Other
      41133216Updated Jan 27, 2025Jan 27, 2025
    • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
      C++
      Other
      1413372348Updated Jan 27, 2025Jan 27, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      5.3k57424Updated Jan 27, 2025Jan 27, 2025
    • MIOpen

      Public
      AMD's Machine Intelligence Library
      Assembly
      Other
      2361.1k24754Updated Jan 27, 2025Jan 27, 2025
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      23k2207838Updated Jan 27, 2025Jan 27, 2025
    • ROCm

      Public
      AMD ROCm™ Software - GitHub Home
      Shell
      MIT License
      3994.9k10311Updated Jan 27, 2025Jan 27, 2025
    • triton

      Public
      Development repository for the Triton language and compiler
      C++
      MIT License
      1.7k1031046Updated Jan 27, 2025Jan 27, 2025
    • xla

      Public
      A machine learning compiler for GPUs, CPUs, and ML accelerators
      C++
      Apache License 2.0
      4883014Updated Jan 27, 2025Jan 27, 2025
    • ROCm Documentation Python package for ReadTheDocs build standardization
      CSS
      Other
      1713810Updated Jan 27, 2025Jan 27, 2025
    • hipFFT

      Public
      hipFFT is a FFT marshalling library.
      C++
      Other
      345715Updated Jan 27, 2025Jan 27, 2025
    • jax

      Public
      Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
      Python
      Apache License 2.0
      2.9k19012Updated Jan 27, 2025Jan 27, 2025
    • rocDecode

      Public
      rocDecode is a high performance video decode SDK for AMD hardware
      C++
      Other
      182132Updated Jan 27, 2025Jan 27, 2025
    • Shell
      Apache License 2.0
      91853Updated Jan 27, 2025Jan 27, 2025
    • ROCm Systems Profiler
      C++
      MIT License
      615010Updated Jan 27, 2025Jan 27, 2025
    • rocSHMEM

      Public
      rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.
      C++
      MIT License
      124880Updated Jan 27, 2025Jan 27, 2025
    • rocJPEG

      Public
      rocJPEG is a high-performance jpeg decode SDK for decoding jpeg images using a hardware-accelerated jpeg decoder on AMD’s GPUs.
      C++
      MIT License
      8310Updated Jan 27, 2025Jan 27, 2025
    • C++
      MIT License
      111757Updated Jan 27, 2025Jan 27, 2025
    • gpuaidev

      Public
      Repository to host ROCm Developer Hub Notebook Tutorials
      3200Updated Jan 27, 2025Jan 27, 2025
    • Advanced Profiling and Analytics for AMD Hardware
      Python
      MIT License
      511394711Updated Jan 27, 2025Jan 27, 2025
    • aotriton

      Public
      Ahead of Time (AOT) Triton Math Library
      Python
      MIT License
      1750112Updated Jan 27, 2025Jan 27, 2025
    • rocAL

      Public
      The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a processing graph programmable by the user.
      C++
      MIT License
      1513104Updated Jan 27, 2025Jan 27, 2025
    • CMake
      MIT License
      2200Updated Jan 27, 2025Jan 27, 2025
    • Tensile

      Public
      Stretching GPU performance for GEMMs and tensor contractions.
      Python
      MIT License
      15423134Updated Jan 27, 2025Jan 27, 2025
    • rocBLAS

      Public
      Next generation BLAS implementation for ROCm platform
      C++
      Other
      17135941Updated Jan 27, 2025Jan 27, 2025
    • TensorFlow ROCm port
      C++
      Apache License 2.0
      74k6897167Updated Jan 27, 2025Jan 27, 2025
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.4k1522411Updated Jan 27, 2025Jan 27, 2025
    • Python
      Other
      71598Updated Jan 27, 2025Jan 27, 2025
    • rocSPARSE

      Public
      Next generation SPARSE implementation for ROCm platform
      C++
      MIT License
      5611810Updated Jan 27, 2025Jan 27, 2025