DGL compatibility#
2025-10-21
7 min read time
Deep Graph Library (DGL) is an easy-to-use, high-performance, and scalable Python package for deep learning on graphs. DGL is framework agnostic, meaning that if a deep graph model is a component in an end-to-end application, the rest of the logic is implemented using PyTorch.
DGL provides a high-performance graph object that can reside on either CPUs or GPUs. It bundles structural data features for better control and provides a variety of functions for computing with graph objects, including efficient and customizable message passing primitives for Graph Neural Networks.
Support overview#
The ROCm-supported version of DGL is maintained in the official ROCm/dgl repository, which differs from the dmlc/dgl upstream repository.
To get started and install DGL on ROCm, use the prebuilt Docker images, which include ROCm, DGL, and all required dependencies.
See the ROCm DGL installation guide for installation and setup instructions.
You can also consult the upstream Installation guide for additional context.
Version support#
DGL is supported on ROCm 6.4.0.
Supported devices#
Officially Supported: AMD Instinct™ MI300X (through hipBLASlt)
Partially Supported: AMD Instinct™ MI250X
Use cases and recommendations#
DGL can be used for Graph Learning, and building popular graph models like GAT, GCN, and GraphSage. Using these models, a variety of use cases are supported:
Recommender systems
Network Optimization and Analysis
1D (Temporal) and 2D (Image) Classification
Drug Discovery
Multiple use cases of DGL have been tested and verified.
However, a recommended example follows a drug discovery pipeline using the SE3Transformer.
Refer to the AMD ROCm blog,
where you can search for DGL examples and best practices to optimize your training workflows on AMD GPUs.
Coverage includes:
Single-GPU training/inference
Multi-GPU training
Docker image compatibility#
AMD validates and publishes DGL images with ROCm backends on Docker Hub. The following Docker image tags and associated inventories represent the latest available DGL version from the official Docker Hub. Click the to view the image on Docker Hub.
Key ROCm libraries for DGL#
DGL on ROCm depends on specific libraries that affect its features and performance. Using the DGL Docker container or building it with the provided Docker file or a ROCm base image is recommended. If you prefer to build it yourself, ensure the following dependencies are installed:
ROCm library |
ROCm 6.4.0 Version |
Purpose |
|---|---|---|
1.1.0 |
Enables faster execution of core operations like matrix multiplication (GEMM), convolutions and transformations. |
|
2.4.0 |
Provides GPU-accelerated Basic Linear Algebra Subprograms (BLAS) for matrix and vector operations. |
|
0.12.0 |
hipBLASLt is an extension of the hipBLAS library, providing additional features like epilogues fused into the matrix multiplication kernel or use of integer tensor cores. |
|
3.4.0 |
Provides a C++ template library for parallel algorithms for reduction, scan, sort and select. |
|
1.0.18 |
Provides GPU-accelerated Fast Fourier Transform (FFT) operations. |
|
2.12.0 |
Provides fast random number generation for GPUs. |
|
2.4.0 |
Provides GPU-accelerated solvers for linear systems, eigenvalues, and singular value decompositions (SVD). |
|
3.2.0 |
Accelerates operations on sparse matrices, such as sparse matrix-vector or matrix-matrix products. |
|
0.2.3 |
Accelerates operations on sparse matrices, such as sparse matrix-vector or matrix-matrix products. |
|
1.5.0 |
Optimizes for high-performance tensor operations, such as contractions. |
|
3.4.0 |
Optimizes deep learning primitives such as convolutions, pooling, normalization, and activation functions. |
|
2.12.0 |
Adds graph-level optimizations, ONNX models and mixed precision support and enable Ahead-of-Time (AOT) Compilation. |
|
3.2.0 |
Optimizes acceleration for computer vision and AI workloads like preprocessing, augmentation, and inferencing. |
|
2.3.0 |
Accelerates the data pipeline by offloading intensive preprocessing and augmentation tasks. rocAL is part of MIVisionX. |
|
2.2.0 |
Optimizes for multi-GPU communication for operations like AllReduce and Broadcast. |
|
0.10.0 |
Provides hardware-accelerated data decoding capabilities, particularly for image, video, and other dataset formats. |
|
0.8.0 |
Provides hardware-accelerated JPEG image decoding and encoding. |
|
1.9.10 |
Speeds up data augmentation, transformation, and other preprocessing steps. |
|
3.3.0 |
Provides a C++ template library for parallel algorithms like sorting, reduction, and scanning. |
|
1.7.0 |
Accelerates warp-level matrix-multiply and matrix-accumulate to speed up matrix multiplication (GEMM) and accumulation operations with mixed precision support. |
Supported features#
Many functions and methods available upstream are also supported in DGL on ROCm. Instead of listing them all, support is grouped into the following categories to provide a general overview.
DGL Base
DGL Backend
DGL Data
DGL Dataloading
DGL Graph
DGL Function
DGL Ops
DGL Sampling
DGL Transforms
DGL Utils
DGL Distributed
DGL Geometry
DGL Mpops
DGL NN
DGL Optim
DGL Sparse
Unsupported features#
GraphBolt
Partial TF32 Support (MI250X only)
Kineto/ROCTracer integration
Unsupported functions#
more_nnzformatmultiprocess_sparse_adam_state_dictrecord_stream_ndarrayhalf_spmmsegment_mmgather_mm_idx_bpgexplainersample_labors_probsample_labors_noprob