verl compatibility#

2025-10-21

Applies to Linux

Volcano Engine Reinforcement Learning for LLMs (verl) is a reinforcement learning framework designed for large language models (LLMs). verl offers a scalable, open-source fine-tuning solution by using a hybrid programming model that makes it easy to define and run complex post-training dataflows efficiently.

Its modular APIs separate computation from data, allowing smooth integration with other frameworks. It also supports flexible model placement across GPUs for efficient scaling on different cluster sizes. verl achieves high training and generation throughput by building on existing LLM frameworks. Its 3D-HybridEngine reduces memory use and communication overhead when switching between training and inference, improving overall performance.

Support overview#

  • The ROCm-supported version of verl is maintained in the official ROCm/verl repository, which differs from the volcengine/verl upstream repository.

  • To get started and install verl on ROCm, use the prebuilt Docker image, which includes ROCm, verl, and all required dependencies.
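As a minimal sketch, pulling and starting the prebuilt image might look like the following. The untagged `rocm/verl` reference is illustrative (check Docker Hub for the current tag), and the device and security flags follow common practice for exposing AMD GPUs to containers under ROCm; your cluster may require additional options.

```shell
# Pull the prebuilt verl Docker image (illustrative, untagged reference;
# see Docker Hub for the current validated tag).
docker pull rocm/verl

# Launch an interactive container with GPU access. The --device and
# --group-add flags are the usual way to expose AMD GPUs under ROCm.
docker run -it \
  --device=/dev/kfd \
  --device=/dev/dri \
  --group-add video \
  --ipc=host \
  --shm-size 16G \
  --security-opt seccomp=unconfined \
  rocm/verl
```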

Version support#

verl is supported on ROCm 6.2.0.

Supported devices#

Officially Supported: AMD Instinct™ MI300X

Use cases and recommendations#

  • The benefits of verl for large-scale reinforcement learning from human feedback (RLHF) are discussed in the Reinforcement Learning from Human Feedback on AMD GPUs with verl and ROCm Integration blog post. It describes how the verl framework integrates with the AMD ROCm platform to optimize training on Instinct™ MI300X GPUs, walks through building a Docker image and setting up single-node and multi-node training environments, and presents performance benchmarks showing improved throughput and convergence accuracy. The post is a comprehensive starting point for deploying verl on AMD GPUs for efficient RLHF training workflows.

Supported features#

The following table shows verl on ROCm support for GPU-accelerated modules.

| Module | Description      | verl version | ROCm version |
|--------|------------------|--------------|--------------|
| FSDP   | Training engine  | 0.3.0.post0  | 6.2.0        |
| vllm   | Inference engine | 0.3.0.post0  | 6.2.0        |

Docker image compatibility#

AMD validates and publishes ready-made verl Docker images with ROCm backends on Docker Hub. The following table lists the latest validated verl Docker image and the component versions it includes. Click the Docker image name to view it on Docker Hub.

| Docker image | ROCm  | verl        | Ubuntu | PyTorch | Python | vLLM  |
|--------------|-------|-------------|--------|---------|--------|-------|
| rocm/verl    | 6.2.0 | 0.3.0.post0 | 20.04  | 2.5.0   | 3.9.19 | 0.6.3 |
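Once inside the running container, the component versions listed above can be spot-checked along these lines. This is a hedged sketch: it assumes the image's Python environment exposes `torch` and `vllm` in the usual way, and that the PyTorch build is a ROCm build (where `torch.version.hip` is set).

```shell
# Report the versions of the components bundled in the image.
python3 --version
python3 -c "import torch; print('PyTorch:', torch.__version__)"
python3 -c "import torch; print('ROCm/HIP:', torch.version.hip)"
python3 -c "import vllm; print('vLLM:', vllm.__version__)"
```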