0.21.0

This release introduces the vLLM container image. It provides Intel GPU software support for vLLM inference and serving.

This image is available on Docker Hub, and you can build it from the Dockerfile.

Requirements

The vLLM container was validated on a host system running Ubuntu 24.04.4 with the following GPUs:

  • Intel® Arc™ Pro B50 Graphics

  • Intel® Arc™ Pro B60 Graphics

  • Intel® Arc™ Pro B65 Graphics

  • Intel® Arc™ Pro B70 Graphics

Components

This container is based on Intel® Open Middleware Xe (Intel OMIX) 0.1.0 and includes vLLM 0.21.0. For more information about this version, see the project’s Release Notes.

Security vulnerabilities

vLLM currently depends on diskcache version 5.6.3, which has been reported as affected by CVE-2025-69872. The vulnerability remains unresolved upstream as of the day of this release. According to initial analysis, the vLLM architecture does not expose the vulnerable code path, meaning vLLM is not impacted in practice, despite the dependency being formally flagged.