docker.io/vllm/vllm-openai:gemma4-unified linux/arm64

docker.io/vllm/vllm-openai:gemma4-unified - 国内下载镜像源 浏览次数:33 温馨提示: 这是一个 linux/arm64 系统架构镜像
这是镜像描述:

vllm/openai

基于 OpenAI 的 GPT-3 模型的 API 服务,支持自然语言处理等功能。

源镜像 docker.io/vllm/vllm-openai:gemma4-unified
国内镜像 swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-unified-linuxarm64
镜像ID sha256:40961599052435ac6a09bda4c79b77d67b0edb525a14306535dff41d2333cc29
镜像TAG gemma4-unified-linuxarm64
大小 19.46GB
镜像源 docker.io
项目信息 Docker-Hub主页 🚀项目TAG 🚀
CMD
启动入口 vllm serve
工作目录 /vllm-workspace
OS/平台 linux/arm64
浏览量 33 次
贡献者
镜像创建 2026-06-03T16:46:11.136255302Z
同步时间 2026-06-11 04:14
环境变量
PATH=/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin NVARCH=sbsa NVIDIA_REQUIRE_CUDA=cuda>=13.0 brand=unknown,driver>=535,driver<536 brand=grid,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=vapps,driver>=535,driver<536 brand=vpc,driver>=535,driver<536 brand=vcs,driver>=535,driver<536 brand=vws,driver>=535,driver<536 brand=cloudgaming,driver>=535,driver<536 brand=unknown,driver>=550,driver<551 brand=grid,driver>=550,driver<551 brand=tesla,driver>=550,driver<551 brand=nvidia,driver>=550,driver<551 brand=quadro,driver>=550,driver<551 brand=quadrortx,driver>=550,driver<551 brand=nvidiartx,driver>=550,driver<551 brand=vapps,driver>=550,driver<551 brand=vpc,driver>=550,driver<551 brand=vcs,driver>=550,driver<551 brand=vws,driver>=550,driver<551 brand=cloudgaming,driver>=550,driver<551 brand=unknown,driver>=565,driver<566 brand=grid,driver>=565,driver<566 brand=tesla,driver>=565,driver<566 brand=nvidia,driver>=565,driver<566 brand=quadro,driver>=565,driver<566 brand=quadrortx,driver>=565,driver<566 brand=nvidiartx,driver>=565,driver<566 brand=vapps,driver>=565,driver<566 brand=vpc,driver>=565,driver<566 brand=vcs,driver>=565,driver<566 brand=vws,driver>=565,driver<566 brand=cloudgaming,driver>=565,driver<566 brand=unknown,driver>=570,driver<571 brand=grid,driver>=570,driver<571 brand=tesla,driver>=570,driver<571 brand=nvidia,driver>=570,driver<571 brand=quadro,driver>=570,driver<571 brand=quadrortx,driver>=570,driver<571 brand=nvidiartx,driver>=570,driver<571 brand=vapps,driver>=570,driver<571 brand=vpc,driver>=570,driver<571 brand=vcs,driver>=570,driver<571 brand=vws,driver>=570,driver<571 brand=cloudgaming,driver>=570,driver<571 brand=unknown,driver>=575,driver<576 brand=grid,driver>=575,driver<576 brand=tesla,driver>=575,driver<576 brand=nvidia,driver>=575,driver<576 brand=quadro,driver>=575,driver<576 brand=quadrortx,driver>=575,driver<576 brand=nvidiartx,driver>=575,driver<576 brand=vapps,driver>=575,driver<576 brand=vpc,driver>=575,driver<576 brand=vcs,driver>=575,driver<576 brand=vws,driver>=575,driver<576 brand=cloudgaming,driver>=575,driver<576 NV_CUDA_CUDART_VERSION=13.0.88-1 CUDA_VERSION=13.0.1 LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64 NVIDIA_VISIBLE_DEVICES=all NVIDIA_DRIVER_CAPABILITIES=compute,utility DEBIAN_FRONTEND=noninteractive UV_HTTP_TIMEOUT=500 UV_INDEX_STRATEGY=unsafe-best-match UV_LINK_MODE=copy UV_PYTHON_INSTALL_DIR=/opt/uv/python UV_CACHE_DIR=/opt/uv/cache VLLM_ENABLE_CUDA_COMPATIBILITY=0 TORCH_CUDA_ARCH_LIST=8.7 8.9 9.0 10.0+PTX 12.0 12.1 VLLM_USAGE_SOURCE=production-docker-image VLLM_BUILD_COMMIT=unknown VLLM_BUILD_PIPELINE=local VLLM_BUILD_URL= VLLM_IMAGE_TAG=local/vllm-openai:dev
镜像标签
unknown: ai.vllm.build.commit local: ai.vllm.build.pipeline : ai.vllm.build.url local/vllm-openai:dev: ai.vllm.image.tag NVIDIA CORPORATION <cudatools@nvidia.com>: maintainer ubuntu: org.opencontainers.image.ref.name unknown: org.opencontainers.image.revision https://github.com/vllm-project/vllm: org.opencontainers.image.source : org.opencontainers.image.url local/vllm-openai:dev: org.opencontainers.image.version
镜像安全扫描 查看Trivy扫描报告

系统OS: ubuntu 22.04 扫描引擎: Trivy 扫描时间: 2026-06-11 04:17

低危漏洞:206 中危漏洞:2770 高危漏洞:36 严重漏洞:9

Docker拉取命令

docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-unified-linuxarm64
docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-unified-linuxarm64  docker.io/vllm/vllm-openai:gemma4-unified

Containerd拉取命令

ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-unified-linuxarm64
ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-unified-linuxarm64  docker.io/vllm/vllm-openai:gemma4-unified

Shell快速替换命令

sed -i 's#vllm/vllm-openai:gemma4-unified#swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-unified-linuxarm64#' deployment.yaml

Ansible快速分发-Docker

#ansible k8s -m shell -a 'docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-unified-linuxarm64 && docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-unified-linuxarm64  docker.io/vllm/vllm-openai:gemma4-unified'

Ansible快速分发-Containerd

#ansible k8s -m shell -a 'ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-unified-linuxarm64 && ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-unified-linuxarm64  docker.io/vllm/vllm-openai:gemma4-unified'

镜像构建历史


# 2026-06-04 00:46:11  0.00B 配置容器启动时运行的命令
ENTRYPOINT ["vllm" "serve"]
                        
# 2026-06-04 00:46:11  0.00B 添加元数据标签
LABEL org.opencontainers.image.source=https://github.com/vllm-project/vllm org.opencontainers.image.revision=unknown org.opencontainers.image.version=local/vllm-openai:dev org.opencontainers.image.url= ai.vllm.build.commit=unknown ai.vllm.build.pipeline=local ai.vllm.build.url= ai.vllm.image.tag=local/vllm-openai:dev
                        
# 2026-06-04 00:46:11  0.00B 设置环境变量 VLLM_BUILD_COMMIT VLLM_BUILD_PIPELINE VLLM_BUILD_URL VLLM_IMAGE_TAG
ENV VLLM_BUILD_COMMIT=unknown VLLM_BUILD_PIPELINE=local VLLM_BUILD_URL= VLLM_IMAGE_TAG=local/vllm-openai:dev
                        
# 2026-06-04 00:46:11  0.00B 设置环境变量 VLLM_USAGE_SOURCE
ENV VLLM_USAGE_SOURCE=production-docker-image
                        
# 2026-06-04 00:46:11  0.00B 执行命令并创建新的镜像层
RUN |14 TARGETPLATFORM=linux/arm64 INSTALL_KV_CONNECTORS=true CUDA_VERSION=13.0.1 VLLM_BUILD_COMMIT= VLLM_BUILD_PIPELINE= VLLM_BUILD_URL= VLLM_IMAGE_TAG= PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= torch_cuda_arch_list=8.7 8.9 9.0 10.0+PTX 12.0 12.1 MOONCAKE_WHEEL_AARCH64= MOONCAKE_WHEEL_X86_64= /bin/sh -c if [ "$INSTALL_KV_CONNECTORS" = "true" ]; then         if [ "$TARGETPLATFORM" = "linux/arm64" ]; then             WHEEL="${MOONCAKE_WHEEL_AARCH64}";         else             WHEEL="${MOONCAKE_WHEEL_X86_64}";         fi &&         if [ -n "${WHEEL}" ]; then             uv pip install --system "${WHEEL}" &&             CUDA_MAJOR="${CUDA_VERSION%%.*}" &&             if [ ! -f /usr/local/cuda/lib64/libcudart.so ] &&                [ -f "/usr/local/cuda/lib64/libcudart.so.${CUDA_MAJOR}" ]; then                 ln -s "libcudart.so.${CUDA_MAJOR}" /usr/local/cuda/lib64/libcudart.so;             fi;         fi;     fi # buildkit
                        
# 2026-06-04 00:46:10  0.00B 定义构建参数
ARG MOONCAKE_WHEEL_X86_64
                        
# 2026-06-04 00:46:10  0.00B 定义构建参数
ARG MOONCAKE_WHEEL_AARCH64
                        
# 2026-06-04 00:46:10  1.43GB 执行命令并创建新的镜像层
RUN |12 TARGETPLATFORM=linux/arm64 INSTALL_KV_CONNECTORS=true CUDA_VERSION=13.0.1 VLLM_BUILD_COMMIT= VLLM_BUILD_PIPELINE= VLLM_BUILD_URL= VLLM_IMAGE_TAG= PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= torch_cuda_arch_list=8.7 8.9 9.0 10.0+PTX 12.0 12.1 /bin/sh -c CUDA_MAJOR="${CUDA_VERSION%%.*}";     CUDA_VERSION_DASH=$(echo $CUDA_VERSION | cut -d. -f1,2 | tr '.' '-');     CUDA_HOME=/usr/local/cuda;     BUILD_PKGS="libcusparse-dev-${CUDA_VERSION_DASH}                 libcublas-dev-${CUDA_VERSION_DASH}                 libcusolver-dev-${CUDA_VERSION_DASH}";     if [ "$INSTALL_KV_CONNECTORS" = "true" ]; then         uv pip install --system -r /tmp/kv_connectors.txt --no-build || (             apt-get update -y &&             apt-get install -y --no-install-recommends --allow-change-held-packages ${BUILD_PKGS} &&             uv pip install --system -r /tmp/kv_connectors.txt --no-build-isolation &&             apt-get purge -y ${BUILD_PKGS} &&             rm -rf /var/lib/apt/lists/*         );         uv pip install --system --force-reinstall --no-deps nixl-cu${CUDA_MAJOR};     fi # buildkit
                        
# 2026-06-04 00:42:49  0.00B 设置环境变量 TORCH_CUDA_ARCH_LIST
ENV TORCH_CUDA_ARCH_LIST=8.7 8.9 9.0 10.0+PTX 12.0 12.1
                        
# 2026-06-04 00:42:49  0.00B 定义构建参数
ARG torch_cuda_arch_list=7.5 8.0 8.6 8.9 9.0 10.0 11.0 12.0+PTX
                        
# 2026-06-04 00:42:49  0.00B 设置环境变量 UV_HTTP_TIMEOUT
ENV UV_HTTP_TIMEOUT=500
                        
# 2026-06-04 00:42:49  0.00B 定义构建参数
ARG PIP_EXTRA_INDEX_URL UV_EXTRA_INDEX_URL
                        
# 2026-06-04 00:42:49  0.00B 定义构建参数
ARG PIP_INDEX_URL UV_INDEX_URL
                        
# 2026-06-04 00:42:49  0.00B 定义构建参数
ARG VLLM_IMAGE_TAG
                        
# 2026-06-04 00:42:49  0.00B 定义构建参数
ARG VLLM_BUILD_URL
                        
# 2026-06-04 00:42:49  0.00B 定义构建参数
ARG VLLM_BUILD_PIPELINE
                        
# 2026-06-04 00:42:49  0.00B 定义构建参数
ARG VLLM_BUILD_COMMIT
                        
# 2026-06-04 00:42:49  0.00B 定义构建参数
ARG CUDA_VERSION
                        
# 2026-06-04 00:42:49  0.00B 定义构建参数
ARG INSTALL_KV_CONNECTORS=false
                        
# 2026-06-04 00:42:49  0.00B 定义构建参数
ARG TARGETPLATFORM
                        
# 2026-06-04 00:42:49  35.09KB 复制新文件或目录到容器中
COPY ./vllm/collect_env.py . # buildkit
                        
# 2026-06-04 00:42:49  1.06MB 复制新文件或目录到容器中
COPY benchmarks benchmarks # buildkit
                        
# 2026-06-04 00:42:49  1.33MB 复制新文件或目录到容器中
COPY examples examples # buildkit
                        
# 2026-06-04 00:42:49  0.00B 设置环境变量 LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64
                        
# 2026-06-04 00:42:49  1.90GB 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.12 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c flashinfer show-config && flashinfer download-cubin # buildkit
                        
# 2026-06-04 00:29:37  29.58MB 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.12 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c uv pip install --system ep_kernels/dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.') # buildkit
                        
# 2026-06-04 00:29:36  0.00B 设置环境变量 LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/cuda/lib64:/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64
                        
# 2026-06-04 00:29:36  0.00B 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.12 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c . /etc/environment && uv pip list # buildkit
                        
# 2026-06-04 00:29:36  838.34MB 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.12 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c if [ "${PYTORCH_NIGHTLY}" = "1" ]; then         echo "Installing torch nightly..."         && uv pip install --system $(cat torch_lib_versions.txt | xargs) --pre         --index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/nightly/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.')         && echo "Installing vLLM..."         && uv pip install --system dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/nightly/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.');     else         echo "Installing vLLM..."         && uv pip install --system dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.');     fi # buildkit
                        
# 2026-06-03 23:57:49  71.00B 复制新文件或目录到容器中
COPY /workspace/torch_lib_versions.txt torch_lib_versions.txt # buildkit
                        
# 2026-06-03 23:57:49  0.00B 定义构建参数
ARG PYTORCH_NIGHTLY
                        
# 2026-06-03 23:57:49  0.00B 定义构建参数
ARG PIP_KEYRING_PROVIDER UV_KEYRING_PROVIDER
                        
# 2026-06-03 23:57:49  0.00B 定义构建参数
ARG PYTORCH_CUDA_INDEX_BASE_URL
                        
# 2026-06-03 23:57:49  0.00B 定义构建参数
ARG PIP_EXTRA_INDEX_URL UV_EXTRA_INDEX_URL
                        
# 2026-06-03 23:57:49  0.00B 定义构建参数
ARG PIP_INDEX_URL UV_INDEX_URL
                        
# 2026-06-03 23:57:49  576.50MB 执行命令并创建新的镜像层
RUN |14 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.12 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 /bin/sh -c if [ "$TARGETPLATFORM" = "linux/arm64" ]; then         BITSANDBYTES_VERSION="${BITSANDBYTES_VERSION_ARM64}";     else         BITSANDBYTES_VERSION="${BITSANDBYTES_VERSION_X86}";     fi;     uv pip install --system accelerate modelscope         "bitsandbytes>=${BITSANDBYTES_VERSION}" "timm${TIMM_VERSION}" "runai-model-streamer[s3,gcs,azure]${RUNAI_MODEL_STREAMER_VERSION}" # buildkit
                        
# 2026-06-03 23:57:36  0.00B 定义构建参数
ARG RUNAI_MODEL_STREAMER_VERSION=>=0.15.7
                        
# 2026-06-03 23:57:36  0.00B 定义构建参数
ARG TIMM_VERSION=>=1.0.17
                        
# 2026-06-03 23:57:36  0.00B 定义构建参数
ARG BITSANDBYTES_VERSION_ARM64=0.42.0
                        
# 2026-06-03 23:57:36  0.00B 定义构建参数
ARG BITSANDBYTES_VERSION_X86=0.46.1
                        
# 2026-06-03 23:57:36  2.41MB 执行命令并创建新的镜像层
RUN |10 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.12 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 /bin/sh -c set -eux;     case "${TARGETPLATFORM}" in       linux/arm64) UUARCH="aarch64" ;;       linux/amd64) UUARCH="x64" ;;       *) echo "Unsupported TARGETPLATFORM: ${TARGETPLATFORM}" >&2; exit 1 ;;     esac;     /tmp/install_gdrcopy.sh "${GDRCOPY_OS_VERSION}" "${GDRCOPY_CUDA_VERSION}" "${UUARCH}" &&     rm /tmp/install_gdrcopy.sh # buildkit
                        
# 2026-06-03 23:57:30  1.44KB 复制新文件或目录到容器中
COPY tools/install_gdrcopy.sh /tmp/install_gdrcopy.sh # buildkit
                        
# 2026-06-03 23:57:26  0.00B 定义构建参数
ARG TARGETPLATFORM
                        
# 2026-06-03 23:57:26  0.00B 定义构建参数
ARG GDRCOPY_OS_VERSION=Ubuntu22_04
                        
# 2026-06-03 23:57:26  0.00B 定义构建参数
ARG GDRCOPY_CUDA_VERSION=12.8
                        
# 2026-06-03 23:57:26  2.27GB 执行命令并创建新的镜像层
RUN |7 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.12 /bin/sh -c uv pip install --system flashinfer-jit-cache==${FLASHINFER_VERSION}         --index-url https://flashinfer.ai/whl/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.') # buildkit
                        
# 2026-06-03 23:56:09  0.00B 定义构建参数
ARG FLASHINFER_VERSION=0.6.12
                        
# 2026-06-03 23:56:09  8.34GB 执行命令并创建新的镜像层
RUN |6 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl /bin/sh -c if [ "$(echo $CUDA_VERSION | cut -d. -f1)" = "12" ]; then         sed -i 's/^nvidia-cutlass-dsl\[cu13\]/nvidia-cutlass-dsl/' /tmp/requirements-cuda.txt;         sed -i 's/^humming-kernels\[cu13\]/humming-kernels[cu12]/' /tmp/requirements-cuda.txt;     fi &&     uv pip install --system -r /tmp/requirements-cuda.txt         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.') &&     rm /tmp/requirements-cuda.txt /tmp/common.txt # buildkit
                        
# 2026-06-03 23:54:53  959.00B 复制新文件或目录到容器中
COPY requirements/cuda.txt /tmp/requirements-cuda.txt # buildkit
                        
# 2026-06-03 23:54:53  3.07KB 复制新文件或目录到容器中
COPY requirements/common.txt /tmp/common.txt # buildkit
                        
# 2026-06-03 23:54:53  0.00B 定义构建参数
ARG PYTORCH_CUDA_INDEX_BASE_URL
                        
# 2026-06-03 23:54:53  0.00B 设置环境变量 VLLM_ENABLE_CUDA_COMPATIBILITY
ENV VLLM_ENABLE_CUDA_COMPATIBILITY=0
                        
# 2026-06-03 23:54:53  3.69KB 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c chmod 0755 /usr/local/bin/vllm-nonroot-entrypoint.sh # buildkit
                        
# 2026-06-03 23:54:52  3.69KB 复制新文件或目录到容器中
COPY docker/entrypoints/vllm-nonroot-entrypoint.sh /usr/local/bin/vllm-nonroot-entrypoint.sh # buildkit
                        
# 2026-06-03 23:54:52  665.72KB 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c useradd --uid 2000 --gid 0 --create-home --home-dir /home/vllm         --shell /bin/bash vllm     && mkdir -p /home/vllm/.cache /home/vllm/.config     && chown -R 2000:0 /home/vllm     && chmod -R g+rwX /home/vllm     && chgrp 0 /etc/passwd /etc/group     && chmod g=u /etc/passwd /etc/group # buildkit
                        
# 2026-06-03 23:54:39  0.00B 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c mkdir -p "${UV_PYTHON_INSTALL_DIR}" "${UV_CACHE_DIR}"     && chgrp -R 0 /opt/uv     && chmod -R g+rwX,a+rX /opt/uv # buildkit
                        
# 2026-06-03 23:54:38  0.00B 设置环境变量 UV_CACHE_DIR
ENV UV_CACHE_DIR=/opt/uv/cache
                        
# 2026-06-03 23:54:38  0.00B 设置环境变量 UV_PYTHON_INSTALL_DIR
ENV UV_PYTHON_INSTALL_DIR=/opt/uv/python
                        
# 2026-06-03 23:54:38  0.00B 设置环境变量 UV_LINK_MODE
ENV UV_LINK_MODE=copy
                        
# 2026-06-03 23:54:38  0.00B 设置环境变量 UV_INDEX_STRATEGY
ENV UV_INDEX_STRATEGY=unsafe-best-match
                        
# 2026-06-03 23:54:38  0.00B 设置环境变量 UV_HTTP_TIMEOUT
ENV UV_HTTP_TIMEOUT=500
                        
# 2026-06-03 23:54:38  76.75MB 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c python3 -m pip install uv # buildkit
                        
# 2026-06-03 23:54:35  3.04GB 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c CUDA_VERSION_DASH=$(echo $CUDA_VERSION | cut -d. -f1,2 | tr '.' '-') &&     CUDA_VERSION_SHORT=$(echo $CUDA_VERSION | cut -d. -f1,2) &&     apt-get update -y &&     apt-get install -y --no-install-recommends --allow-change-held-packages         cuda-nvcc-${CUDA_VERSION_DASH}         cuda-cudart-${CUDA_VERSION_DASH}         cuda-nvrtc-${CUDA_VERSION_DASH}         cuda-cuobjdump-${CUDA_VERSION_DASH}         libcurand-dev-${CUDA_VERSION_DASH}         libcublas-dev-${CUDA_VERSION_DASH}         libnuma-dev         numactl &&     NCCL_VER=$(apt-cache madison libnccl-dev | grep "+cuda${CUDA_VERSION_SHORT}" | head -1 | awk -F'|' '{gsub(/^ +| +$/, "", $2); print $2}') &&     apt-get install -y --no-install-recommends --allow-change-held-packages libnccl-dev=${NCCL_VER} libnccl2=${NCCL_VER} &&     rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2026-06-03 23:51:10  568.14MB 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c apt-get update -y     && apt-get install -y --no-install-recommends         software-properties-common         curl         sudo         ffmpeg         libsm6         libxext6         libgl1     && if [ ! -z ${DEADSNAKES_MIRROR_URL} ] ; then         if [ ! -z "${DEADSNAKES_GPGKEY_URL}" ] ; then             mkdir -p -m 0755 /etc/apt/keyrings ;             curl -L ${DEADSNAKES_GPGKEY_URL} | gpg --dearmor > /etc/apt/keyrings/deadsnakes.gpg ;             sudo chmod 644 /etc/apt/keyrings/deadsnakes.gpg ;             echo "deb [signed-by=/etc/apt/keyrings/deadsnakes.gpg] ${DEADSNAKES_MIRROR_URL} $(lsb_release -cs) main" > /etc/apt/sources.list.d/deadsnakes.list ;         fi ;     else         for i in 1 2 3; do             add-apt-repository -y ppa:deadsnakes/ppa && break ||             { echo "Attempt $i failed, retrying in 5s..."; sleep 5; };         done ;     fi     && apt-get update -y     && apt-get install -y --no-install-recommends         python${PYTHON_VERSION}         python${PYTHON_VERSION}-dev         python${PYTHON_VERSION}-venv         libibverbs-dev     && rm -rf /var/lib/apt/lists/*     && update-alternatives --install /usr/bin/python3 python3 /usr/bin/python${PYTHON_VERSION} 1     && update-alternatives --set python3 /usr/bin/python${PYTHON_VERSION}     && ln -sf /usr/bin/python${PYTHON_VERSION}-config /usr/bin/python3-config     && rm -f /usr/lib/python${PYTHON_VERSION}/EXTERNALLY-MANAGED     && curl -sS ${GET_PIP_URL} | python${PYTHON_VERSION}     && python3 --version && python3 -m pip --version # buildkit
                        
# 2026-06-03 23:49:43  136.00B 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c PYTHON_VERSION_STR=$(echo ${PYTHON_VERSION} | sed 's/\.//g') &&     echo "export PYTHON_VERSION_STR=${PYTHON_VERSION_STR}" >> /etc/environment # buildkit
                        
# 2026-06-03 23:49:27  0.00B 设置工作目录为/vllm-workspace
WORKDIR /vllm-workspace
                        
# 2026-06-03 23:49:27  0.00B 设置环境变量 DEBIAN_FRONTEND
ENV DEBIAN_FRONTEND=noninteractive
                        
# 2026-06-03 23:49:27  0.00B 定义构建参数
ARG GET_PIP_URL
                        
# 2026-06-03 23:49:27  0.00B 定义构建参数
ARG DEADSNAKES_GPGKEY_URL
                        
# 2026-06-03 23:49:27  0.00B 定义构建参数
ARG DEADSNAKES_MIRROR_URL
                        
# 2026-06-03 23:49:27  0.00B 定义构建参数
ARG PYTHON_VERSION
                        
# 2026-06-03 23:49:27  0.00B 定义构建参数
ARG CUDA_VERSION
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 NVIDIA_DRIVER_CAPABILITIES
ENV NVIDIA_DRIVER_CAPABILITIES=compute,utility
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 NVIDIA_VISIBLE_DEVICES
ENV NVIDIA_VISIBLE_DEVICES=all
                        
# 2025-09-09 01:23:07  17.29KB 复制新文件或目录到容器中
COPY NGC-DL-CONTAINER-LICENSE / # buildkit
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 PATH
ENV PATH=/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
                        
# 2025-09-09 01:23:07  22.00B 执行命令并创建新的镜像层
RUN |1 TARGETARCH=arm64 /bin/sh -c echo "/usr/local/cuda/lib64" >> /etc/ld.so.conf.d/nvidia.conf # buildkit
                        
# 2025-09-09 01:23:07  303.50MB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=arm64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     cuda-cudart-13-0=${NV_CUDA_CUDART_VERSION}     cuda-compat-13-0     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 CUDA_VERSION
ENV CUDA_VERSION=13.0.1
                        
# 2025-09-09 01:23:07  10.21MB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=arm64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     gnupg2 curl ca-certificates &&     curl -fsSLO https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/${NVARCH}/cuda-keyring_1.1-1_all.deb &&     dpkg -i cuda-keyring_1.1-1_all.deb &&     apt-get purge --autoremove -y curl     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-09-09 01:23:07  0.00B 添加元数据标签
LABEL maintainer=NVIDIA CORPORATION <cudatools@nvidia.com>
                        
# 2025-09-09 01:23:07  0.00B 定义构建参数
ARG TARGETARCH
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 NV_CUDA_CUDART_VERSION
ENV NV_CUDA_CUDART_VERSION=13.0.88-1
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 NVIDIA_REQUIRE_CUDA brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand
ENV NVIDIA_REQUIRE_CUDA=cuda>=13.0 brand=unknown,driver>=535,driver<536 brand=grid,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=vapps,driver>=535,driver<536 brand=vpc,driver>=535,driver<536 brand=vcs,driver>=535,driver<536 brand=vws,driver>=535,driver<536 brand=cloudgaming,driver>=535,driver<536 brand=unknown,driver>=550,driver<551 brand=grid,driver>=550,driver<551 brand=tesla,driver>=550,driver<551 brand=nvidia,driver>=550,driver<551 brand=quadro,driver>=550,driver<551 brand=quadrortx,driver>=550,driver<551 brand=nvidiartx,driver>=550,driver<551 brand=vapps,driver>=550,driver<551 brand=vpc,driver>=550,driver<551 brand=vcs,driver>=550,driver<551 brand=vws,driver>=550,driver<551 brand=cloudgaming,driver>=550,driver<551 brand=unknown,driver>=565,driver<566 brand=grid,driver>=565,driver<566 brand=tesla,driver>=565,driver<566 brand=nvidia,driver>=565,driver<566 brand=quadro,driver>=565,driver<566 brand=quadrortx,driver>=565,driver<566 brand=nvidiartx,driver>=565,driver<566 brand=vapps,driver>=565,driver<566 brand=vpc,driver>=565,driver<566 brand=vcs,driver>=565,driver<566 brand=vws,driver>=565,driver<566 brand=cloudgaming,driver>=565,driver<566 brand=unknown,driver>=570,driver<571 brand=grid,driver>=570,driver<571 brand=tesla,driver>=570,driver<571 brand=nvidia,driver>=570,driver<571 brand=quadro,driver>=570,driver<571 brand=quadrortx,driver>=570,driver<571 brand=nvidiartx,driver>=570,driver<571 brand=vapps,driver>=570,driver<571 brand=vpc,driver>=570,driver<571 brand=vcs,driver>=570,driver<571 brand=vws,driver>=570,driver<571 brand=cloudgaming,driver>=570,driver<571 brand=unknown,driver>=575,driver<576 brand=grid,driver>=575,driver<576 brand=tesla,driver>=575,driver<576 brand=nvidia,driver>=575,driver<576 brand=quadro,driver>=575,driver<576 brand=quadrortx,driver>=575,driver<576 brand=nvidiartx,driver>=575,driver<576 brand=vapps,driver>=575,driver<576 brand=vpc,driver>=575,driver<576 brand=vcs,driver>=575,driver<576 brand=vws,driver>=575,driver<576 brand=cloudgaming,driver>=575,driver<576
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 NVARCH
ENV NVARCH=sbsa
                        
# 2025-08-20 01:21:19  0.00B 
/bin/sh -c #(nop)  CMD ["/bin/bash"]
                        
# 2025-08-20 01:21:19  69.25MB 
/bin/sh -c #(nop) ADD file:5f2c65daac761cc691b34ee3e3e2ba42ec520d71fc59aef131d38058a7891ab8 in / 
                        
# 2025-08-20 01:21:17  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.version=22.04
                        
# 2025-08-20 01:21:17  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.ref.name=ubuntu
                        
# 2025-08-20 01:21:17  0.00B 
/bin/sh -c #(nop)  ARG LAUNCHPAD_BUILD_ARCH
                        
# 2025-08-20 01:21:17  0.00B 
/bin/sh -c #(nop)  ARG RELEASE
                        
                    

镜像信息

{
    "Id": "sha256:40961599052435ac6a09bda4c79b77d67b0edb525a14306535dff41d2333cc29",
    "RepoTags": [
        "vllm/vllm-openai:gemma4-unified",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-unified-linuxarm64"
    ],
    "RepoDigests": [
        "vllm/vllm-openai@sha256:e828735fba48bca2cf9701864d41693c91953394c5b1455b4668edd7563ed450",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai@sha256:284ea13101b13fc7dcc9d6d1beac095fa6490220548a53e4247b86b7242d1d90"
    ],
    "Parent": "",
    "Comment": "buildkit.dockerfile.v0",
    "Created": "2026-06-03T16:46:11.136255302Z",
    "Container": "",
    "ContainerConfig": null,
    "DockerVersion": "",
    "Author": "",
    "Config": {
        "Hostname": "",
        "Domainname": "",
        "User": "",
        "AttachStdin": false,
        "AttachStdout": false,
        "AttachStderr": false,
        "Tty": false,
        "OpenStdin": false,
        "StdinOnce": false,
        "Env": [
            "PATH=/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
            "NVARCH=sbsa",
            "NVIDIA_REQUIRE_CUDA=cuda\u003e=13.0 brand=unknown,driver\u003e=535,driver\u003c536 brand=grid,driver\u003e=535,driver\u003c536 brand=tesla,driver\u003e=535,driver\u003c536 brand=nvidia,driver\u003e=535,driver\u003c536 brand=quadro,driver\u003e=535,driver\u003c536 brand=quadrortx,driver\u003e=535,driver\u003c536 brand=nvidiartx,driver\u003e=535,driver\u003c536 brand=vapps,driver\u003e=535,driver\u003c536 brand=vpc,driver\u003e=535,driver\u003c536 brand=vcs,driver\u003e=535,driver\u003c536 brand=vws,driver\u003e=535,driver\u003c536 brand=cloudgaming,driver\u003e=535,driver\u003c536 brand=unknown,driver\u003e=550,driver\u003c551 brand=grid,driver\u003e=550,driver\u003c551 brand=tesla,driver\u003e=550,driver\u003c551 brand=nvidia,driver\u003e=550,driver\u003c551 brand=quadro,driver\u003e=550,driver\u003c551 brand=quadrortx,driver\u003e=550,driver\u003c551 brand=nvidiartx,driver\u003e=550,driver\u003c551 brand=vapps,driver\u003e=550,driver\u003c551 brand=vpc,driver\u003e=550,driver\u003c551 brand=vcs,driver\u003e=550,driver\u003c551 brand=vws,driver\u003e=550,driver\u003c551 brand=cloudgaming,driver\u003e=550,driver\u003c551 brand=unknown,driver\u003e=565,driver\u003c566 brand=grid,driver\u003e=565,driver\u003c566 brand=tesla,driver\u003e=565,driver\u003c566 brand=nvidia,driver\u003e=565,driver\u003c566 brand=quadro,driver\u003e=565,driver\u003c566 brand=quadrortx,driver\u003e=565,driver\u003c566 brand=nvidiartx,driver\u003e=565,driver\u003c566 brand=vapps,driver\u003e=565,driver\u003c566 brand=vpc,driver\u003e=565,driver\u003c566 brand=vcs,driver\u003e=565,driver\u003c566 brand=vws,driver\u003e=565,driver\u003c566 brand=cloudgaming,driver\u003e=565,driver\u003c566 brand=unknown,driver\u003e=570,driver\u003c571 brand=grid,driver\u003e=570,driver\u003c571 brand=tesla,driver\u003e=570,driver\u003c571 brand=nvidia,driver\u003e=570,driver\u003c571 brand=quadro,driver\u003e=570,driver\u003c571 brand=quadrortx,driver\u003e=570,driver\u003c571 brand=nvidiartx,driver\u003e=570,driver\u003c571 brand=vapps,driver\u003e=570,driver\u003c571 brand=vpc,driver\u003e=570,driver\u003c571 brand=vcs,driver\u003e=570,driver\u003c571 brand=vws,driver\u003e=570,driver\u003c571 brand=cloudgaming,driver\u003e=570,driver\u003c571 brand=unknown,driver\u003e=575,driver\u003c576 brand=grid,driver\u003e=575,driver\u003c576 brand=tesla,driver\u003e=575,driver\u003c576 brand=nvidia,driver\u003e=575,driver\u003c576 brand=quadro,driver\u003e=575,driver\u003c576 brand=quadrortx,driver\u003e=575,driver\u003c576 brand=nvidiartx,driver\u003e=575,driver\u003c576 brand=vapps,driver\u003e=575,driver\u003c576 brand=vpc,driver\u003e=575,driver\u003c576 brand=vcs,driver\u003e=575,driver\u003c576 brand=vws,driver\u003e=575,driver\u003c576 brand=cloudgaming,driver\u003e=575,driver\u003c576",
            "NV_CUDA_CUDART_VERSION=13.0.88-1",
            "CUDA_VERSION=13.0.1",
            "LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64",
            "NVIDIA_VISIBLE_DEVICES=all",
            "NVIDIA_DRIVER_CAPABILITIES=compute,utility",
            "DEBIAN_FRONTEND=noninteractive",
            "UV_HTTP_TIMEOUT=500",
            "UV_INDEX_STRATEGY=unsafe-best-match",
            "UV_LINK_MODE=copy",
            "UV_PYTHON_INSTALL_DIR=/opt/uv/python",
            "UV_CACHE_DIR=/opt/uv/cache",
            "VLLM_ENABLE_CUDA_COMPATIBILITY=0",
            "TORCH_CUDA_ARCH_LIST=8.7 8.9 9.0 10.0+PTX 12.0 12.1",
            "VLLM_USAGE_SOURCE=production-docker-image",
            "VLLM_BUILD_COMMIT=unknown",
            "VLLM_BUILD_PIPELINE=local",
            "VLLM_BUILD_URL=",
            "VLLM_IMAGE_TAG=local/vllm-openai:dev"
        ],
        "Cmd": null,
        "Image": "",
        "Volumes": null,
        "WorkingDir": "/vllm-workspace",
        "Entrypoint": [
            "vllm",
            "serve"
        ],
        "OnBuild": null,
        "Labels": {
            "ai.vllm.build.commit": "unknown",
            "ai.vllm.build.pipeline": "local",
            "ai.vllm.build.url": "",
            "ai.vllm.image.tag": "local/vllm-openai:dev",
            "maintainer": "NVIDIA CORPORATION \u003ccudatools@nvidia.com\u003e",
            "org.opencontainers.image.ref.name": "ubuntu",
            "org.opencontainers.image.revision": "unknown",
            "org.opencontainers.image.source": "https://github.com/vllm-project/vllm",
            "org.opencontainers.image.url": "",
            "org.opencontainers.image.version": "local/vllm-openai:dev"
        }
    },
    "Architecture": "arm64",
    "Os": "linux",
    "Size": 19459122137,
    "GraphDriver": {
        "Data": {
            "LowerDir": "/var/lib/docker/overlay2/a94146f87179cdc672bf4755621a1f0d0519117c2129e0ff52059efaa0b104cc/diff:/var/lib/docker/overlay2/082ffdf54cf690d196406988b9708ead0da4761a19b96c2e06bf8fcd71c2b7b7/diff:/var/lib/docker/overlay2/1741cfe764a3082bc24f85f86c795fa121ff6c4815d03c37e77614999502e00e/diff:/var/lib/docker/overlay2/66f03a4c4a37ec666006fa93a13e004dd59579bbd57c3ba5b0741c7fb09bdd58/diff:/var/lib/docker/overlay2/39595130b6299841b16052f8a3a7035e25f073a9c80176f7cf7c084c4618bdd9/diff:/var/lib/docker/overlay2/de6fa321c82ba6719a42bdf1bc3f5265d269f91b4173025404b659d3a06dba64/diff:/var/lib/docker/overlay2/ad48bfac7b2cd530f67f4d9364cada9f0706b727e544954581ad90b41ccf5e32/diff:/var/lib/docker/overlay2/ceaf2ec7480b4a6955fed911f2fb3f2289fa0b0f58f0b804fc17f99ccad7bae1/diff:/var/lib/docker/overlay2/9f3422c9e7ad100f93391ad7ce02f8e1c1758345e10ed9a121dd94096ad1afac/diff:/var/lib/docker/overlay2/751c388d15f4ece8fe4640f6ce8a6f93ac2fd67569208438a9806e6de01af62b/diff:/var/lib/docker/overlay2/631435071b6a510983a2885fe9984656669533073298dc8df6f1de3ecc2b36ce/diff:/var/lib/docker/overlay2/5806ec0d61fa3f117ad6ed843a63ea128d08f385e1025d5bdae0c8749b460a9b/diff:/var/lib/docker/overlay2/0d39fa89da53c4c519da73141e305cd4dcf75ec792c5c1ae858aaddd70415817/diff:/var/lib/docker/overlay2/33a0098f67e6cd6f47b03e16e3429c47d8d4fd513757eb993db7ebb3991910d8/diff:/var/lib/docker/overlay2/e8b3438d73cf6919c02aa20268e21ea189293a623e428de09b5a897a01fea27a/diff:/var/lib/docker/overlay2/a118f61dba6a0d1c3d0f2c79b08ed30fa6ef5a1fa1c21ed614f6ab10ad5407ec/diff:/var/lib/docker/overlay2/eb3a07f8132e78457c2ec4cb2bda30806a92c63d50da144ed9c5fce26ec257cf/diff:/var/lib/docker/overlay2/8962310e6d315b1a29de7319d88166f094c656dabf52d2862e9c5443cf95f9df/diff:/var/lib/docker/overlay2/2dca53c31fcef464f3c6125715af252ff525410da45ce77d50c06732a35dc3e9/diff:/var/lib/docker/overlay2/952a358dd11ec2c76c2c432968d16ba4cab3f4b05e155ab0a93dba45198c5dd7/diff:/var/lib/docker/overlay2/b0c43f3213841835dfee792c9d3318b7bf922347ddf5ed35d13eba994456f58c/diff:/var/lib/docker/overlay2/7289847df4129d23ce70a4ac5d4a1f2e8515ea368358afcbb06ff3ab10820868/diff:/var/lib/docker/overlay2/f5ab7fb0d2eb49b9aa30ce2304e00de76e407e22c560e7524cb22557f778805a/diff:/var/lib/docker/overlay2/e4b69e0aa2e3f55da3b524a3bf1730d9f04f45dd09a02da9e49f4156f28333fd/diff:/var/lib/docker/overlay2/62d1f014655a9e25298fb4ae84797f1ea47ca4af9dcfd59fedfb517673476c73/diff:/var/lib/docker/overlay2/5490cc3132c51ed4f07558e24421ed3851fee5b68230454add5cdf6b6ddee856/diff:/var/lib/docker/overlay2/49ebb4eec6be40157950c5237054a5e5c3eb2997b448813fbacfa324ccd6ad78/diff:/var/lib/docker/overlay2/6272461b4219b6d580d023faabab4be9148476218c86c33e094b7bc86101722d/diff:/var/lib/docker/overlay2/0cb5ceda69595da1d08b2e5dbc9188244a1c7f96578b159cd05dd9d97739caa6/diff:/var/lib/docker/overlay2/fd857c938e50d6ac0bb664d86fa563b44b49494490c152c1959a583e9008450d/diff",
            "MergedDir": "/var/lib/docker/overlay2/ef592503f1a4d0566163616a90cbaa027391f04f2f2dceffe73994eb1a9577f9/merged",
            "UpperDir": "/var/lib/docker/overlay2/ef592503f1a4d0566163616a90cbaa027391f04f2f2dceffe73994eb1a9577f9/diff",
            "WorkDir": "/var/lib/docker/overlay2/ef592503f1a4d0566163616a90cbaa027391f04f2f2dceffe73994eb1a9577f9/work"
        },
        "Name": "overlay2"
    },
    "RootFS": {
        "Type": "layers",
        "Layers": [
            "sha256:3e04ae02e3b48a52cd15f8595efd67e8ab7b3e9766f7d1e12a25c8c76f3a86bf",
            "sha256:71adbbf158ee0f6f368160ea2ed102435749a2720b8eca69dfdb2d1f47a55680",
            "sha256:31dbd0fa0bdc5599759cd67617f924de243a225e1d95776b9e6a5c404f399e43",
            "sha256:4762a460f72d9950e1b8a8b6775e3e631bc418f9f915501879f29f7fec97fcd8",
            "sha256:495bbaf4cc5ca53d8286fe305d2fc2fd3ecb2c8791cfdd4473e984d2eb0f3ea1",
            "sha256:6b4fe05f62b5a83a1aa9fab9cd148cde79f3b3ecccdb69e2ec09c9d3788d5b3f",
            "sha256:21a437ff96dcbb22dea57256ba567dd568862a97315c3a2f39e309187f868c8c",
            "sha256:cd26f579a93e9ca1bef79d953e83f2a24d2aa28a136eb5ccf2d65f5352785b82",
            "sha256:bae04ca4cdd73b8c7d03a129cfdbecc5c6d77eb95fd9d667cc64aff4242a7865",
            "sha256:6cc43210f6aaf516742963fbad20837f3412f02fb8b72a4c06962cdabdb216fa",
            "sha256:8bf9afbc2293408a6f159ce19bd6206178ef92d4db9e4400517cb1ed8ad5fa16",
            "sha256:2223cd9353c1473559b0f6f1f5fdb26a5768b3115373a07afe724ae5d86fd1a2",
            "sha256:d579c02c06932f2c5be396d67080b15cf888a491267e384482ed72c89e18d381",
            "sha256:d579c02c06932f2c5be396d67080b15cf888a491267e384482ed72c89e18d381",
            "sha256:d184d33be1dd7d1004c703faa3e5156d9d1d7b0e7c9a9698fc3b7b2cf7bbfb4d",
            "sha256:9ed92493ba684b577e626eae6e5b4aef8da6c2e94d6c51374989465d61953306",
            "sha256:bde2e4b8ca764981420d576f097f50fecaf89006ac51be11c922bb7729b78a3d",
            "sha256:234fcbda5a3c5fe7c12b899ef691759839ed5c845ca8884fa57a3397f3365e70",
            "sha256:28b0a14d08d310abdb8c1cd7721bff10a72fc9a5b0f2694262d664518909fda3",
            "sha256:73e80f17d72ecef5ba717f2f144d008133c2246b9998464b6194b32277d7c06e",
            "sha256:399be1b0187b4ee88a9fb5c2a679f4a78ff2f651dc5578ca3d4ae7a58443ce9f",
            "sha256:3f16ed90e75511a68c0b981e1d4a755d365fb29997357c6288105fefd578ef90",
            "sha256:8941d622ec3d081c3213b455e7c213f2cd303e24c863606a919d6b0054d6e781",
            "sha256:5f70bf18a086007016e948b04aed3b82103a36bea41755b6cddfaf10ace3c6ef",
            "sha256:957161255b0dd3ac5aa3cef604a70a0c6dc46fe3d9e2a0b456ea183188baa5ff",
            "sha256:1cb2da68fb6ccad0f61b6f897deea0282b89d06f5c2ae6e30c34786a8a0c9f65",
            "sha256:95bdcbd27579e8e4464e78a9f443ca3357d3b03a9a557ef8ae65f9cf3cd55377",
            "sha256:93dcd26e7b7b0c0cccc6ab1365ffb280a8ace13dc46bf51f334dd35437d52bed",
            "sha256:ea7cbec394accf10113cbb5d38be2eebbebf5716c5a1057f535e43da00724770",
            "sha256:3fd3c622b106831ca31304867a2b2b8bb2b14cfc3af1da440901e5e91efaf0e9",
            "sha256:5f70bf18a086007016e948b04aed3b82103a36bea41755b6cddfaf10ace3c6ef"
        ]
    },
    "Metadata": {
        "LastTagTime": "2026-06-11T03:48:02.947664797+08:00"
    }
}

更多版本

docker.io/vllm/vllm-openai:v0.5.4

linux/amd64 docker.io9.90GB2024-09-07 06:20
3482

docker.io/vllm/vllm-openai:v0.6.0

linux/amd64 docker.io9.72GB2024-09-11 01:51
1887

docker.io/vllm/vllm-openai:v0.6.1.post2

linux/amd64 docker.io9.81GB2024-09-24 01:43
1430

docker.io/vllm/vllm-openai:latest

linux/amd64 docker.io10.24GB2024-10-11 00:43
9679

docker.io/vllm/vllm-openai:v0.6.4.post1

linux/amd64 docker.io10.64GB2024-11-19 00:42
1549

docker.io/vllm/vllm-openai:v0.6.4

linux/amd64 docker.io10.64GB2024-12-11 02:08
1215

docker.io/vllm/vllm-openai:v0.6.3

linux/amd64 docker.io10.43GB2024-12-12 02:41
1538

docker.io/vllm/vllm-openai:v0.6.6

linux/amd64 docker.io10.23GB2025-01-04 00:37
1982

docker.io/vllm/vllm-openai:v0.6.6.post1

linux/amd64 docker.io10.23GB2025-01-24 00:21
1639

docker.io/vllm/vllm-openai:v0.7.1

linux/amd64 docker.io16.53GB2025-02-08 02:05
1456

docker.io/vllm/vllm-openai:v0.7.2

linux/amd64 docker.io16.53GB2025-02-09 00:28
3421

docker.io/vllm/vllm-openai:v0.7.3

linux/amd64 docker.io16.43GB2025-02-24 00:50
4199

docker.io/vllm/vllm-openai:v0.8.0

linux/amd64 docker.io16.62GB2025-03-20 00:23
1712

docker.io/vllm/vllm-openai:v0.8.1

linux/amd64 docker.io16.62GB2025-03-21 00:28
1407

docker.io/vllm/vllm-openai:v0.8.2

linux/amd64 docker.io16.92GB2025-03-27 01:12
1632

docker.io/vllm/vllm-openai:v0.8.3

linux/amd64 docker.io17.13GB2025-04-08 00:58
1683

docker.io/vllm/vllm-openai:v0.8.4

linux/amd64 docker.io17.16GB2025-04-17 01:16
2204

docker.io/vllm/vllm-openai:v0.8.5

linux/amd64 docker.io17.30GB2025-04-30 02:45
3829

docker.io/vllm/vllm-openai:v0.8.5.post1

linux/amd64 docker.io17.30GB2025-05-07 02:06
4011

docker.io/vllm/vllm-openai:v0.9.0.1

linux/amd64 docker.io20.81GB2025-06-05 01:12
2729

docker.io/vllm/vllm-openai:v0.9.1

linux/amd64 docker.io20.85GB2025-06-12 01:29
3337

docker.io/vllm/vllm-openai:v0.9.2

linux/amd64 docker.io20.76GB2025-07-09 03:00
8168

docker.io/vllm/vllm-openai:v0.10.0

linux/amd64 docker.io26.13GB2025-07-26 03:15
2270

docker.io/vllm/vllm-openai:gptoss

linux/amd64 docker.io33.86GB2025-08-07 01:52
1597

docker.io/vllm/vllm-openai:v0.10.1

linux/amd64 docker.io20.25GB2025-08-20 03:05
1551

docker.io/vllm/vllm-openai:v0.10.1.1

linux/amd64 docker.io20.26GB2025-08-23 01:43
2644

docker.io/vllm/vllm-openai:v0.10.2

linux/amd64 docker.io22.49GB2025-09-16 03:40
2007

docker.io/vllm/vllm-openai:v0.2.7

linux/amd64 docker.io6.34GB2025-10-01 01:07
585

docker.io/vllm/vllm-openai:v0.11.0-x86_64

linux/amd64 docker.io25.86GB2025-10-09 02:14
2708

docker.io/vllm/vllm-openai:v0.10.2-x86_64

linux/amd64 docker.io22.49GB2025-10-09 02:22
713

docker.io/vllm/vllm-openai:v0.11.0

linux/amd64 docker.io25.86GB2025-10-09 11:24
2799

docker.io/vllm/vllm-openai:v0.11.0

linux/arm64 docker.io24.17GB2025-10-30 00:47
1195

docker.io/vllm/vllm-openai:v0.3.3

linux/amd64 docker.io9.13GB2025-11-18 01:01
545

docker.io/vllm/vllm-openai:v0.11.1

linux/amd64 docker.io28.72GB2025-11-21 01:03
937

docker.io/vllm/vllm-openai:v0.11.2

linux/amd64 docker.io28.82GB2025-11-22 00:46
2116

docker.io/vllm/vllm-openai:v0.11.1

linux/arm64 docker.io26.54GB2025-11-22 01:23
556

docker.io/vllm/vllm-openai:v0.4.0

linux/amd64 docker.io9.88GB2025-11-22 01:58
551

docker.io/vllm/vllm-openai:v0.11.2

linux/arm64 docker.io26.54GB2025-11-22 04:06
744

docker.io/vllm/vllm-openai:nightly

linux/amd64 docker.io18.74GB2025-12-03 02:43
2677

docker.io/vllm/vllm-openai:v0.12.0-aarch64

linux/arm64 docker.io17.89GB2025-12-05 03:12
1051

docker.io/vllm/vllm-openai:v0.12.0

linux/amd64 docker.io19.47GB2025-12-05 03:59
3260

docker.io/vllm/vllm-openai:v0.13.0

linux/amd64 docker.io19.51GB2026-01-22 01:41
1769

docker.io/vllm/vllm-openai:v0.14.0

linux/amd64 docker.io19.66GB2026-01-22 03:16
793

docker.io/vllm/vllm-openai:v0.14.1

linux/amd64 docker.io19.69GB2026-01-27 01:52
952

docker.io/vllm/vllm-openai:v0.15.0

linux/amd64 docker.io20.13GB2026-01-31 00:51
1239
662

docker.io/vllm/vllm-openai:v0.15.1

linux/amd64 docker.io20.14GB2026-02-06 01:14
1065

docker.io/vllm/vllm-openai:v0.15.1-cu130

linux/amd64 docker.io18.77GB2026-02-07 00:39
903

docker.io/vllm/vllm-openai:latest

linux/arm64 docker.io20.65GB2026-02-08 00:59
670

docker.io/vllm/vllm-openai:v0.15.1-aarch64-cu130

linux/arm64 docker.io19.60GB2026-02-10 00:44
729

docker.io/vllm/vllm-openai:glm5

linux/amd64 docker.io20.27GB2026-02-14 00:56
697

docker.io/vllm/vllm-openai:qwen3_5

linux/amd64 docker.io20.93GB2026-02-25 01:18
1681

docker.io/vllm/vllm-openai:qwen3_5-x86_64-cu129

linux/amd64 docker.io20.93GB2026-02-27 00:33
710
487

docker.io/vllm/vllm-openai:v0.16.0

linux/amd64 docker.io20.37GB2026-02-28 01:04
899

docker.io/vllm/vllm-openai:v0.16.0-cu130

linux/amd64 docker.io19.01GB2026-02-28 02:55
611
465

docker.io/vllm/vllm-openai:v0.5.1

linux/amd64 docker.io10.40GB2026-03-06 01:01
293

docker.io/vllm/vllm-openai:v0.4.0.post1

linux/amd64 docker.io9.88GB2026-03-06 01:29
355

docker.io/vllm/vllm-openai:cu130-nightly

linux/amd64 docker.io19.55GB2026-03-06 02:04
924
427

docker.io/vllm/vllm-openai:v0.17.0

linux/amd64 docker.io20.75GB2026-03-08 02:27
1906

docker.io/vllm/vllm-openai:v0.17.0-cu130

linux/amd64 docker.io19.55GB2026-03-10 01:39
479

docker.io/vllm/vllm-openai:v0.17.0

linux/arm64 docker.io21.50GB2026-03-11 01:42
520

docker.io/vllm/vllm-openai:v0.4.3

linux/amd64 docker.io7.86GB2026-03-11 02:04
323

docker.io/vllm/vllm-openai:v0.13.0

linux/arm64 docker.io17.98GB2026-03-11 03:52
238

docker.io/vllm/vllm-openai:v0.17.1-cu130

linux/amd64 docker.io19.55GB2026-03-13 02:22
916

docker.io/vllm/vllm-openai:v0.17.1

linux/amd64 docker.io20.75GB2026-03-14 02:34
905

docker.io/vllm/vllm-openai:v0.14.0

linux/arm64 docker.io20.19GB2026-03-17 02:20
288

docker.io/vllm/vllm-openai:v0.18.0

linux/amd64 docker.io22.40GB2026-03-23 01:43
987

docker.io/vllm/vllm-openai:v0.18.0-cu130

linux/amd64 docker.io19.65GB2026-03-24 01:29
770

docker.io/vllm/vllm-openai-cpu:latest-x86_64

linux/amd64 docker.io3.48GB2026-03-28 10:31
334

docker.io/vllm/vllm-openai-cpu:latest-arm64

linux/arm64 docker.io2.23GB2026-03-28 10:34
235

docker.io/vllm/vllm-openai-rocm:latest

linux/amd64 docker.io24.49GB2026-03-30 00:46
397

docker.io/vllm/vllm-openai:latest-cu130

linux/amd64 docker.io19.65GB2026-03-31 00:47
999

docker.io/vllm/vllm-openai:v0.18.1-cu130

linux/amd64 docker.io19.66GB2026-04-02 00:39
314

docker.io/vllm/vllm-openai:v0.18.1

linux/amd64 docker.io22.41GB2026-04-02 01:12
490

docker.io/vllm/vllm-openai:v0.19.0-ubuntu2404

linux/amd64 docker.io22.35GB2026-04-04 02:18
498

docker.io/vllm/vllm-openai:gemma4-cu130

linux/amd64 docker.io21.18GB2026-04-04 03:13
1691

docker.io/vllm/vllm-openai:v0.19.0

linux/amd64 docker.io22.41GB2026-04-04 03:47
1080

docker.io/vllm/vllm-openai-cpu:v0.19.0

linux/amd64 docker.io3.49GB2026-04-06 07:48
439

docker.io/vllm/vllm-openai-cpu:v0.19.0

linux/arm64 docker.io2.25GB2026-04-06 07:51
326

docker.io/vllm/vllm-openai:v0.19.0-cu130-ubuntu2404

linux/amd64 docker.io19.74GB2026-04-08 03:21
517

docker.io/vllm/vllm-openai:gemma4

linux/amd64 docker.io23.92GB2026-04-09 00:39
703
384

docker.io/vllm/vllm-openai:qwen3_5-cu130

linux/amd64 docker.io19.60GB2026-04-11 00:51
441

docker.io/vllm/vllm-openai:v0.19.0-cu130

linux/amd64 docker.io19.66GB2026-04-12 00:29
374

docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130

linux/amd64 docker.io19.72GB2026-04-12 01:06
465

docker.io/vllm/vllm-openai:gemma4-cu130

linux/arm64 docker.io20.43GB2026-04-13 01:49
525

docker.io/vllm/vllm-openai:v0.19.0-cu130

linux/arm64 docker.io20.37GB2026-04-13 02:18
381

docker.io/vllm/vllm-openai:gemma4

linux/arm64 docker.io23.20GB2026-04-13 03:21
342

docker.io/vllm/vllm-openai:cu130-nightly-aarch64

linux/arm64 docker.io21.48GB2026-04-14 00:35
414

docker.io/vllm/vllm-openai:minimax27

linux/amd64 docker.io24.06GB2026-04-14 01:09
430

docker.io/vllm/vllm-openai:glm51

linux/amd64 docker.io22.46GB2026-04-16 03:22
472

docker.io/vllm/vllm-openai:glm51

linux/arm64 docker.io23.19GB2026-04-17 01:16
187

docker.io/vllm/vllm-openai:glm51-cu130

linux/amd64 docker.io19.71GB2026-04-17 01:41
408

docker.io/vllm/vllm-openai-cpu:v0.19.0-x86_64

linux/amd64 docker.io3.49GB2026-04-17 23:45
328
178

docker.io/vllm/vllm-openai:v0.19.1-cu130

linux/amd64 docker.io19.60GB2026-04-20 00:28
1087