广告图片

docker.io/vllm/vllm-openai:gemma4 linux/arm64

docker.io/vllm/vllm-openai:gemma4 - 国内下载镜像源 浏览次数:8 温馨提示: 这是一个 linux/arm64 系统架构镜像
这是镜像描述:

vllm/openai

基于 OpenAI 的 GPT-3 模型的 API 服务,支持自然语言处理等功能。

源镜像 docker.io/vllm/vllm-openai:gemma4
国内镜像 swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-linuxarm64
镜像ID sha256:045b622b6bdc689fa46568f3b21a18af7e67b30d58c06647d9f6e2b61ac01185
镜像TAG gemma4-linuxarm64
大小 23.20GB
镜像源 docker.io
项目信息 Docker-Hub主页 🚀项目TAG 🚀
CMD
启动入口 vllm serve
工作目录 /vllm-workspace
OS/平台 linux/arm64
浏览量 8 次
贡献者
镜像创建 2026-04-10T03:53:01.415837076Z
同步时间 2026-04-13 03:21
环境变量
PATH=/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin NVARCH=sbsa NVIDIA_REQUIRE_CUDA=cuda>=12.9 brand=unknown,driver>=535,driver<536 brand=grid,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=vapps,driver>=535,driver<536 brand=vpc,driver>=535,driver<536 brand=vcs,driver>=535,driver<536 brand=vws,driver>=535,driver<536 brand=cloudgaming,driver>=535,driver<536 brand=unknown,driver>=550,driver<551 brand=grid,driver>=550,driver<551 brand=tesla,driver>=550,driver<551 brand=nvidia,driver>=550,driver<551 brand=quadro,driver>=550,driver<551 brand=quadrortx,driver>=550,driver<551 brand=nvidiartx,driver>=550,driver<551 brand=vapps,driver>=550,driver<551 brand=vpc,driver>=550,driver<551 brand=vcs,driver>=550,driver<551 brand=vws,driver>=550,driver<551 brand=cloudgaming,driver>=550,driver<551 brand=unknown,driver>=560,driver<561 brand=grid,driver>=560,driver<561 brand=tesla,driver>=560,driver<561 brand=nvidia,driver>=560,driver<561 brand=quadro,driver>=560,driver<561 brand=quadrortx,driver>=560,driver<561 brand=nvidiartx,driver>=560,driver<561 brand=vapps,driver>=560,driver<561 brand=vpc,driver>=560,driver<561 brand=vcs,driver>=560,driver<561 brand=vws,driver>=560,driver<561 brand=cloudgaming,driver>=560,driver<561 brand=unknown,driver>=565,driver<566 brand=grid,driver>=565,driver<566 brand=tesla,driver>=565,driver<566 brand=nvidia,driver>=565,driver<566 brand=quadro,driver>=565,driver<566 brand=quadrortx,driver>=565,driver<566 brand=nvidiartx,driver>=565,driver<566 brand=vapps,driver>=565,driver<566 brand=vpc,driver>=565,driver<566 brand=vcs,driver>=565,driver<566 brand=vws,driver>=565,driver<566 brand=cloudgaming,driver>=565,driver<566 brand=unknown,driver>=570,driver<571 brand=grid,driver>=570,driver<571 brand=tesla,driver>=570,driver<571 brand=nvidia,driver>=570,driver<571 brand=quadro,driver>=570,driver<571 brand=quadrortx,driver>=570,driver<571 brand=nvidiartx,driver>=570,driver<571 brand=vapps,driver>=570,driver<571 brand=vpc,driver>=570,driver<571 brand=vcs,driver>=570,driver<571 brand=vws,driver>=570,driver<571 brand=cloudgaming,driver>=570,driver<571 NV_CUDA_CUDART_VERSION=12.9.79-1 CUDA_VERSION=12.9.1 LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64 NVIDIA_VISIBLE_DEVICES=all NVIDIA_DRIVER_CAPABILITIES=compute,utility DEBIAN_FRONTEND=noninteractive UV_HTTP_TIMEOUT=500 UV_INDEX_STRATEGY=unsafe-best-match UV_LINK_MODE=copy VLLM_ENABLE_CUDA_COMPATIBILITY=0 TORCH_CUDA_ARCH_LIST=8.7 8.9 9.0 10.0+PTX 12.0 VLLM_USAGE_SOURCE=production-docker-image
镜像标签
NVIDIA CORPORATION <cudatools@nvidia.com>: maintainer ubuntu: org.opencontainers.image.ref.name 22.04: org.opencontainers.image.version

Docker拉取命令

docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-linuxarm64
docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-linuxarm64  docker.io/vllm/vllm-openai:gemma4

Containerd拉取命令

ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-linuxarm64
ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-linuxarm64  docker.io/vllm/vllm-openai:gemma4

Shell快速替换命令

sed -i 's#vllm/vllm-openai:gemma4#swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-linuxarm64#' deployment.yaml

Ansible快速分发-Docker

#ansible k8s -m shell -a 'docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-linuxarm64 && docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-linuxarm64  docker.io/vllm/vllm-openai:gemma4'

Ansible快速分发-Containerd

#ansible k8s -m shell -a 'ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-linuxarm64 && ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-linuxarm64  docker.io/vllm/vllm-openai:gemma4'

镜像构建历史


# 2026-04-10 11:53:01  0.00B 配置容器启动时运行的命令
ENTRYPOINT ["vllm" "serve"]
                        
# 2026-04-10 11:53:01  0.00B 设置环境变量 VLLM_USAGE_SOURCE
ENV VLLM_USAGE_SOURCE=production-docker-image
                        
# 2026-04-10 11:53:01  2.08GB 执行命令并创建新的镜像层
RUN |8 TARGETPLATFORM=linux/arm64 INSTALL_KV_CONNECTORS=true CUDA_VERSION=12.9.1 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= torch_cuda_arch_list=8.7 8.9 9.0 10.0+PTX 12.0 /bin/sh -c CUDA_MAJOR="${CUDA_VERSION%%.*}";     CUDA_VERSION_DASH=$(echo $CUDA_VERSION | cut -d. -f1,2 | tr '.' '-');     CUDA_HOME=/usr/local/cuda;     BUILD_PKGS="libcusparse-dev-${CUDA_VERSION_DASH}                 libcublas-dev-${CUDA_VERSION_DASH}                 libcusolver-dev-${CUDA_VERSION_DASH}";     if [ "$INSTALL_KV_CONNECTORS" = "true" ]; then         if [ "$CUDA_MAJOR" -ge 13 ]; then             uv pip install --system nixl-cu13;         fi;         uv pip install --system -r /tmp/kv_connectors.txt --no-build || (             apt-get update -y &&             apt-get install -y --no-install-recommends ${BUILD_PKGS} &&             uv pip install --system -r /tmp/kv_connectors.txt --no-build-isolation &&             apt-get purge -y ${BUILD_PKGS} &&             rm -rf /var/lib/apt/lists/*         );     fi # buildkit
                        
# 2026-04-10 11:46:14  0.00B 设置环境变量 TORCH_CUDA_ARCH_LIST
ENV TORCH_CUDA_ARCH_LIST=8.7 8.9 9.0 10.0+PTX 12.0
                        
# 2026-04-10 11:46:14  0.00B 定义构建参数
ARG torch_cuda_arch_list=7.0 7.5 8.0 8.9 9.0 10.0 12.0
                        
# 2026-04-10 11:46:14  0.00B 设置环境变量 UV_HTTP_TIMEOUT
ENV UV_HTTP_TIMEOUT=500
                        
# 2026-04-10 11:46:14  0.00B 定义构建参数
ARG PIP_EXTRA_INDEX_URL UV_EXTRA_INDEX_URL
                        
# 2026-04-10 11:46:14  0.00B 定义构建参数
ARG PIP_INDEX_URL UV_INDEX_URL
                        
# 2026-04-10 11:46:14  0.00B 定义构建参数
ARG CUDA_VERSION
                        
# 2026-04-10 11:46:14  0.00B 定义构建参数
ARG INSTALL_KV_CONNECTORS=false
                        
# 2026-04-10 11:46:14  0.00B 定义构建参数
ARG TARGETPLATFORM
                        
# 2026-04-10 11:46:14  27.84KB 复制新文件或目录到容器中
COPY ./vllm/collect_env.py . # buildkit
                        
# 2026-04-10 11:46:14  958.90KB 复制新文件或目录到容器中
COPY benchmarks benchmarks # buildkit
                        
# 2026-04-10 11:46:14  1.21MB 复制新文件或目录到容器中
COPY examples examples # buildkit
                        
# 2026-04-10 11:46:14  0.00B 设置环境变量 LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64
                        
# 2026-04-10 11:46:14  39.70MB 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c uv pip install --system ep_kernels/dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.') # buildkit
                        
# 2026-04-10 11:46:13  0.00B 设置环境变量 LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/cuda/lib64:/usr/local/cuda/lib64
                        
# 2026-04-10 11:46:13  49.15MB 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c sh -c 'if ls /tmp/deepgemm/dist/*.whl >/dev/null 2>&1; then               uv pip install --system /tmp/deepgemm/dist/*.whl;            else               echo "No DeepGEMM wheels to install; skipping.";            fi' # buildkit
                        
# 2026-04-10 11:46:12  44.49MB 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c uv pip install --system "transformers==5.5.0" # buildkit
                        
# 2026-04-10 11:46:10  0.00B 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c . /etc/environment && uv pip list # buildkit
                        
# 2026-04-10 11:46:10  1.46GB 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c if [ "${PYTORCH_NIGHTLY}" = "1" ]; then         echo "Installing torch nightly..."         && uv pip install --system $(cat torch_lib_versions.txt | xargs) --pre         --index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/nightly/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.')         && echo "Installing vLLM..."         && uv pip install --system dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/nightly/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.');     else         echo "Installing vLLM..."         && uv pip install --system dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.');     fi # buildkit
                        
# 2026-04-10 11:36:19  71.00B 复制新文件或目录到容器中
COPY /workspace/torch_lib_versions.txt torch_lib_versions.txt # buildkit
                        
# 2026-04-10 11:36:19  0.00B 定义构建参数
ARG PYTORCH_NIGHTLY
                        
# 2026-04-10 11:36:19  0.00B 定义构建参数
ARG PIP_KEYRING_PROVIDER UV_KEYRING_PROVIDER
                        
# 2026-04-10 11:36:19  0.00B 定义构建参数
ARG PYTORCH_CUDA_INDEX_BASE_URL
                        
# 2026-04-10 11:36:19  0.00B 定义构建参数
ARG PIP_EXTRA_INDEX_URL UV_EXTRA_INDEX_URL
                        
# 2026-04-10 11:36:19  0.00B 定义构建参数
ARG PIP_INDEX_URL UV_INDEX_URL
                        
# 2026-04-10 11:36:19  665.44MB 执行命令并创建新的镜像层
RUN |14 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 /bin/sh -c if [ "$TARGETPLATFORM" = "linux/arm64" ]; then         BITSANDBYTES_VERSION="${BITSANDBYTES_VERSION_ARM64}";     else         BITSANDBYTES_VERSION="${BITSANDBYTES_VERSION_X86}";     fi;     uv pip install --system accelerate hf_transfer modelscope         "bitsandbytes>=${BITSANDBYTES_VERSION}" "timm${TIMM_VERSION}" "runai-model-streamer[s3,gcs,azure]${RUNAI_MODEL_STREAMER_VERSION}" # buildkit
                        
# 2026-04-10 11:36:09  0.00B 定义构建参数
ARG RUNAI_MODEL_STREAMER_VERSION=>=0.15.7
                        
# 2026-04-10 11:36:09  0.00B 定义构建参数
ARG TIMM_VERSION=>=1.0.17
                        
# 2026-04-10 11:36:09  0.00B 定义构建参数
ARG BITSANDBYTES_VERSION_ARM64=0.42.0
                        
# 2026-04-10 11:36:09  0.00B 定义构建参数
ARG BITSANDBYTES_VERSION_X86=0.46.1
                        
# 2026-04-10 11:36:09  2.40MB 执行命令并创建新的镜像层
RUN |10 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 /bin/sh -c set -eux;     case "${TARGETPLATFORM}" in       linux/arm64) UUARCH="aarch64" ;;       linux/amd64) UUARCH="x64" ;;       *) echo "Unsupported TARGETPLATFORM: ${TARGETPLATFORM}" >&2; exit 1 ;;     esac;     /tmp/install_gdrcopy.sh "${GDRCOPY_OS_VERSION}" "${GDRCOPY_CUDA_VERSION}" "${UUARCH}" &&     rm /tmp/install_gdrcopy.sh # buildkit
                        
# 2026-04-10 11:36:00  1.44KB 复制新文件或目录到容器中
COPY tools/install_gdrcopy.sh /tmp/install_gdrcopy.sh # buildkit
                        
# 2026-04-10 11:36:00  0.00B 定义构建参数
ARG TARGETPLATFORM
                        
# 2026-04-10 11:36:00  0.00B 定义构建参数
ARG GDRCOPY_OS_VERSION=Ubuntu22_04
                        
# 2026-04-10 11:36:00  0.00B 定义构建参数
ARG GDRCOPY_CUDA_VERSION=12.8
                        
# 2026-04-10 11:36:00  319.35KB 执行命令并创建新的镜像层
RUN |7 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 /bin/sh -c python3 <<'PYEOF'
from flashinfer.jit import env as jit_env
from flashinfer.jit.cubin_loader import download_trtllm_headers, get_cubin
from flashinfer.artifacts import ArtifactPath, CheckSumHash

download_trtllm_headers(
    'bmm',
    jit_env.FLASHINFER_CUBIN_DIR / 'flashinfer' / 'trtllm' / 'batched_gemm' / 'trtllmGen_bmm_export',
    f'{ArtifactPath.TRTLLM_GEN_BMM}/include/trtllmGen_bmm_export',
    ArtifactPath.TRTLLM_GEN_BMM,
    get_cubin(f'{ArtifactPath.TRTLLM_GEN_BMM}/checksums.txt', CheckSumHash.TRTLLM_GEN_BMM),
)

print('FlashInfer TRTLLM BMM headers downloaded successfully')
PYEOF # buildkit
                        
# 2026-04-10 11:35:55  6.42GB 执行命令并创建新的镜像层
RUN |7 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 /bin/sh -c uv pip install --system flashinfer-jit-cache==${FLASHINFER_VERSION}         --extra-index-url https://flashinfer.ai/whl/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.')     && flashinfer show-config # buildkit
                        
# 2026-04-10 11:33:31  0.00B 定义构建参数
ARG FLASHINFER_VERSION=0.6.6
                        
# 2026-04-10 11:33:31  9.05GB 执行命令并创建新的镜像层
RUN |6 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl /bin/sh -c uv pip install --system -r /tmp/requirements-cuda.txt         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.') &&     rm /tmp/requirements-cuda.txt /tmp/common.txt # buildkit
                        
# 2026-04-10 11:32:00  702.00B 复制新文件或目录到容器中
COPY requirements/cuda.txt /tmp/requirements-cuda.txt # buildkit
                        
# 2026-04-10 11:32:00  2.92KB 复制新文件或目录到容器中
COPY requirements/common.txt /tmp/common.txt # buildkit
                        
# 2026-04-10 11:31:58  0.00B 定义构建参数
ARG PYTORCH_CUDA_INDEX_BASE_URL
                        
# 2026-04-10 11:31:58  0.00B 设置环境变量 VLLM_ENABLE_CUDA_COMPATIBILITY
ENV VLLM_ENABLE_CUDA_COMPATIBILITY=0
                        
# 2026-04-10 11:31:58  0.00B 设置环境变量 UV_LINK_MODE
ENV UV_LINK_MODE=copy
                        
# 2026-04-10 11:31:58  0.00B 设置环境变量 UV_INDEX_STRATEGY
ENV UV_INDEX_STRATEGY=unsafe-best-match
                        
# 2026-04-10 11:31:58  0.00B 设置环境变量 UV_HTTP_TIMEOUT
ENV UV_HTTP_TIMEOUT=500
                        
# 2026-04-10 11:31:58  76.78MB 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c python3 -m pip install uv # buildkit
                        
# 2026-04-10 11:31:55  2.36GB 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c CUDA_VERSION_DASH=$(echo $CUDA_VERSION | cut -d. -f1,2 | tr '.' '-') &&     apt-get update -y &&     apt-get install -y --no-install-recommends         cuda-nvcc-${CUDA_VERSION_DASH}         cuda-cudart-${CUDA_VERSION_DASH}         cuda-nvrtc-${CUDA_VERSION_DASH}         cuda-cuobjdump-${CUDA_VERSION_DASH}         libcurand-dev-${CUDA_VERSION_DASH}         libcublas-${CUDA_VERSION_DASH}         libnccl-dev &&     rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2026-04-10 11:30:22  575.13MB 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c apt-get update -y     && apt-get install -y --no-install-recommends         software-properties-common         curl         sudo         ffmpeg         libsm6         libxext6         libgl1     && if [ ! -z ${DEADSNAKES_MIRROR_URL} ] ; then         if [ ! -z "${DEADSNAKES_GPGKEY_URL}" ] ; then             mkdir -p -m 0755 /etc/apt/keyrings ;             curl -L ${DEADSNAKES_GPGKEY_URL} | gpg --dearmor > /etc/apt/keyrings/deadsnakes.gpg ;             sudo chmod 644 /etc/apt/keyrings/deadsnakes.gpg ;             echo "deb [signed-by=/etc/apt/keyrings/deadsnakes.gpg] ${DEADSNAKES_MIRROR_URL} $(lsb_release -cs) main" > /etc/apt/sources.list.d/deadsnakes.list ;         fi ;     else         for i in 1 2 3; do             add-apt-repository -y ppa:deadsnakes/ppa && break ||             { echo "Attempt $i failed, retrying in 5s..."; sleep 5; };         done ;     fi     && apt-get update -y     && apt-get install -y --no-install-recommends         python${PYTHON_VERSION}         python${PYTHON_VERSION}-dev         python${PYTHON_VERSION}-venv         libibverbs-dev     && rm -rf /var/lib/apt/lists/*     && update-alternatives --install /usr/bin/python3 python3 /usr/bin/python${PYTHON_VERSION} 1     && update-alternatives --set python3 /usr/bin/python${PYTHON_VERSION}     && ln -sf /usr/bin/python${PYTHON_VERSION}-config /usr/bin/python3-config     && rm -f /usr/lib/python${PYTHON_VERSION}/EXTERNALLY-MANAGED     && curl -sS ${GET_PIP_URL} | python${PYTHON_VERSION}     && python3 --version && python3 -m pip --version # buildkit
                        
# 2026-04-10 11:25:47  136.00B 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c PYTHON_VERSION_STR=$(echo ${PYTHON_VERSION} | sed 's/\.//g') &&     echo "export PYTHON_VERSION_STR=${PYTHON_VERSION_STR}" >> /etc/environment # buildkit
                        
# 2026-04-10 11:25:46  0.00B 设置工作目录为/vllm-workspace
WORKDIR /vllm-workspace
                        
# 2026-04-10 11:25:46  0.00B 设置环境变量 DEBIAN_FRONTEND
ENV DEBIAN_FRONTEND=noninteractive
                        
# 2026-04-10 11:25:46  0.00B 定义构建参数
ARG GET_PIP_URL
                        
# 2026-04-10 11:25:46  0.00B 定义构建参数
ARG DEADSNAKES_GPGKEY_URL
                        
# 2026-04-10 11:25:46  0.00B 定义构建参数
ARG DEADSNAKES_MIRROR_URL
                        
# 2026-04-10 11:25:46  0.00B 定义构建参数
ARG PYTHON_VERSION
                        
# 2026-04-10 11:25:46  0.00B 定义构建参数
ARG CUDA_VERSION
                        
# 2025-07-19 04:13:38  0.00B 设置环境变量 NVIDIA_DRIVER_CAPABILITIES
ENV NVIDIA_DRIVER_CAPABILITIES=compute,utility
                        
# 2025-07-19 04:13:38  0.00B 设置环境变量 NVIDIA_VISIBLE_DEVICES
ENV NVIDIA_VISIBLE_DEVICES=all
                        
# 2025-07-19 04:13:38  17.29KB 复制新文件或目录到容器中
COPY NGC-DL-CONTAINER-LICENSE / # buildkit
                        
# 2025-07-19 04:13:38  0.00B 设置环境变量 LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/cuda/lib64
                        
# 2025-07-19 04:13:38  0.00B 设置环境变量 PATH
ENV PATH=/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
                        
# 2025-07-19 04:13:38  22.00B 执行命令并创建新的镜像层
RUN |1 TARGETARCH=arm64 /bin/sh -c echo "/usr/local/cuda/lib64" >> /etc/ld.so.conf.d/nvidia.conf # buildkit
                        
# 2025-07-19 04:13:38  293.79MB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=arm64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     cuda-cudart-12-9=${NV_CUDA_CUDART_VERSION}     cuda-compat-12-9     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-07-19 04:12:35  0.00B 设置环境变量 CUDA_VERSION
ENV CUDA_VERSION=12.9.1
                        
# 2025-07-19 04:12:35  10.22MB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=arm64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     gnupg2 curl ca-certificates &&     curl -fsSLO https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/${NVARCH}/cuda-keyring_1.1-1_all.deb &&     dpkg -i cuda-keyring_1.1-1_all.deb &&     apt-get purge --autoremove -y curl     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-07-19 04:12:35  0.00B 添加元数据标签
LABEL maintainer=NVIDIA CORPORATION <cudatools@nvidia.com>
                        
# 2025-07-19 04:12:35  0.00B 定义构建参数
ARG TARGETARCH
                        
# 2025-07-19 04:12:35  0.00B 设置环境变量 NV_CUDA_CUDART_VERSION
ENV NV_CUDA_CUDART_VERSION=12.9.79-1
                        
# 2025-07-19 04:12:35  0.00B 设置环境变量 NVIDIA_REQUIRE_CUDA brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand
ENV NVIDIA_REQUIRE_CUDA=cuda>=12.9 brand=unknown,driver>=535,driver<536 brand=grid,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=vapps,driver>=535,driver<536 brand=vpc,driver>=535,driver<536 brand=vcs,driver>=535,driver<536 brand=vws,driver>=535,driver<536 brand=cloudgaming,driver>=535,driver<536 brand=unknown,driver>=550,driver<551 brand=grid,driver>=550,driver<551 brand=tesla,driver>=550,driver<551 brand=nvidia,driver>=550,driver<551 brand=quadro,driver>=550,driver<551 brand=quadrortx,driver>=550,driver<551 brand=nvidiartx,driver>=550,driver<551 brand=vapps,driver>=550,driver<551 brand=vpc,driver>=550,driver<551 brand=vcs,driver>=550,driver<551 brand=vws,driver>=550,driver<551 brand=cloudgaming,driver>=550,driver<551 brand=unknown,driver>=560,driver<561 brand=grid,driver>=560,driver<561 brand=tesla,driver>=560,driver<561 brand=nvidia,driver>=560,driver<561 brand=quadro,driver>=560,driver<561 brand=quadrortx,driver>=560,driver<561 brand=nvidiartx,driver>=560,driver<561 brand=vapps,driver>=560,driver<561 brand=vpc,driver>=560,driver<561 brand=vcs,driver>=560,driver<561 brand=vws,driver>=560,driver<561 brand=cloudgaming,driver>=560,driver<561 brand=unknown,driver>=565,driver<566 brand=grid,driver>=565,driver<566 brand=tesla,driver>=565,driver<566 brand=nvidia,driver>=565,driver<566 brand=quadro,driver>=565,driver<566 brand=quadrortx,driver>=565,driver<566 brand=nvidiartx,driver>=565,driver<566 brand=vapps,driver>=565,driver<566 brand=vpc,driver>=565,driver<566 brand=vcs,driver>=565,driver<566 brand=vws,driver>=565,driver<566 brand=cloudgaming,driver>=565,driver<566 brand=unknown,driver>=570,driver<571 brand=grid,driver>=570,driver<571 brand=tesla,driver>=570,driver<571 brand=nvidia,driver>=570,driver<571 brand=quadro,driver>=570,driver<571 brand=quadrortx,driver>=570,driver<571 brand=nvidiartx,driver>=570,driver<571 brand=vapps,driver>=570,driver<571 brand=vpc,driver>=570,driver<571 brand=vcs,driver>=570,driver<571 brand=vws,driver>=570,driver<571 brand=cloudgaming,driver>=570,driver<571
                        
# 2025-07-19 04:12:35  0.00B 设置环境变量 NVARCH
ENV NVARCH=sbsa
                        
# 2025-07-15 00:34:17  0.00B 
/bin/sh -c #(nop)  CMD ["/bin/bash"]
                        
# 2025-07-15 00:34:16  69.24MB 
/bin/sh -c #(nop) ADD file:834c1c2a4c7753a37759a9b21f64a990572643eee18669b1faa782fee45355ed in / 
                        
# 2025-07-15 00:34:14  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.version=22.04
                        
# 2025-07-15 00:34:14  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.ref.name=ubuntu
                        
# 2025-07-15 00:34:14  0.00B 
/bin/sh -c #(nop)  ARG LAUNCHPAD_BUILD_ARCH
                        
# 2025-07-15 00:34:14  0.00B 
/bin/sh -c #(nop)  ARG RELEASE
                        
                    

镜像信息

{
    "Id": "sha256:045b622b6bdc689fa46568f3b21a18af7e67b30d58c06647d9f6e2b61ac01185",
    "RepoTags": [
        "vllm/vllm-openai:gemma4",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-linuxarm64"
    ],
    "RepoDigests": [
        "vllm/vllm-openai@sha256:952594c2e6ded02be6e3f195eb95d5a50c5f2a44a879427b2f77c32e0a732a79",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai@sha256:fc842636d4565f0c80de2419b3083237d7c978128c8157670c1e692eb385f964"
    ],
    "Parent": "",
    "Comment": "buildkit.dockerfile.v0",
    "Created": "2026-04-10T03:53:01.415837076Z",
    "Container": "",
    "ContainerConfig": null,
    "DockerVersion": "",
    "Author": "",
    "Config": {
        "Hostname": "",
        "Domainname": "",
        "User": "",
        "AttachStdin": false,
        "AttachStdout": false,
        "AttachStderr": false,
        "Tty": false,
        "OpenStdin": false,
        "StdinOnce": false,
        "Env": [
            "PATH=/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
            "NVARCH=sbsa",
            "NVIDIA_REQUIRE_CUDA=cuda\u003e=12.9 brand=unknown,driver\u003e=535,driver\u003c536 brand=grid,driver\u003e=535,driver\u003c536 brand=tesla,driver\u003e=535,driver\u003c536 brand=nvidia,driver\u003e=535,driver\u003c536 brand=quadro,driver\u003e=535,driver\u003c536 brand=quadrortx,driver\u003e=535,driver\u003c536 brand=nvidiartx,driver\u003e=535,driver\u003c536 brand=vapps,driver\u003e=535,driver\u003c536 brand=vpc,driver\u003e=535,driver\u003c536 brand=vcs,driver\u003e=535,driver\u003c536 brand=vws,driver\u003e=535,driver\u003c536 brand=cloudgaming,driver\u003e=535,driver\u003c536 brand=unknown,driver\u003e=550,driver\u003c551 brand=grid,driver\u003e=550,driver\u003c551 brand=tesla,driver\u003e=550,driver\u003c551 brand=nvidia,driver\u003e=550,driver\u003c551 brand=quadro,driver\u003e=550,driver\u003c551 brand=quadrortx,driver\u003e=550,driver\u003c551 brand=nvidiartx,driver\u003e=550,driver\u003c551 brand=vapps,driver\u003e=550,driver\u003c551 brand=vpc,driver\u003e=550,driver\u003c551 brand=vcs,driver\u003e=550,driver\u003c551 brand=vws,driver\u003e=550,driver\u003c551 brand=cloudgaming,driver\u003e=550,driver\u003c551 brand=unknown,driver\u003e=560,driver\u003c561 brand=grid,driver\u003e=560,driver\u003c561 brand=tesla,driver\u003e=560,driver\u003c561 brand=nvidia,driver\u003e=560,driver\u003c561 brand=quadro,driver\u003e=560,driver\u003c561 brand=quadrortx,driver\u003e=560,driver\u003c561 brand=nvidiartx,driver\u003e=560,driver\u003c561 brand=vapps,driver\u003e=560,driver\u003c561 brand=vpc,driver\u003e=560,driver\u003c561 brand=vcs,driver\u003e=560,driver\u003c561 brand=vws,driver\u003e=560,driver\u003c561 brand=cloudgaming,driver\u003e=560,driver\u003c561 brand=unknown,driver\u003e=565,driver\u003c566 brand=grid,driver\u003e=565,driver\u003c566 brand=tesla,driver\u003e=565,driver\u003c566 brand=nvidia,driver\u003e=565,driver\u003c566 brand=quadro,driver\u003e=565,driver\u003c566 brand=quadrortx,driver\u003e=565,driver\u003c566 brand=nvidiartx,driver\u003e=565,driver\u003c566 brand=vapps,driver\u003e=565,driver\u003c566 brand=vpc,driver\u003e=565,driver\u003c566 brand=vcs,driver\u003e=565,driver\u003c566 brand=vws,driver\u003e=565,driver\u003c566 brand=cloudgaming,driver\u003e=565,driver\u003c566 brand=unknown,driver\u003e=570,driver\u003c571 brand=grid,driver\u003e=570,driver\u003c571 brand=tesla,driver\u003e=570,driver\u003c571 brand=nvidia,driver\u003e=570,driver\u003c571 brand=quadro,driver\u003e=570,driver\u003c571 brand=quadrortx,driver\u003e=570,driver\u003c571 brand=nvidiartx,driver\u003e=570,driver\u003c571 brand=vapps,driver\u003e=570,driver\u003c571 brand=vpc,driver\u003e=570,driver\u003c571 brand=vcs,driver\u003e=570,driver\u003c571 brand=vws,driver\u003e=570,driver\u003c571 brand=cloudgaming,driver\u003e=570,driver\u003c571",
            "NV_CUDA_CUDART_VERSION=12.9.79-1",
            "CUDA_VERSION=12.9.1",
            "LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64",
            "NVIDIA_VISIBLE_DEVICES=all",
            "NVIDIA_DRIVER_CAPABILITIES=compute,utility",
            "DEBIAN_FRONTEND=noninteractive",
            "UV_HTTP_TIMEOUT=500",
            "UV_INDEX_STRATEGY=unsafe-best-match",
            "UV_LINK_MODE=copy",
            "VLLM_ENABLE_CUDA_COMPATIBILITY=0",
            "TORCH_CUDA_ARCH_LIST=8.7 8.9 9.0 10.0+PTX 12.0",
            "VLLM_USAGE_SOURCE=production-docker-image"
        ],
        "Cmd": null,
        "Image": "",
        "Volumes": null,
        "WorkingDir": "/vllm-workspace",
        "Entrypoint": [
            "vllm",
            "serve"
        ],
        "OnBuild": null,
        "Labels": {
            "maintainer": "NVIDIA CORPORATION \u003ccudatools@nvidia.com\u003e",
            "org.opencontainers.image.ref.name": "ubuntu",
            "org.opencontainers.image.version": "22.04"
        }
    },
    "Architecture": "arm64",
    "Os": "linux",
    "Size": 23196084663,
    "GraphDriver": {
        "Data": {
            "LowerDir": "/var/lib/docker/overlay2/4a50d6f2dec2b09323754164da9c2a004b369f68379d7ffd7f6d1642aae62b22/diff:/var/lib/docker/overlay2/23e9e1380e27ac2ee2ed0029efff0fe7c19aa217a5bcdc0277dbc0857225bd05/diff:/var/lib/docker/overlay2/4d40174e600c53f9f1c5de4b914ecfa207fc1a5b15b42c62d9f130ac1d674ae0/diff:/var/lib/docker/overlay2/58c5e7680812045c049883ad83040609090524e8b7d7cde41453bdaf6657e2ec/diff:/var/lib/docker/overlay2/cdc21fb57a64f946ff2bff29f5b8ee8db1d1802b036ca427482ed15c926cf20d/diff:/var/lib/docker/overlay2/3c9e3ac96caf35c3a83c54d70eaff9cd5a3ed62b49cc21138c75efa387de072a/diff:/var/lib/docker/overlay2/3c676a6e605a30c3e92da5dae57377fb6702c3c3899e5b0f2140da87f6913403/diff:/var/lib/docker/overlay2/121f7bbe511453fd637b4e8d8c6e11c574aabc8c53795f2e51ea7d048758dc47/diff:/var/lib/docker/overlay2/c93d57ea977adc0169687fe5ed6bbaf103ff72dc4dccad631782d01ea41b4a46/diff:/var/lib/docker/overlay2/88a1917ea6f4927009562a3c9099749ff8a6cb2f0fdd8e7599266e56417e7e7d/diff:/var/lib/docker/overlay2/11147daa31971316fcc8a10e24a4df1ba175236e20f1545188b7cdeb63f28854/diff:/var/lib/docker/overlay2/47c3c69864412f37266c866a261c5a5f68d6e8ee66148a16b2da77589856cf43/diff:/var/lib/docker/overlay2/8ef55efdabcea4126c3dc1f57269f17c605f8f259c42098579f6ca322fcbae26/diff:/var/lib/docker/overlay2/37cdff4a8e93ebc58504942d555b384d9244038fe128d658e37a75f5761b6d2c/diff:/var/lib/docker/overlay2/67d8597483b7830eb2ab4b906c6f63cd56536ab85e3bf883d3c997544c2878b5/diff:/var/lib/docker/overlay2/b0a307afb80e1acee4d99c973c4f4fcf1614fe84f44b00d91c357a0d41b7f20c/diff:/var/lib/docker/overlay2/cd6a5b0a5fab3c0684b185ead784858baaf57a116f72831300c5b75869137584/diff:/var/lib/docker/overlay2/b87a763377bf69384a7f9fd2ca5ca813283b585fd885c2fb575969afcb702ea6/diff:/var/lib/docker/overlay2/fb37a0bd59efb6cfe7347645f392ded3d14354de420cfb67357e50569deac6f2/diff:/var/lib/docker/overlay2/712f0c8d4df631c06feb1404bd8e3b013dfd657be6d5fee34a027a3a731af786/diff:/var/lib/docker/overlay2/f45414a5c72882db4aef8a40ef20284c88204c7ffec2bd03a04989be3bc6617e/diff:/var/lib/docker/overlay2/083f8492959a8773bcf6a8bf93531ea1cbb429d27b031a8a11a051b3ea651eb5/diff:/var/lib/docker/overlay2/ea833ab9d9aaf5038787ac632e3755974139d20f89f06db4787d93fa0c48bde0/diff:/var/lib/docker/overlay2/43576a64fe606c2727b9b07a890fdee3d735643898415e5be15d1bba7d9ce234/diff:/var/lib/docker/overlay2/de36b177e608e8c1428e9690db292cacfa65e932a807bdc5debda7c2f0e372b3/diff:/var/lib/docker/overlay2/e4221bd322d0c17f1d09649d479e04c0d678879a50160824a6e439d2ff581b75/diff:/var/lib/docker/overlay2/af78776d10f27d03de20f2f4c6cd392fac170747de1d1686b83d74a412605149/diff",
            "MergedDir": "/var/lib/docker/overlay2/31974b7aed4965f29b8a9870753eeada218d692b2e383da01dd7d37725d6dd89/merged",
            "UpperDir": "/var/lib/docker/overlay2/31974b7aed4965f29b8a9870753eeada218d692b2e383da01dd7d37725d6dd89/diff",
            "WorkDir": "/var/lib/docker/overlay2/31974b7aed4965f29b8a9870753eeada218d692b2e383da01dd7d37725d6dd89/work"
        },
        "Name": "overlay2"
    },
    "RootFS": {
        "Type": "layers",
        "Layers": [
            "sha256:e437eb226c5d6bbd2c93a89e3708b0c005e0132aa682f109591fca659de658d1",
            "sha256:4c03e8e391d1443f071b295f97260e45b92b3df5f2a248c369434a143a5be1b1",
            "sha256:fdc01e9ff8a2f5cbe11111373e4b5f60a22ece493d2ebb3066e55a0bbabd1fd2",
            "sha256:226cc82ba2cb7de7e7639ceac53fa837b5dda11bcd040041637cd27df8153e35",
            "sha256:455bca42f6ec40ca42fea4bea15c6c17b97101af90413cdd25647de2b9d98960",
            "sha256:c9466eff986ebb084d1dd332465ef3da3f239897111aef901fec0d09d6018e0c",
            "sha256:efa5ffbbc9850b47d87fdd764ec2f239ff5bccdeb9a29cf66f6c6b97abb94b28",
            "sha256:d6671f2fd0690151aa8413ec35d92fe3b320948cbc27789fc1e7edc66759fb19",
            "sha256:b9dea79bafa9b37149fba6fa43110c7645bb757c0e3765540e4a975792b9695e",
            "sha256:8649b6012ffa747573f0aae08ea890a420225b4abd168b714609a69401bd10ec",
            "sha256:fad6b3547370466ae429677c0914c71ba3c2fcb0e75384f0c98598d3135291bb",
            "sha256:5b2a244c81331e6d19c5bdd0d667d5c1cfd2c52c258db326bd46e6cd4ff8a35d",
            "sha256:afc8c16163450948496c8f5f79800d30f2c096cfdc6faf60cce96e53f43d066e",
            "sha256:bb581ac22169ffb4ecdb18a7fb261b84252bdcf2dc03cb4d8b48472082edb62e",
            "sha256:faf8b47f27cce6c3361173449d5282fcf952b686ab4a2b238839167dce8bd7f2",
            "sha256:862779e0d5de1696846fc4454a47a7a937ed6b33cc4fedb6b9cffa76e02d50b3",
            "sha256:3816564cc7a7b3277daff3987af96e7fa5a204cb0aa1d6054393c0b90765b701",
            "sha256:5e8f0629584ce4639c2a518fc5f77a50f73ec18c7e138b16a3f9bcb46a765218",
            "sha256:e0391d418f4cf12c5f34917a292127cfd60667fb734bdd43c73cc3b68f259cf8",
            "sha256:68a4f8fafbbd4fd67cc0e51cbba3beaf1c53211728806cf54c4ff0dc136bb9f3",
            "sha256:c6dffb1b083a7695c312f2d8f4cdef24c9741fbffa51b8685fdb865e87f26ea2",
            "sha256:5948808253344efeac1c601bb2e9409468908a09fd930a7cca27d5693eed3aa0",
            "sha256:5eea66c0291b392bf6b5697354ab38168b4691fed999b59c9b5551aef8eb305f",
            "sha256:111189efe70e7764735c3189cbae89bbcd4ae456a4920010fc1bd3640e99a1b7",
            "sha256:dedb1021d939122cd91a5a6f843c24b84a936b6504d031d0887461aa86da6576",
            "sha256:38215ecf2b85e490a7e3f01249782e89a9dd9973c157f2dd9eeeb9c58c6f0573",
            "sha256:d868167b60683394a4d9722d13ef60ad5488913f4d0d37e4d45b628fea9217e1",
            "sha256:77354d60f4ccee392705b63c8bf7f2ec9f225c1185455557b916922071c015b8"
        ]
    },
    "Metadata": {
        "LastTagTime": "2026-04-13T02:53:00.653826875+08:00"
    }
}

更多版本

docker.io/vllm/vllm-openai:v0.5.4

linux/amd64 docker.io9.90GB2024-09-07 06:20
2573

docker.io/vllm/vllm-openai:v0.6.0

linux/amd64 docker.io9.72GB2024-09-11 01:51
1646

docker.io/vllm/vllm-openai:v0.6.1.post2

linux/amd64 docker.io9.81GB2024-09-24 01:43
1199

docker.io/vllm/vllm-openai:latest

linux/amd64 docker.io10.24GB2024-10-11 00:43
7290

docker.io/vllm/vllm-openai:v0.6.4.post1

linux/amd64 docker.io10.64GB2024-11-19 00:42
1250

docker.io/vllm/vllm-openai:v0.6.4

linux/amd64 docker.io10.64GB2024-12-11 02:08
953

docker.io/vllm/vllm-openai:v0.6.3

linux/amd64 docker.io10.43GB2024-12-12 02:41
1189

docker.io/vllm/vllm-openai:v0.6.6

linux/amd64 docker.io10.23GB2025-01-04 00:37
1621

docker.io/vllm/vllm-openai:v0.6.6.post1

linux/amd64 docker.io10.23GB2025-01-24 00:21
1206

docker.io/vllm/vllm-openai:v0.7.1

linux/amd64 docker.io16.53GB2025-02-08 02:05
1254

docker.io/vllm/vllm-openai:v0.7.2

linux/amd64 docker.io16.53GB2025-02-09 00:28
2977

docker.io/vllm/vllm-openai:v0.7.3

linux/amd64 docker.io16.43GB2025-02-24 00:50
3742

docker.io/vllm/vllm-openai:v0.8.0

linux/amd64 docker.io16.62GB2025-03-20 00:23
1506

docker.io/vllm/vllm-openai:v0.8.1

linux/amd64 docker.io16.62GB2025-03-21 00:28
1242

docker.io/vllm/vllm-openai:v0.8.2

linux/amd64 docker.io16.92GB2025-03-27 01:12
1465

docker.io/vllm/vllm-openai:v0.8.3

linux/amd64 docker.io17.13GB2025-04-08 00:58
1490

docker.io/vllm/vllm-openai:v0.8.4

linux/amd64 docker.io17.16GB2025-04-17 01:16
1934

docker.io/vllm/vllm-openai:v0.8.5

linux/amd64 docker.io17.30GB2025-04-30 02:45
3461

docker.io/vllm/vllm-openai:v0.8.5.post1

linux/amd64 docker.io17.30GB2025-05-07 02:06
3394

docker.io/vllm/vllm-openai:v0.9.0.1

linux/amd64 docker.io20.81GB2025-06-05 01:12
2249

docker.io/vllm/vllm-openai:v0.9.1

linux/amd64 docker.io20.85GB2025-06-12 01:29
2989

docker.io/vllm/vllm-openai:v0.9.2

linux/amd64 docker.io20.76GB2025-07-09 03:00
7099

docker.io/vllm/vllm-openai:v0.10.0

linux/amd64 docker.io26.13GB2025-07-26 03:15
1917

docker.io/vllm/vllm-openai:gptoss

linux/amd64 docker.io33.86GB2025-08-07 01:52
1399

docker.io/vllm/vllm-openai:v0.10.1

linux/amd64 docker.io20.25GB2025-08-20 03:05
1355

docker.io/vllm/vllm-openai:v0.10.1.1

linux/amd64 docker.io20.26GB2025-08-23 01:43
2207

docker.io/vllm/vllm-openai:v0.10.2

linux/amd64 docker.io22.49GB2025-09-16 03:40
1656

docker.io/vllm/vllm-openai:v0.2.7

linux/amd64 docker.io6.34GB2025-10-01 01:07
471

docker.io/vllm/vllm-openai:v0.11.0-x86_64

linux/amd64 docker.io25.86GB2025-10-09 02:14
2342

docker.io/vllm/vllm-openai:v0.10.2-x86_64

linux/amd64 docker.io22.49GB2025-10-09 02:22
591

docker.io/vllm/vllm-openai:v0.11.0

linux/amd64 docker.io25.86GB2025-10-09 11:24
2353

docker.io/vllm/vllm-openai:v0.11.0

linux/arm64 docker.io24.17GB2025-10-30 00:47
1001

docker.io/vllm/vllm-openai:v0.3.3

linux/amd64 docker.io9.13GB2025-11-18 01:01
380

docker.io/vllm/vllm-openai:v0.11.1

linux/amd64 docker.io28.72GB2025-11-21 01:03
803

docker.io/vllm/vllm-openai:v0.11.2

linux/amd64 docker.io28.82GB2025-11-22 00:46
1447

docker.io/vllm/vllm-openai:v0.11.1

linux/arm64 docker.io26.54GB2025-11-22 01:23
445

docker.io/vllm/vllm-openai:v0.4.0

linux/amd64 docker.io9.88GB2025-11-22 01:58
443

docker.io/vllm/vllm-openai:v0.11.2

linux/arm64 docker.io26.54GB2025-11-22 04:06
636

docker.io/vllm/vllm-openai:nightly

linux/amd64 docker.io18.74GB2025-12-03 02:43
1340

docker.io/vllm/vllm-openai:v0.12.0-aarch64

linux/arm64 docker.io17.89GB2025-12-05 03:12
552

docker.io/vllm/vllm-openai:v0.12.0

linux/amd64 docker.io19.47GB2025-12-05 03:59
1942

docker.io/vllm/vllm-openai:v0.13.0

linux/amd64 docker.io19.51GB2026-01-22 01:41
561

docker.io/vllm/vllm-openai:v0.14.0

linux/amd64 docker.io19.66GB2026-01-22 03:16
608

docker.io/vllm/vllm-openai:v0.14.1

linux/amd64 docker.io19.69GB2026-01-27 01:52
751

docker.io/vllm/vllm-openai:v0.15.0

linux/amd64 docker.io20.13GB2026-01-31 00:51
873
525

docker.io/vllm/vllm-openai:v0.15.1

linux/amd64 docker.io20.14GB2026-02-06 01:14
821

docker.io/vllm/vllm-openai:v0.15.1-cu130

linux/amd64 docker.io18.77GB2026-02-07 00:39
747

docker.io/vllm/vllm-openai:latest

linux/arm64 docker.io20.65GB2026-02-08 00:59
431

docker.io/vllm/vllm-openai:v0.15.1-aarch64-cu130

linux/arm64 docker.io19.60GB2026-02-10 00:44
609

docker.io/vllm/vllm-openai:glm5

linux/amd64 docker.io20.27GB2026-02-14 00:56
391

docker.io/vllm/vllm-openai:qwen3_5

linux/amd64 docker.io20.93GB2026-02-25 01:18
1186

docker.io/vllm/vllm-openai:qwen3_5-x86_64-cu129

linux/amd64 docker.io20.93GB2026-02-27 00:33
504
329

docker.io/vllm/vllm-openai:v0.16.0

linux/amd64 docker.io20.37GB2026-02-28 01:04
678

docker.io/vllm/vllm-openai:v0.16.0-cu130

linux/amd64 docker.io19.01GB2026-02-28 02:55
465
323

docker.io/vllm/vllm-openai:v0.5.1

linux/amd64 docker.io10.40GB2026-03-06 01:01
126

docker.io/vllm/vllm-openai:v0.4.0.post1

linux/amd64 docker.io9.88GB2026-03-06 01:29
144

docker.io/vllm/vllm-openai:cu130-nightly

linux/amd64 docker.io19.55GB2026-03-06 02:04
315
223

docker.io/vllm/vllm-openai:v0.17.0

linux/amd64 docker.io20.75GB2026-03-08 02:27
1079

docker.io/vllm/vllm-openai:v0.17.0-cu130

linux/amd64 docker.io19.55GB2026-03-10 01:39
284

docker.io/vllm/vllm-openai:v0.17.0

linux/arm64 docker.io21.50GB2026-03-11 01:42
362

docker.io/vllm/vllm-openai:v0.4.3

linux/amd64 docker.io7.86GB2026-03-11 02:04
133

docker.io/vllm/vllm-openai:v0.13.0

linux/arm64 docker.io17.98GB2026-03-11 03:52
142

docker.io/vllm/vllm-openai:v0.17.1-cu130

linux/amd64 docker.io19.55GB2026-03-13 02:22
350

docker.io/vllm/vllm-openai:v0.17.1

linux/amd64 docker.io20.75GB2026-03-14 02:34
566

docker.io/vllm/vllm-openai:v0.14.0

linux/arm64 docker.io20.19GB2026-03-17 02:20
147

docker.io/vllm/vllm-openai:v0.18.0

linux/amd64 docker.io22.40GB2026-03-23 01:43
639

docker.io/vllm/vllm-openai:v0.18.0-cu130

linux/amd64 docker.io19.65GB2026-03-24 01:29
425

docker.io/vllm/vllm-openai-cpu:latest-x86_64

linux/amd64 docker.io3.48GB2026-03-28 10:31
88

docker.io/vllm/vllm-openai-cpu:latest-arm64

linux/arm64 docker.io2.23GB2026-03-28 10:34
95

docker.io/vllm/vllm-openai-rocm:latest

linux/amd64 docker.io24.49GB2026-03-30 00:46
111

docker.io/vllm/vllm-openai:latest-cu130

linux/amd64 docker.io19.65GB2026-03-31 00:47
222

docker.io/vllm/vllm-openai:v0.18.1-cu130

linux/amd64 docker.io19.66GB2026-04-02 00:39
131

docker.io/vllm/vllm-openai:v0.18.1

linux/amd64 docker.io22.41GB2026-04-02 01:12
190

docker.io/vllm/vllm-openai:v0.19.0-ubuntu2404

linux/amd64 docker.io22.35GB2026-04-04 02:18
183

docker.io/vllm/vllm-openai:gemma4-cu130

linux/amd64 docker.io21.18GB2026-04-04 03:13
593

docker.io/vllm/vllm-openai:v0.19.0

linux/amd64 docker.io22.41GB2026-04-04 03:47
390

docker.io/vllm/vllm-openai-cpu:v0.19.0

linux/amd64 docker.io3.49GB2026-04-06 07:48
143

docker.io/vllm/vllm-openai-cpu:v0.19.0

linux/arm64 docker.io2.25GB2026-04-06 07:51
61

docker.io/vllm/vllm-openai:v0.19.0-cu130-ubuntu2404

linux/amd64 docker.io19.74GB2026-04-08 03:21
138

docker.io/vllm/vllm-openai:gemma4

linux/amd64 docker.io23.92GB2026-04-09 00:39
136
55

docker.io/vllm/vllm-openai:qwen3_5-cu130

linux/amd64 docker.io19.60GB2026-04-11 00:51
72

docker.io/vllm/vllm-openai:v0.19.0-cu130

linux/amd64 docker.io19.66GB2026-04-12 00:29
54

docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130

linux/amd64 docker.io19.72GB2026-04-12 01:06
40

docker.io/vllm/vllm-openai:gemma4-cu130

linux/arm64 docker.io20.43GB2026-04-13 01:49
12

docker.io/vllm/vllm-openai:v0.19.0-cu130

linux/arm64 docker.io20.37GB2026-04-13 02:18
14

docker.io/vllm/vllm-openai:gemma4

linux/arm64 docker.io23.20GB2026-04-13 03:21
7