docker.io/vllm/vllm-openai:gemma4-cu130 linux/arm64

docker.io/vllm/vllm-openai:gemma4-cu130 - 国内下载镜像源 浏览次数:454 温馨提示: 这是一个 linux/arm64 系统架构镜像
这是镜像描述:

vllm/openai

基于 OpenAI 的 GPT-3 模型的 API 服务,支持自然语言处理等功能。

源镜像 docker.io/vllm/vllm-openai:gemma4-cu130
国内镜像 swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-cu130-linuxarm64
镜像ID sha256:db59febc6c476c9c93701cc9f54917533cee774d4e4bb27eff6fafeae37509f8
镜像TAG gemma4-cu130-linuxarm64
大小 20.43GB
镜像源 docker.io
项目信息 Docker-Hub主页 🚀项目TAG 🚀
CMD
启动入口 vllm serve
工作目录 /vllm-workspace
OS/平台 linux/arm64
浏览量 454 次
贡献者
镜像创建 2026-04-10T03:47:24.766139086Z
同步时间 2026-04-13 01:49
环境变量
PATH=/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin NVARCH=sbsa NVIDIA_REQUIRE_CUDA=cuda>=13.0 brand=unknown,driver>=535,driver<536 brand=grid,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=vapps,driver>=535,driver<536 brand=vpc,driver>=535,driver<536 brand=vcs,driver>=535,driver<536 brand=vws,driver>=535,driver<536 brand=cloudgaming,driver>=535,driver<536 brand=unknown,driver>=550,driver<551 brand=grid,driver>=550,driver<551 brand=tesla,driver>=550,driver<551 brand=nvidia,driver>=550,driver<551 brand=quadro,driver>=550,driver<551 brand=quadrortx,driver>=550,driver<551 brand=nvidiartx,driver>=550,driver<551 brand=vapps,driver>=550,driver<551 brand=vpc,driver>=550,driver<551 brand=vcs,driver>=550,driver<551 brand=vws,driver>=550,driver<551 brand=cloudgaming,driver>=550,driver<551 brand=unknown,driver>=565,driver<566 brand=grid,driver>=565,driver<566 brand=tesla,driver>=565,driver<566 brand=nvidia,driver>=565,driver<566 brand=quadro,driver>=565,driver<566 brand=quadrortx,driver>=565,driver<566 brand=nvidiartx,driver>=565,driver<566 brand=vapps,driver>=565,driver<566 brand=vpc,driver>=565,driver<566 brand=vcs,driver>=565,driver<566 brand=vws,driver>=565,driver<566 brand=cloudgaming,driver>=565,driver<566 brand=unknown,driver>=570,driver<571 brand=grid,driver>=570,driver<571 brand=tesla,driver>=570,driver<571 brand=nvidia,driver>=570,driver<571 brand=quadro,driver>=570,driver<571 brand=quadrortx,driver>=570,driver<571 brand=nvidiartx,driver>=570,driver<571 brand=vapps,driver>=570,driver<571 brand=vpc,driver>=570,driver<571 brand=vcs,driver>=570,driver<571 brand=vws,driver>=570,driver<571 brand=cloudgaming,driver>=570,driver<571 brand=unknown,driver>=575,driver<576 brand=grid,driver>=575,driver<576 brand=tesla,driver>=575,driver<576 brand=nvidia,driver>=575,driver<576 brand=quadro,driver>=575,driver<576 brand=quadrortx,driver>=575,driver<576 brand=nvidiartx,driver>=575,driver<576 brand=vapps,driver>=575,driver<576 brand=vpc,driver>=575,driver<576 brand=vcs,driver>=575,driver<576 brand=vws,driver>=575,driver<576 brand=cloudgaming,driver>=575,driver<576 NV_CUDA_CUDART_VERSION=13.0.88-1 CUDA_VERSION=13.0.1 LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64 NVIDIA_VISIBLE_DEVICES=all NVIDIA_DRIVER_CAPABILITIES=compute,utility DEBIAN_FRONTEND=noninteractive UV_HTTP_TIMEOUT=500 UV_INDEX_STRATEGY=unsafe-best-match UV_LINK_MODE=copy VLLM_ENABLE_CUDA_COMPATIBILITY=0 TORCH_CUDA_ARCH_LIST=8.7 8.9 9.0 10.0+PTX 12.0 12.1 VLLM_USAGE_SOURCE=production-docker-image
镜像标签
NVIDIA CORPORATION <cudatools@nvidia.com>: maintainer ubuntu: org.opencontainers.image.ref.name 22.04: org.opencontainers.image.version

Docker拉取命令

docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-cu130-linuxarm64
docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-cu130-linuxarm64  docker.io/vllm/vllm-openai:gemma4-cu130

Containerd拉取命令

ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-cu130-linuxarm64
ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-cu130-linuxarm64  docker.io/vllm/vllm-openai:gemma4-cu130

Shell快速替换命令

sed -i 's#vllm/vllm-openai:gemma4-cu130#swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-cu130-linuxarm64#' deployment.yaml

Ansible快速分发-Docker

#ansible k8s -m shell -a 'docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-cu130-linuxarm64 && docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-cu130-linuxarm64  docker.io/vllm/vllm-openai:gemma4-cu130'

Ansible快速分发-Containerd

#ansible k8s -m shell -a 'ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-cu130-linuxarm64 && ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-cu130-linuxarm64  docker.io/vllm/vllm-openai:gemma4-cu130'

镜像构建历史


# 2026-04-10 11:47:24  0.00B 配置容器启动时运行的命令
ENTRYPOINT ["vllm" "serve"]
                        
# 2026-04-10 11:47:24  0.00B 设置环境变量 VLLM_USAGE_SOURCE
ENV VLLM_USAGE_SOURCE=production-docker-image
                        
# 2026-04-10 11:47:24  1.65GB 执行命令并创建新的镜像层
RUN |8 TARGETPLATFORM=linux/arm64 INSTALL_KV_CONNECTORS=true CUDA_VERSION=13.0.1 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= torch_cuda_arch_list=8.7 8.9 9.0 10.0+PTX 12.0 12.1 /bin/sh -c CUDA_MAJOR="${CUDA_VERSION%%.*}";     CUDA_VERSION_DASH=$(echo $CUDA_VERSION | cut -d. -f1,2 | tr '.' '-');     CUDA_HOME=/usr/local/cuda;     BUILD_PKGS="libcusparse-dev-${CUDA_VERSION_DASH}                 libcublas-dev-${CUDA_VERSION_DASH}                 libcusolver-dev-${CUDA_VERSION_DASH}";     if [ "$INSTALL_KV_CONNECTORS" = "true" ]; then         if [ "$CUDA_MAJOR" -ge 13 ]; then             uv pip install --system nixl-cu13;         fi;         uv pip install --system -r /tmp/kv_connectors.txt --no-build || (             apt-get update -y &&             apt-get install -y --no-install-recommends ${BUILD_PKGS} &&             uv pip install --system -r /tmp/kv_connectors.txt --no-build-isolation &&             apt-get purge -y ${BUILD_PKGS} &&             rm -rf /var/lib/apt/lists/*         );     fi # buildkit
                        
# 2026-04-10 11:43:51  0.00B 设置环境变量 TORCH_CUDA_ARCH_LIST
ENV TORCH_CUDA_ARCH_LIST=8.7 8.9 9.0 10.0+PTX 12.0 12.1
                        
# 2026-04-10 11:43:51  0.00B 定义构建参数
ARG torch_cuda_arch_list=7.0 7.5 8.0 8.9 9.0 10.0 12.0
                        
# 2026-04-10 11:43:51  0.00B 设置环境变量 UV_HTTP_TIMEOUT
ENV UV_HTTP_TIMEOUT=500
                        
# 2026-04-10 11:43:51  0.00B 定义构建参数
ARG PIP_EXTRA_INDEX_URL UV_EXTRA_INDEX_URL
                        
# 2026-04-10 11:43:51  0.00B 定义构建参数
ARG PIP_INDEX_URL UV_INDEX_URL
                        
# 2026-04-10 11:43:51  0.00B 定义构建参数
ARG CUDA_VERSION
                        
# 2026-04-10 11:43:51  0.00B 定义构建参数
ARG INSTALL_KV_CONNECTORS=false
                        
# 2026-04-10 11:43:51  0.00B 定义构建参数
ARG TARGETPLATFORM
                        
# 2026-04-10 11:43:51  27.84KB 复制新文件或目录到容器中
COPY ./vllm/collect_env.py . # buildkit
                        
# 2026-04-10 11:43:51  958.90KB 复制新文件或目录到容器中
COPY benchmarks benchmarks # buildkit
                        
# 2026-04-10 11:43:51  1.21MB 复制新文件或目录到容器中
COPY examples examples # buildkit
                        
# 2026-04-10 11:43:50  0.00B 设置环境变量 LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64
                        
# 2026-04-10 11:43:50  29.57MB 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c uv pip install --system ep_kernels/dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.') # buildkit
                        
# 2026-04-10 11:43:50  0.00B 设置环境变量 LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/cuda/lib64:/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64
                        
# 2026-04-10 11:43:50  49.14MB 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c sh -c 'if ls /tmp/deepgemm/dist/*.whl >/dev/null 2>&1; then               uv pip install --system /tmp/deepgemm/dist/*.whl;            else               echo "No DeepGEMM wheels to install; skipping.";            fi' # buildkit
                        
# 2026-04-10 11:43:48  44.49MB 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c uv pip install --system "transformers==5.5.0" # buildkit
                        
# 2026-04-10 11:43:47  0.00B 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c . /etc/environment && uv pip list # buildkit
                        
# 2026-04-10 11:43:46  757.14MB 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c if [ "${PYTORCH_NIGHTLY}" = "1" ]; then         echo "Installing torch nightly..."         && uv pip install --system $(cat torch_lib_versions.txt | xargs) --pre         --index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/nightly/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.')         && echo "Installing vLLM..."         && uv pip install --system dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/nightly/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.');     else         echo "Installing vLLM..."         && uv pip install --system dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.');     fi # buildkit
                        
# 2026-04-10 11:34:16  71.00B 复制新文件或目录到容器中
COPY /workspace/torch_lib_versions.txt torch_lib_versions.txt # buildkit
                        
# 2026-04-10 11:34:16  0.00B 定义构建参数
ARG PYTORCH_NIGHTLY
                        
# 2026-04-10 11:34:16  0.00B 定义构建参数
ARG PIP_KEYRING_PROVIDER UV_KEYRING_PROVIDER
                        
# 2026-04-10 11:34:16  0.00B 定义构建参数
ARG PYTORCH_CUDA_INDEX_BASE_URL
                        
# 2026-04-10 11:34:16  0.00B 定义构建参数
ARG PIP_EXTRA_INDEX_URL UV_EXTRA_INDEX_URL
                        
# 2026-04-10 11:34:16  0.00B 定义构建参数
ARG PIP_INDEX_URL UV_INDEX_URL
                        
# 2026-04-10 11:34:16  498.83MB 执行命令并创建新的镜像层
RUN |14 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 /bin/sh -c if [ "$TARGETPLATFORM" = "linux/arm64" ]; then         BITSANDBYTES_VERSION="${BITSANDBYTES_VERSION_ARM64}";     else         BITSANDBYTES_VERSION="${BITSANDBYTES_VERSION_X86}";     fi;     uv pip install --system accelerate hf_transfer modelscope         "bitsandbytes>=${BITSANDBYTES_VERSION}" "timm${TIMM_VERSION}" "runai-model-streamer[s3,gcs,azure]${RUNAI_MODEL_STREAMER_VERSION}" # buildkit
                        
# 2026-04-10 11:34:08  0.00B 定义构建参数
ARG RUNAI_MODEL_STREAMER_VERSION=>=0.15.7
                        
# 2026-04-10 11:34:08  0.00B 定义构建参数
ARG TIMM_VERSION=>=1.0.17
                        
# 2026-04-10 11:34:08  0.00B 定义构建参数
ARG BITSANDBYTES_VERSION_ARM64=0.42.0
                        
# 2026-04-10 11:34:08  0.00B 定义构建参数
ARG BITSANDBYTES_VERSION_X86=0.46.1
                        
# 2026-04-10 11:34:08  2.40MB 执行命令并创建新的镜像层
RUN |10 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/arm64 /bin/sh -c set -eux;     case "${TARGETPLATFORM}" in       linux/arm64) UUARCH="aarch64" ;;       linux/amd64) UUARCH="x64" ;;       *) echo "Unsupported TARGETPLATFORM: ${TARGETPLATFORM}" >&2; exit 1 ;;     esac;     /tmp/install_gdrcopy.sh "${GDRCOPY_OS_VERSION}" "${GDRCOPY_CUDA_VERSION}" "${UUARCH}" &&     rm /tmp/install_gdrcopy.sh # buildkit
                        
# 2026-04-10 11:33:59  1.44KB 复制新文件或目录到容器中
COPY tools/install_gdrcopy.sh /tmp/install_gdrcopy.sh # buildkit
                        
# 2026-04-10 11:33:59  0.00B 定义构建参数
ARG TARGETPLATFORM
                        
# 2026-04-10 11:33:59  0.00B 定义构建参数
ARG GDRCOPY_OS_VERSION=Ubuntu22_04
                        
# 2026-04-10 11:33:59  0.00B 定义构建参数
ARG GDRCOPY_CUDA_VERSION=12.8
                        
# 2026-04-10 11:33:59  319.35KB 执行命令并创建新的镜像层
RUN |7 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 /bin/sh -c python3 <<'PYEOF'
from flashinfer.jit import env as jit_env
from flashinfer.jit.cubin_loader import download_trtllm_headers, get_cubin
from flashinfer.artifacts import ArtifactPath, CheckSumHash

download_trtllm_headers(
    'bmm',
    jit_env.FLASHINFER_CUBIN_DIR / 'flashinfer' / 'trtllm' / 'batched_gemm' / 'trtllmGen_bmm_export',
    f'{ArtifactPath.TRTLLM_GEN_BMM}/include/trtllmGen_bmm_export',
    ArtifactPath.TRTLLM_GEN_BMM,
    get_cubin(f'{ArtifactPath.TRTLLM_GEN_BMM}/checksums.txt', CheckSumHash.TRTLLM_GEN_BMM),
)

print('FlashInfer TRTLLM BMM headers downloaded successfully')
PYEOF # buildkit
                        
# 2026-04-10 11:33:55  7.42GB 执行命令并创建新的镜像层
RUN |7 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 /bin/sh -c uv pip install --system flashinfer-jit-cache==${FLASHINFER_VERSION}         --extra-index-url https://flashinfer.ai/whl/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.')     && flashinfer show-config # buildkit
                        
# 2026-04-10 11:31:13  0.00B 定义构建参数
ARG FLASHINFER_VERSION=0.6.6
                        
# 2026-04-10 11:31:13  6.76GB 执行命令并创建新的镜像层
RUN |6 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl /bin/sh -c uv pip install --system -r /tmp/requirements-cuda.txt         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.') &&     rm /tmp/requirements-cuda.txt /tmp/common.txt # buildkit
                        
# 2026-04-10 11:30:09  702.00B 复制新文件或目录到容器中
COPY requirements/cuda.txt /tmp/requirements-cuda.txt # buildkit
                        
# 2026-04-10 11:30:03  2.92KB 复制新文件或目录到容器中
COPY requirements/common.txt /tmp/common.txt # buildkit
                        
# 2026-04-10 11:30:01  0.00B 定义构建参数
ARG PYTORCH_CUDA_INDEX_BASE_URL
                        
# 2026-04-10 11:30:01  0.00B 设置环境变量 VLLM_ENABLE_CUDA_COMPATIBILITY
ENV VLLM_ENABLE_CUDA_COMPATIBILITY=0
                        
# 2026-04-10 11:30:01  0.00B 设置环境变量 UV_LINK_MODE
ENV UV_LINK_MODE=copy
                        
# 2026-04-10 11:30:01  0.00B 设置环境变量 UV_INDEX_STRATEGY
ENV UV_INDEX_STRATEGY=unsafe-best-match
                        
# 2026-04-10 11:30:01  0.00B 设置环境变量 UV_HTTP_TIMEOUT
ENV UV_HTTP_TIMEOUT=500
                        
# 2026-04-10 11:30:01  76.78MB 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c python3 -m pip install uv # buildkit
                        
# 2026-04-10 11:29:58  2.19GB 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c CUDA_VERSION_DASH=$(echo $CUDA_VERSION | cut -d. -f1,2 | tr '.' '-') &&     apt-get update -y &&     apt-get install -y --no-install-recommends         cuda-nvcc-${CUDA_VERSION_DASH}         cuda-cudart-${CUDA_VERSION_DASH}         cuda-nvrtc-${CUDA_VERSION_DASH}         cuda-cuobjdump-${CUDA_VERSION_DASH}         libcurand-dev-${CUDA_VERSION_DASH}         libcublas-${CUDA_VERSION_DASH}         libnccl-dev &&     rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2026-04-10 11:28:50  567.85MB 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c apt-get update -y     && apt-get install -y --no-install-recommends         software-properties-common         curl         sudo         ffmpeg         libsm6         libxext6         libgl1     && if [ ! -z ${DEADSNAKES_MIRROR_URL} ] ; then         if [ ! -z "${DEADSNAKES_GPGKEY_URL}" ] ; then             mkdir -p -m 0755 /etc/apt/keyrings ;             curl -L ${DEADSNAKES_GPGKEY_URL} | gpg --dearmor > /etc/apt/keyrings/deadsnakes.gpg ;             sudo chmod 644 /etc/apt/keyrings/deadsnakes.gpg ;             echo "deb [signed-by=/etc/apt/keyrings/deadsnakes.gpg] ${DEADSNAKES_MIRROR_URL} $(lsb_release -cs) main" > /etc/apt/sources.list.d/deadsnakes.list ;         fi ;     else         for i in 1 2 3; do             add-apt-repository -y ppa:deadsnakes/ppa && break ||             { echo "Attempt $i failed, retrying in 5s..."; sleep 5; };         done ;     fi     && apt-get update -y     && apt-get install -y --no-install-recommends         python${PYTHON_VERSION}         python${PYTHON_VERSION}-dev         python${PYTHON_VERSION}-venv         libibverbs-dev     && rm -rf /var/lib/apt/lists/*     && update-alternatives --install /usr/bin/python3 python3 /usr/bin/python${PYTHON_VERSION} 1     && update-alternatives --set python3 /usr/bin/python${PYTHON_VERSION}     && ln -sf /usr/bin/python${PYTHON_VERSION}-config /usr/bin/python3-config     && rm -f /usr/lib/python${PYTHON_VERSION}/EXTERNALLY-MANAGED     && curl -sS ${GET_PIP_URL} | python${PYTHON_VERSION}     && python3 --version && python3 -m pip --version # buildkit
                        
# 2026-04-10 11:26:05  136.00B 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c PYTHON_VERSION_STR=$(echo ${PYTHON_VERSION} | sed 's/\.//g') &&     echo "export PYTHON_VERSION_STR=${PYTHON_VERSION_STR}" >> /etc/environment # buildkit
                        
# 2026-04-10 11:25:27  0.00B 设置工作目录为/vllm-workspace
WORKDIR /vllm-workspace
                        
# 2026-04-10 11:25:27  0.00B 设置环境变量 DEBIAN_FRONTEND
ENV DEBIAN_FRONTEND=noninteractive
                        
# 2026-04-10 11:25:27  0.00B 定义构建参数
ARG GET_PIP_URL
                        
# 2026-04-10 11:25:27  0.00B 定义构建参数
ARG DEADSNAKES_GPGKEY_URL
                        
# 2026-04-10 11:25:27  0.00B 定义构建参数
ARG DEADSNAKES_MIRROR_URL
                        
# 2026-04-10 11:25:27  0.00B 定义构建参数
ARG PYTHON_VERSION
                        
# 2026-04-10 11:25:27  0.00B 定义构建参数
ARG CUDA_VERSION
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 NVIDIA_DRIVER_CAPABILITIES
ENV NVIDIA_DRIVER_CAPABILITIES=compute,utility
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 NVIDIA_VISIBLE_DEVICES
ENV NVIDIA_VISIBLE_DEVICES=all
                        
# 2025-09-09 01:23:07  17.29KB 复制新文件或目录到容器中
COPY NGC-DL-CONTAINER-LICENSE / # buildkit
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 PATH
ENV PATH=/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
                        
# 2025-09-09 01:23:07  22.00B 执行命令并创建新的镜像层
RUN |1 TARGETARCH=arm64 /bin/sh -c echo "/usr/local/cuda/lib64" >> /etc/ld.so.conf.d/nvidia.conf # buildkit
                        
# 2025-09-09 01:23:07  303.50MB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=arm64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     cuda-cudart-13-0=${NV_CUDA_CUDART_VERSION}     cuda-compat-13-0     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 CUDA_VERSION
ENV CUDA_VERSION=13.0.1
                        
# 2025-09-09 01:23:07  10.21MB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=arm64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     gnupg2 curl ca-certificates &&     curl -fsSLO https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/${NVARCH}/cuda-keyring_1.1-1_all.deb &&     dpkg -i cuda-keyring_1.1-1_all.deb &&     apt-get purge --autoremove -y curl     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-09-09 01:23:07  0.00B 添加元数据标签
LABEL maintainer=NVIDIA CORPORATION <cudatools@nvidia.com>
                        
# 2025-09-09 01:23:07  0.00B 定义构建参数
ARG TARGETARCH
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 NV_CUDA_CUDART_VERSION
ENV NV_CUDA_CUDART_VERSION=13.0.88-1
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 NVIDIA_REQUIRE_CUDA brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand
ENV NVIDIA_REQUIRE_CUDA=cuda>=13.0 brand=unknown,driver>=535,driver<536 brand=grid,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=vapps,driver>=535,driver<536 brand=vpc,driver>=535,driver<536 brand=vcs,driver>=535,driver<536 brand=vws,driver>=535,driver<536 brand=cloudgaming,driver>=535,driver<536 brand=unknown,driver>=550,driver<551 brand=grid,driver>=550,driver<551 brand=tesla,driver>=550,driver<551 brand=nvidia,driver>=550,driver<551 brand=quadro,driver>=550,driver<551 brand=quadrortx,driver>=550,driver<551 brand=nvidiartx,driver>=550,driver<551 brand=vapps,driver>=550,driver<551 brand=vpc,driver>=550,driver<551 brand=vcs,driver>=550,driver<551 brand=vws,driver>=550,driver<551 brand=cloudgaming,driver>=550,driver<551 brand=unknown,driver>=565,driver<566 brand=grid,driver>=565,driver<566 brand=tesla,driver>=565,driver<566 brand=nvidia,driver>=565,driver<566 brand=quadro,driver>=565,driver<566 brand=quadrortx,driver>=565,driver<566 brand=nvidiartx,driver>=565,driver<566 brand=vapps,driver>=565,driver<566 brand=vpc,driver>=565,driver<566 brand=vcs,driver>=565,driver<566 brand=vws,driver>=565,driver<566 brand=cloudgaming,driver>=565,driver<566 brand=unknown,driver>=570,driver<571 brand=grid,driver>=570,driver<571 brand=tesla,driver>=570,driver<571 brand=nvidia,driver>=570,driver<571 brand=quadro,driver>=570,driver<571 brand=quadrortx,driver>=570,driver<571 brand=nvidiartx,driver>=570,driver<571 brand=vapps,driver>=570,driver<571 brand=vpc,driver>=570,driver<571 brand=vcs,driver>=570,driver<571 brand=vws,driver>=570,driver<571 brand=cloudgaming,driver>=570,driver<571 brand=unknown,driver>=575,driver<576 brand=grid,driver>=575,driver<576 brand=tesla,driver>=575,driver<576 brand=nvidia,driver>=575,driver<576 brand=quadro,driver>=575,driver<576 brand=quadrortx,driver>=575,driver<576 brand=nvidiartx,driver>=575,driver<576 brand=vapps,driver>=575,driver<576 brand=vpc,driver>=575,driver<576 brand=vcs,driver>=575,driver<576 brand=vws,driver>=575,driver<576 brand=cloudgaming,driver>=575,driver<576
                        
# 2025-09-09 01:23:07  0.00B 设置环境变量 NVARCH
ENV NVARCH=sbsa
                        
# 2025-08-20 01:21:19  0.00B 
/bin/sh -c #(nop)  CMD ["/bin/bash"]
                        
# 2025-08-20 01:21:19  69.25MB 
/bin/sh -c #(nop) ADD file:5f2c65daac761cc691b34ee3e3e2ba42ec520d71fc59aef131d38058a7891ab8 in / 
                        
# 2025-08-20 01:21:17  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.version=22.04
                        
# 2025-08-20 01:21:17  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.ref.name=ubuntu
                        
# 2025-08-20 01:21:17  0.00B 
/bin/sh -c #(nop)  ARG LAUNCHPAD_BUILD_ARCH
                        
# 2025-08-20 01:21:17  0.00B 
/bin/sh -c #(nop)  ARG RELEASE
                        
                    

镜像信息

{
    "Id": "sha256:db59febc6c476c9c93701cc9f54917533cee774d4e4bb27eff6fafeae37509f8",
    "RepoTags": [
        "vllm/vllm-openai:gemma4-cu130",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-cu130-linuxarm64"
    ],
    "RepoDigests": [
        "vllm/vllm-openai@sha256:0d152595cd940ea1e0890fa190a37b7a86e3f7f0f5048ac3e2e4c14529fea833",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai@sha256:30ad1a12cda59a59c0e5a893eb07d1fc947b1460873653e632e2ad22b2f1b1d2"
    ],
    "Parent": "",
    "Comment": "buildkit.dockerfile.v0",
    "Created": "2026-04-10T03:47:24.766139086Z",
    "Container": "",
    "ContainerConfig": null,
    "DockerVersion": "",
    "Author": "",
    "Config": {
        "Hostname": "",
        "Domainname": "",
        "User": "",
        "AttachStdin": false,
        "AttachStdout": false,
        "AttachStderr": false,
        "Tty": false,
        "OpenStdin": false,
        "StdinOnce": false,
        "Env": [
            "PATH=/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
            "NVARCH=sbsa",
            "NVIDIA_REQUIRE_CUDA=cuda\u003e=13.0 brand=unknown,driver\u003e=535,driver\u003c536 brand=grid,driver\u003e=535,driver\u003c536 brand=tesla,driver\u003e=535,driver\u003c536 brand=nvidia,driver\u003e=535,driver\u003c536 brand=quadro,driver\u003e=535,driver\u003c536 brand=quadrortx,driver\u003e=535,driver\u003c536 brand=nvidiartx,driver\u003e=535,driver\u003c536 brand=vapps,driver\u003e=535,driver\u003c536 brand=vpc,driver\u003e=535,driver\u003c536 brand=vcs,driver\u003e=535,driver\u003c536 brand=vws,driver\u003e=535,driver\u003c536 brand=cloudgaming,driver\u003e=535,driver\u003c536 brand=unknown,driver\u003e=550,driver\u003c551 brand=grid,driver\u003e=550,driver\u003c551 brand=tesla,driver\u003e=550,driver\u003c551 brand=nvidia,driver\u003e=550,driver\u003c551 brand=quadro,driver\u003e=550,driver\u003c551 brand=quadrortx,driver\u003e=550,driver\u003c551 brand=nvidiartx,driver\u003e=550,driver\u003c551 brand=vapps,driver\u003e=550,driver\u003c551 brand=vpc,driver\u003e=550,driver\u003c551 brand=vcs,driver\u003e=550,driver\u003c551 brand=vws,driver\u003e=550,driver\u003c551 brand=cloudgaming,driver\u003e=550,driver\u003c551 brand=unknown,driver\u003e=565,driver\u003c566 brand=grid,driver\u003e=565,driver\u003c566 brand=tesla,driver\u003e=565,driver\u003c566 brand=nvidia,driver\u003e=565,driver\u003c566 brand=quadro,driver\u003e=565,driver\u003c566 brand=quadrortx,driver\u003e=565,driver\u003c566 brand=nvidiartx,driver\u003e=565,driver\u003c566 brand=vapps,driver\u003e=565,driver\u003c566 brand=vpc,driver\u003e=565,driver\u003c566 brand=vcs,driver\u003e=565,driver\u003c566 brand=vws,driver\u003e=565,driver\u003c566 brand=cloudgaming,driver\u003e=565,driver\u003c566 brand=unknown,driver\u003e=570,driver\u003c571 brand=grid,driver\u003e=570,driver\u003c571 brand=tesla,driver\u003e=570,driver\u003c571 brand=nvidia,driver\u003e=570,driver\u003c571 brand=quadro,driver\u003e=570,driver\u003c571 brand=quadrortx,driver\u003e=570,driver\u003c571 brand=nvidiartx,driver\u003e=570,driver\u003c571 brand=vapps,driver\u003e=570,driver\u003c571 brand=vpc,driver\u003e=570,driver\u003c571 brand=vcs,driver\u003e=570,driver\u003c571 brand=vws,driver\u003e=570,driver\u003c571 brand=cloudgaming,driver\u003e=570,driver\u003c571 brand=unknown,driver\u003e=575,driver\u003c576 brand=grid,driver\u003e=575,driver\u003c576 brand=tesla,driver\u003e=575,driver\u003c576 brand=nvidia,driver\u003e=575,driver\u003c576 brand=quadro,driver\u003e=575,driver\u003c576 brand=quadrortx,driver\u003e=575,driver\u003c576 brand=nvidiartx,driver\u003e=575,driver\u003c576 brand=vapps,driver\u003e=575,driver\u003c576 brand=vpc,driver\u003e=575,driver\u003c576 brand=vcs,driver\u003e=575,driver\u003c576 brand=vws,driver\u003e=575,driver\u003c576 brand=cloudgaming,driver\u003e=575,driver\u003c576",
            "NV_CUDA_CUDART_VERSION=13.0.88-1",
            "CUDA_VERSION=13.0.1",
            "LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64",
            "NVIDIA_VISIBLE_DEVICES=all",
            "NVIDIA_DRIVER_CAPABILITIES=compute,utility",
            "DEBIAN_FRONTEND=noninteractive",
            "UV_HTTP_TIMEOUT=500",
            "UV_INDEX_STRATEGY=unsafe-best-match",
            "UV_LINK_MODE=copy",
            "VLLM_ENABLE_CUDA_COMPATIBILITY=0",
            "TORCH_CUDA_ARCH_LIST=8.7 8.9 9.0 10.0+PTX 12.0 12.1",
            "VLLM_USAGE_SOURCE=production-docker-image"
        ],
        "Cmd": null,
        "Image": "",
        "Volumes": null,
        "WorkingDir": "/vllm-workspace",
        "Entrypoint": [
            "vllm",
            "serve"
        ],
        "OnBuild": null,
        "Labels": {
            "maintainer": "NVIDIA CORPORATION \u003ccudatools@nvidia.com\u003e",
            "org.opencontainers.image.ref.name": "ubuntu",
            "org.opencontainers.image.version": "22.04"
        }
    },
    "Architecture": "arm64",
    "Os": "linux",
    "Size": 20430040572,
    "GraphDriver": {
        "Data": {
            "LowerDir": "/var/lib/docker/overlay2/652edc0aca9944f8dc5f6f028a598ef74cf4ff8e49ff1c1c019ab4497e38c175/diff:/var/lib/docker/overlay2/51955565e33b2ccc7a018afc278350284edb6c6275867fe10456cb271f1ca15b/diff:/var/lib/docker/overlay2/7bdf47c874f6400ffb6613dca7c517612a57db886854ffc02901ed41cddcfa82/diff:/var/lib/docker/overlay2/291b4e2925b52bd6bb188c68fbaf5cdafae2af1e8cccf6f63f248df6dcaa854a/diff:/var/lib/docker/overlay2/dd667104cd452fe547df5679db255592bf253f87844e8959c38d405689989cf3/diff:/var/lib/docker/overlay2/d4df166197faf84ea7ffd8eeca4dea6cc602759c78127e9cd852566b8bbe6a23/diff:/var/lib/docker/overlay2/f49e15acfb45113253f99a16e25071ec2dbb18028fc9739125b919d2ab86c080/diff:/var/lib/docker/overlay2/432efb7fbafe1485bd2c4e5e9c13724ce66ce7341565113f86d11c3a90dbaa48/diff:/var/lib/docker/overlay2/003ede6b1857c7e770a1a18113a42eb88ad5e106bc34df568425c79ff7e9e6dc/diff:/var/lib/docker/overlay2/3706d6a915f765615997386e07afa1cca5f77fc4c195df1a949ed887e8c90a7c/diff:/var/lib/docker/overlay2/0dfcd21772df5ac3d8ec9fd81ec138ab9b80fce376c74e50f1a60f381fdfddfa/diff:/var/lib/docker/overlay2/ef90e6fd273e7604a8bc532b59598a4d4a11b3d3f6a8d39f3cc92134e8e8b9b8/diff:/var/lib/docker/overlay2/bc78ef74d86a600be66c3254f02646e86d7d2f8985217c29172a038e5c5c9ef6/diff:/var/lib/docker/overlay2/d955f7e146b83bcbd49c0e30da2fe3ed8d422d6cf5f0dbad48e43d9e75161348/diff:/var/lib/docker/overlay2/1e391da1edbf92233d27a93b8fce5fb108549e803825e1f26f371b3b085cd255/diff:/var/lib/docker/overlay2/1a20f1ed1bab19223d8e361547ef0ea7f1ea28e91848770beff72764e3aa86c9/diff:/var/lib/docker/overlay2/bac83f429445f1ee564a3be11cfc206da18ad8dd7b7baddcffb1fb2e1ab49abc/diff:/var/lib/docker/overlay2/bc46fd69c807162d710fb16d9965637c965b19359089f01e1b25784fabb55e4e/diff:/var/lib/docker/overlay2/8203dd111a8ffbe3b59c4bd1ad0db32698b29be49f98e7f972516673c2783e14/diff:/var/lib/docker/overlay2/ef218d57b71f50b77895fb0a6ad5b7917147520afa92dedca017f55bc7b7fafc/diff:/var/lib/docker/overlay2/b0743e123bd436553b9368cf32bee2cbbf114d069c498be46eafbd5a02c76c85/diff:/var/lib/docker/overlay2/5651ebce10e3d18149b07dc94be354ea9a9eca08cc7b13fccffff57237c186a2/diff:/var/lib/docker/overlay2/dcc6f01f832085b32052b9b40a7221251e8ce7503229b0b11c0725c07929a130/diff:/var/lib/docker/overlay2/a679cca47049455766757a93abb76da8a827e3531f8a012a484ca6d89ff035c0/diff:/var/lib/docker/overlay2/c4db7faea88d1c0405b7b22a43d4dcfef0e68bbf3c5edf4180eeaf7c5e633646/diff:/var/lib/docker/overlay2/91d0e0a4c54be7c2f607291c5dbf7db6144e91ff8486ac1c9cdcb18239d6feaa/diff:/var/lib/docker/overlay2/9248fa85459ddd0c3ddb534b49599952264a5f8e8940d2bc60bc1a573b9aa95e/diff",
            "MergedDir": "/var/lib/docker/overlay2/b454c38e57105b864d67985dfe8f375d1b4b110e457a99628a950eec221202d0/merged",
            "UpperDir": "/var/lib/docker/overlay2/b454c38e57105b864d67985dfe8f375d1b4b110e457a99628a950eec221202d0/diff",
            "WorkDir": "/var/lib/docker/overlay2/b454c38e57105b864d67985dfe8f375d1b4b110e457a99628a950eec221202d0/work"
        },
        "Name": "overlay2"
    },
    "RootFS": {
        "Type": "layers",
        "Layers": [
            "sha256:3e04ae02e3b48a52cd15f8595efd67e8ab7b3e9766f7d1e12a25c8c76f3a86bf",
            "sha256:71adbbf158ee0f6f368160ea2ed102435749a2720b8eca69dfdb2d1f47a55680",
            "sha256:31dbd0fa0bdc5599759cd67617f924de243a225e1d95776b9e6a5c404f399e43",
            "sha256:4762a460f72d9950e1b8a8b6775e3e631bc418f9f915501879f29f7fec97fcd8",
            "sha256:495bbaf4cc5ca53d8286fe305d2fc2fd3ecb2c8791cfdd4473e984d2eb0f3ea1",
            "sha256:953bce126be7eea59d9fd09118c3333ee615120dbc6c26a4dedcf5681c240a8c",
            "sha256:05dcbef656274f6878607be8e1e582570fe2fa9a6e58ebc406928468bb49aa10",
            "sha256:50403177b54b9acd0cd8893bfb6f5c41769d379018d496bca4e42aa7cc914fab",
            "sha256:bd99863fb9c8531c9f79c2322901aef398e105954aec55beec6b86b656b27c93",
            "sha256:3f3d5c500269acbe191a5be22b324930da1b9ff6556ecfaa5628e19a00a7c986",
            "sha256:facef9f5f9ac1ddd8c6afac64716288371bffabe1d6dd5a3be378420e904abf2",
            "sha256:d19634f1b36694a7d8a9451ce02be815a441d829f5f17393fc11c644c240b626",
            "sha256:7d56f101af9f3652e208beb6f7c807c80f95279a50af8043c1ccb5c57aa1cea7",
            "sha256:efa46cc4596cb003132815ca9a80a264c23f6aa11477561e5720ebb428e786dc",
            "sha256:2afd5fdfba1d8ec8beccedc7c94ae2c0de04ec819222cf03f762c05e1003af33",
            "sha256:3c5e4043fc725c6819d3d5bf7efcfa550844ea52d200a84bea886b50e94f04ba",
            "sha256:0344e55f146f6ca9426303e96f2b8ce4f9c4c93c6f62220455a3bc19b106d2e0",
            "sha256:799d134a873411deefe28c5a392d586992dfbb2bd6c897499047c59adcbe1461",
            "sha256:5925483b4478a94796f803bab70de0a9394e966ea0d38bf0b03fed90cd1e8bef",
            "sha256:b9c2b5defefca985029da8384d1ef6c150dce19c93eaa570149b9e9b3ce97f69",
            "sha256:b171f156a0bf5db7eafd2d92a54040e84d06c1a9ecaa40642cd64392776da16e",
            "sha256:b61a41fb36858e8e72b6c3aa68c7f3ea1e97b5a950b202c3c75f438dd5f3ef05",
            "sha256:83e15d01ed70646098322a159b9f36f4957c7a32a55e6c56f589463140f661d4",
            "sha256:36c948868bbaf85719f83613f74e6726273fd2f3d47b0243dd8414ecb5f6d2ab",
            "sha256:8287e25a6df19359436c325732f61860cc6e74d0296fe7c34d435c7726e6ca14",
            "sha256:2c8e0b57a1a2fe599afa02f671c46e87882192031afefc1ad41b97e3a6952088",
            "sha256:43a916ec3815d8f420060223e1c77b1effdff7c696cdd4773de1cfaefe3315c6",
            "sha256:783eb7e3be80ff9b1499a90b34a8c2dedc7cfa80d567e114c4ac5ffec905e42c"
        ]
    },
    "Metadata": {
        "LastTagTime": "2026-04-13T01:26:23.907145156+08:00"
    }
}

更多版本

docker.io/vllm/vllm-openai:v0.5.4

linux/amd64 docker.io9.90GB2024-09-07 06:20
3237

docker.io/vllm/vllm-openai:v0.6.0

linux/amd64 docker.io9.72GB2024-09-11 01:51
1829

docker.io/vllm/vllm-openai:v0.6.1.post2

linux/amd64 docker.io9.81GB2024-09-24 01:43
1388

docker.io/vllm/vllm-openai:latest

linux/amd64 docker.io10.24GB2024-10-11 00:43
8908

docker.io/vllm/vllm-openai:v0.6.4.post1

linux/amd64 docker.io10.64GB2024-11-19 00:42
1473

docker.io/vllm/vllm-openai:v0.6.4

linux/amd64 docker.io10.64GB2024-12-11 02:08
1151

docker.io/vllm/vllm-openai:v0.6.3

linux/amd64 docker.io10.43GB2024-12-12 02:41
1453

docker.io/vllm/vllm-openai:v0.6.6

linux/amd64 docker.io10.23GB2025-01-04 00:37
1807

docker.io/vllm/vllm-openai:v0.6.6.post1

linux/amd64 docker.io10.23GB2025-01-24 00:21
1549

docker.io/vllm/vllm-openai:v0.7.1

linux/amd64 docker.io16.53GB2025-02-08 02:05
1406

docker.io/vllm/vllm-openai:v0.7.2

linux/amd64 docker.io16.53GB2025-02-09 00:28
3229

docker.io/vllm/vllm-openai:v0.7.3

linux/amd64 docker.io16.43GB2025-02-24 00:50
4019

docker.io/vllm/vllm-openai:v0.8.0

linux/amd64 docker.io16.62GB2025-03-20 00:23
1663

docker.io/vllm/vllm-openai:v0.8.1

linux/amd64 docker.io16.62GB2025-03-21 00:28
1379

docker.io/vllm/vllm-openai:v0.8.2

linux/amd64 docker.io16.92GB2025-03-27 01:12
1600

docker.io/vllm/vllm-openai:v0.8.3

linux/amd64 docker.io17.13GB2025-04-08 00:58
1636

docker.io/vllm/vllm-openai:v0.8.4

linux/amd64 docker.io17.16GB2025-04-17 01:16
2128

docker.io/vllm/vllm-openai:v0.8.5

linux/amd64 docker.io17.30GB2025-04-30 02:45
3723

docker.io/vllm/vllm-openai:v0.8.5.post1

linux/amd64 docker.io17.30GB2025-05-07 02:06
3857

docker.io/vllm/vllm-openai:v0.9.0.1

linux/amd64 docker.io20.81GB2025-06-05 01:12
2606

docker.io/vllm/vllm-openai:v0.9.1

linux/amd64 docker.io20.85GB2025-06-12 01:29
3244

docker.io/vllm/vllm-openai:v0.9.2

linux/amd64 docker.io20.76GB2025-07-09 03:00
7904

docker.io/vllm/vllm-openai:v0.10.0

linux/amd64 docker.io26.13GB2025-07-26 03:15
2179

docker.io/vllm/vllm-openai:gptoss

linux/amd64 docker.io33.86GB2025-08-07 01:52
1568

docker.io/vllm/vllm-openai:v0.10.1

linux/amd64 docker.io20.25GB2025-08-20 03:05
1504

docker.io/vllm/vllm-openai:v0.10.1.1

linux/amd64 docker.io20.26GB2025-08-23 01:43
2532

docker.io/vllm/vllm-openai:v0.10.2

linux/amd64 docker.io22.49GB2025-09-16 03:40
1910

docker.io/vllm/vllm-openai:v0.2.7

linux/amd64 docker.io6.34GB2025-10-01 01:07
560

docker.io/vllm/vllm-openai:v0.11.0-x86_64

linux/amd64 docker.io25.86GB2025-10-09 02:14
2622

docker.io/vllm/vllm-openai:v0.10.2-x86_64

linux/amd64 docker.io22.49GB2025-10-09 02:22
691

docker.io/vllm/vllm-openai:v0.11.0

linux/amd64 docker.io25.86GB2025-10-09 11:24
2685

docker.io/vllm/vllm-openai:v0.11.0

linux/arm64 docker.io24.17GB2025-10-30 00:47
1157

docker.io/vllm/vllm-openai:v0.3.3

linux/amd64 docker.io9.13GB2025-11-18 01:01
515

docker.io/vllm/vllm-openai:v0.11.1

linux/amd64 docker.io28.72GB2025-11-21 01:03
908

docker.io/vllm/vllm-openai:v0.11.2

linux/amd64 docker.io28.82GB2025-11-22 00:46
1967

docker.io/vllm/vllm-openai:v0.11.1

linux/arm64 docker.io26.54GB2025-11-22 01:23
523

docker.io/vllm/vllm-openai:v0.4.0

linux/amd64 docker.io9.88GB2025-11-22 01:58
520

docker.io/vllm/vllm-openai:v0.11.2

linux/arm64 docker.io26.54GB2025-11-22 04:06
724

docker.io/vllm/vllm-openai:nightly

linux/amd64 docker.io18.74GB2025-12-03 02:43
2263

docker.io/vllm/vllm-openai:v0.12.0-aarch64

linux/arm64 docker.io17.89GB2025-12-05 03:12
899

docker.io/vllm/vllm-openai:v0.12.0

linux/amd64 docker.io19.47GB2025-12-05 03:59
2804

docker.io/vllm/vllm-openai:v0.13.0

linux/amd64 docker.io19.51GB2026-01-22 01:41
1472

docker.io/vllm/vllm-openai:v0.14.0

linux/amd64 docker.io19.66GB2026-01-22 03:16
757

docker.io/vllm/vllm-openai:v0.14.1

linux/amd64 docker.io19.69GB2026-01-27 01:52
915

docker.io/vllm/vllm-openai:v0.15.0

linux/amd64 docker.io20.13GB2026-01-31 00:51
1146
632

docker.io/vllm/vllm-openai:v0.15.1

linux/amd64 docker.io20.14GB2026-02-06 01:14
1022

docker.io/vllm/vllm-openai:v0.15.1-cu130

linux/amd64 docker.io18.77GB2026-02-07 00:39
868

docker.io/vllm/vllm-openai:latest

linux/arm64 docker.io20.65GB2026-02-08 00:59
624

docker.io/vllm/vllm-openai:v0.15.1-aarch64-cu130

linux/arm64 docker.io19.60GB2026-02-10 00:44
700

docker.io/vllm/vllm-openai:glm5

linux/amd64 docker.io20.27GB2026-02-14 00:56
610

docker.io/vllm/vllm-openai:qwen3_5

linux/amd64 docker.io20.93GB2026-02-25 01:18
1607

docker.io/vllm/vllm-openai:qwen3_5-x86_64-cu129

linux/amd64 docker.io20.93GB2026-02-27 00:33
671
459

docker.io/vllm/vllm-openai:v0.16.0

linux/amd64 docker.io20.37GB2026-02-28 01:04
860

docker.io/vllm/vllm-openai:v0.16.0-cu130

linux/amd64 docker.io19.01GB2026-02-28 02:55
577
438

docker.io/vllm/vllm-openai:v0.5.1

linux/amd64 docker.io10.40GB2026-03-06 01:01
256

docker.io/vllm/vllm-openai:v0.4.0.post1

linux/amd64 docker.io9.88GB2026-03-06 01:29
322

docker.io/vllm/vllm-openai:cu130-nightly

linux/amd64 docker.io19.55GB2026-03-06 02:04
759
393

docker.io/vllm/vllm-openai:v0.17.0

linux/amd64 docker.io20.75GB2026-03-08 02:27
1709

docker.io/vllm/vllm-openai:v0.17.0-cu130

linux/amd64 docker.io19.55GB2026-03-10 01:39
445

docker.io/vllm/vllm-openai:v0.17.0

linux/arm64 docker.io21.50GB2026-03-11 01:42
488

docker.io/vllm/vllm-openai:v0.4.3

linux/amd64 docker.io7.86GB2026-03-11 02:04
281

docker.io/vllm/vllm-openai:v0.13.0

linux/arm64 docker.io17.98GB2026-03-11 03:52
212

docker.io/vllm/vllm-openai:v0.17.1-cu130

linux/amd64 docker.io19.55GB2026-03-13 02:22
807

docker.io/vllm/vllm-openai:v0.17.1

linux/amd64 docker.io20.75GB2026-03-14 02:34
849

docker.io/vllm/vllm-openai:v0.14.0

linux/arm64 docker.io20.19GB2026-03-17 02:20
259

docker.io/vllm/vllm-openai:v0.18.0

linux/amd64 docker.io22.40GB2026-03-23 01:43
928

docker.io/vllm/vllm-openai:v0.18.0-cu130

linux/amd64 docker.io19.65GB2026-03-24 01:29
696

docker.io/vllm/vllm-openai-cpu:latest-x86_64

linux/amd64 docker.io3.48GB2026-03-28 10:31
284

docker.io/vllm/vllm-openai-cpu:latest-arm64

linux/arm64 docker.io2.23GB2026-03-28 10:34
204

docker.io/vllm/vllm-openai-rocm:latest

linux/amd64 docker.io24.49GB2026-03-30 00:46
339

docker.io/vllm/vllm-openai:latest-cu130

linux/amd64 docker.io19.65GB2026-03-31 00:47
817

docker.io/vllm/vllm-openai:v0.18.1-cu130

linux/amd64 docker.io19.66GB2026-04-02 00:39
280

docker.io/vllm/vllm-openai:v0.18.1

linux/amd64 docker.io22.41GB2026-04-02 01:12
442

docker.io/vllm/vllm-openai:v0.19.0-ubuntu2404

linux/amd64 docker.io22.35GB2026-04-04 02:18
462

docker.io/vllm/vllm-openai:gemma4-cu130

linux/amd64 docker.io21.18GB2026-04-04 03:13
1391

docker.io/vllm/vllm-openai:v0.19.0

linux/amd64 docker.io22.41GB2026-04-04 03:47
989

docker.io/vllm/vllm-openai-cpu:v0.19.0

linux/amd64 docker.io3.49GB2026-04-06 07:48
395

docker.io/vllm/vllm-openai-cpu:v0.19.0

linux/arm64 docker.io2.25GB2026-04-06 07:51
238

docker.io/vllm/vllm-openai:v0.19.0-cu130-ubuntu2404

linux/amd64 docker.io19.74GB2026-04-08 03:21
473

docker.io/vllm/vllm-openai:gemma4

linux/amd64 docker.io23.92GB2026-04-09 00:39
616
336

docker.io/vllm/vllm-openai:qwen3_5-cu130

linux/amd64 docker.io19.60GB2026-04-11 00:51
393

docker.io/vllm/vllm-openai:v0.19.0-cu130

linux/amd64 docker.io19.66GB2026-04-12 00:29
332

docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130

linux/amd64 docker.io19.72GB2026-04-12 01:06
395

docker.io/vllm/vllm-openai:gemma4-cu130

linux/arm64 docker.io20.43GB2026-04-13 01:49
453

docker.io/vllm/vllm-openai:v0.19.0-cu130

linux/arm64 docker.io20.37GB2026-04-13 02:18
333

docker.io/vllm/vllm-openai:gemma4

linux/arm64 docker.io23.20GB2026-04-13 03:21
302

docker.io/vllm/vllm-openai:cu130-nightly-aarch64

linux/arm64 docker.io21.48GB2026-04-14 00:35
330

docker.io/vllm/vllm-openai:minimax27

linux/amd64 docker.io24.06GB2026-04-14 01:09
357

docker.io/vllm/vllm-openai:glm51

linux/amd64 docker.io22.46GB2026-04-16 03:22
383

docker.io/vllm/vllm-openai:glm51

linux/arm64 docker.io23.19GB2026-04-17 01:16
156

docker.io/vllm/vllm-openai:glm51-cu130

linux/amd64 docker.io19.71GB2026-04-17 01:41
346

docker.io/vllm/vllm-openai-cpu:v0.19.0-x86_64

linux/amd64 docker.io3.49GB2026-04-17 23:45
241
148

docker.io/vllm/vllm-openai:v0.19.1-cu130

linux/amd64 docker.io19.60GB2026-04-20 00:28
979