docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130 linux/amd64

docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130 - China mirror download source
Image description:

vllm/vllm-openai

vLLM's OpenAI-compatible API server image for serving large language models; the container entrypoint is vllm serve.

Source image docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130
China mirror swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130
Image ID sha256:bda20f266f801e6b7079b2ec074bf172b6d10280b9df109e21b51e0bbc12dbfd
Image tag gemma4-0409-x86_64-cu130
Size 19.72GB
Registry docker.io
CMD (none)
Entrypoint vllm serve
Working directory /vllm-workspace
OS/platform linux/amd64
Image created 2026-04-10T05:19:15.821299241Z
Synced at 2026-04-12 01:06
Environment variables
PATH=/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
NVARCH=x86_64
NVIDIA_REQUIRE_CUDA=cuda>=13.0 brand=unknown,driver>=535,driver<536 brand=grid,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=vapps,driver>=535,driver<536 brand=vpc,driver>=535,driver<536 brand=vcs,driver>=535,driver<536 brand=vws,driver>=535,driver<536 brand=cloudgaming,driver>=535,driver<536 brand=unknown,driver>=550,driver<551 brand=grid,driver>=550,driver<551 brand=tesla,driver>=550,driver<551 brand=nvidia,driver>=550,driver<551 brand=quadro,driver>=550,driver<551 brand=quadrortx,driver>=550,driver<551 brand=nvidiartx,driver>=550,driver<551 brand=vapps,driver>=550,driver<551 brand=vpc,driver>=550,driver<551 brand=vcs,driver>=550,driver<551 brand=vws,driver>=550,driver<551 brand=cloudgaming,driver>=550,driver<551 brand=unknown,driver>=565,driver<566 brand=grid,driver>=565,driver<566 brand=tesla,driver>=565,driver<566 brand=nvidia,driver>=565,driver<566 brand=quadro,driver>=565,driver<566 brand=quadrortx,driver>=565,driver<566 brand=nvidiartx,driver>=565,driver<566 brand=vapps,driver>=565,driver<566 brand=vpc,driver>=565,driver<566 brand=vcs,driver>=565,driver<566 brand=vws,driver>=565,driver<566 brand=cloudgaming,driver>=565,driver<566 brand=unknown,driver>=570,driver<571 brand=grid,driver>=570,driver<571 brand=tesla,driver>=570,driver<571 brand=nvidia,driver>=570,driver<571 brand=quadro,driver>=570,driver<571 brand=quadrortx,driver>=570,driver<571 brand=nvidiartx,driver>=570,driver<571 brand=vapps,driver>=570,driver<571 brand=vpc,driver>=570,driver<571 brand=vcs,driver>=570,driver<571 brand=vws,driver>=570,driver<571 brand=cloudgaming,driver>=570,driver<571 brand=unknown,driver>=575,driver<576 brand=grid,driver>=575,driver<576 brand=tesla,driver>=575,driver<576 brand=nvidia,driver>=575,driver<576 brand=quadro,driver>=575,driver<576 brand=quadrortx,driver>=575,driver<576 brand=nvidiartx,driver>=575,driver<576 brand=vapps,driver>=575,driver<576 brand=vpc,driver>=575,driver<576 brand=vcs,driver>=575,driver<576 brand=vws,driver>=575,driver<576 brand=cloudgaming,driver>=575,driver<576
NV_CUDA_CUDART_VERSION=13.0.88-1
CUDA_VERSION=13.0.1
LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64
NVIDIA_VISIBLE_DEVICES=all
NVIDIA_DRIVER_CAPABILITIES=compute,utility
DEBIAN_FRONTEND=noninteractive
UV_HTTP_TIMEOUT=500
UV_INDEX_STRATEGY=unsafe-best-match
UV_LINK_MODE=copy
VLLM_ENABLE_CUDA_COMPATIBILITY=0
TORCH_CUDA_ARCH_LIST=7.0 7.5 8.0 8.9 9.0 10.0 12.0
VLLM_USAGE_SOURCE=production-docker-image
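
The NVIDIA_REQUIRE_CUDA value above is hard to read at a glance. As far as I understand the NVIDIA Container Toolkit documentation, space-separated entries are alternatives and comma-separated terms inside one entry must all hold, so the value amounts to "CUDA 13.0 or newer, or one of the listed driver branches". The sketch below pulls out the distinct driver branches. The function name driver_branches and the shortened sample string are illustrative assumptions, not part of the image.

```shell
#!/bin/sh
# driver_branches VALUE
# Print the distinct driver branches named in an NVIDIA_REQUIRE_CUDA value.
# Each driver entry has the shape: brand=<name>,driver>=<N>,driver<<N+1>
driver_branches() {
    printf '%s\n' "$1" \
        | tr ' ' '\n' \
        | sed -n 's/^brand=[a-z]*,driver>=\([0-9][0-9]*\),driver<[0-9][0-9]*$/\1/p' \
        | sort -un \
        | xargs
}

# Shortened sample in the same shape as the full value (assumption: trimmed
# by hand for readability; pass the real value for the full list).
sample='cuda>=13.0 brand=unknown,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=tesla,driver>=550,driver<551 brand=grid,driver>=575,driver<576'

driver_branches "$sample"
```

Inside a running container the same function can be fed the real value with driver_branches "$NVIDIA_REQUIRE_CUDA".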
Image labels
maintainer: NVIDIA CORPORATION <cudatools@nvidia.com>
org.opencontainers.image.ref.name: ubuntu
org.opencontainers.image.version: 22.04

Docker pull commands

docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130
docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130  docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130
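
After pulling from a mirror it can be worth confirming that the local image is the one listed on this page. The sketch below compares the local image ID against the Image ID shown in the metadata above. It assumes Docker's classic image store, where the ID is the image config digest; with the containerd image store the reported ID may differ. The function name verify_image_id is illustrative.

```shell
#!/bin/sh
# Image ID as listed in the metadata on this page.
EXPECTED_ID='sha256:bda20f266f801e6b7079b2ec074bf172b6d10280b9df109e21b51e0bbc12dbfd'

# verify_image_id IMAGE
# Returns 0 if the local image ID matches EXPECTED_ID, 1 on mismatch,
# 2 if the image cannot be inspected (for example, not pulled yet).
verify_image_id() {
    actual=$(docker image inspect --format '{{.Id}}' "$1") || return 2
    if [ "$actual" = "$EXPECTED_ID" ]; then
        echo "OK: $1"
    else
        echo "MISMATCH: $1 has $actual" >&2
        return 1
    fi
}

# Usage, once the image has been pulled and tagged:
#   verify_image_id docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130
```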

Containerd pull commands

ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130
ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130  docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130

Shell quick-replace command

sed -i 's#vllm/vllm-openai:gemma4-0409-x86_64-cu130#swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130#' deployment.yaml
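
One caveat with a plain substring replacement: if the manifest already spells the image as docker.io/vllm/vllm-openai:..., or if the command is run twice, the mirror prefix ends up in the wrong place or duplicated. A more defensive sketch is below. It assumes the image is referenced on a YAML "image:" line; the function name rewrite_image is illustrative.

```shell
#!/bin/sh
SRC='vllm/vllm-openai:gemma4-0409-x86_64-cu130'
MIRROR='swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130'

# rewrite_image FILE
# Point "image:" lines at the mirror. The pattern accepts the bare name or
# the docker.io/-prefixed name, and only matches directly after "image:",
# so a line that was already rewritten is left unchanged.
rewrite_image() {
    sed -E "s#(image:[[:space:]]*[\"']?)(docker\.io/)?${SRC}#\1${MIRROR}#" "$1" > "$1.tmp" \
        && mv "$1.tmp" "$1"
}

# Usage:
#   rewrite_image deployment.yaml
```

Writing to a temporary file and renaming it avoids relying on sed -i, whose argument handling differs between GNU and BSD sed.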

Ansible quick distribution - Docker

#ansible k8s -m shell -a 'docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130 && docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130  docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130'

Ansible quick distribution - Containerd

#ansible k8s -m shell -a 'ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130 && ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130  docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130'

Image build history


# 2026-04-10 13:19:15  0.00B Configure the command run at container start
ENTRYPOINT ["vllm", "serve"]

# 2026-04-10 13:19:15  0.00B Set environment variable VLLM_USAGE_SOURCE
ENV VLLM_USAGE_SOURCE=production-docker-image

# 2026-04-10 13:19:15  915.38MB Run a command and create a new image layer
RUN |8 TARGETPLATFORM=linux/amd64 INSTALL_KV_CONNECTORS=true CUDA_VERSION=13.0.1 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= torch_cuda_arch_list=7.0 7.5 8.0 8.9 9.0 10.0 12.0 /bin/sh -c CUDA_MAJOR="${CUDA_VERSION%%.*}";     CUDA_VERSION_DASH=$(echo $CUDA_VERSION | cut -d. -f1,2 | tr '.' '-');     CUDA_HOME=/usr/local/cuda;     BUILD_PKGS="libcusparse-dev-${CUDA_VERSION_DASH}                 libcublas-dev-${CUDA_VERSION_DASH}                 libcusolver-dev-${CUDA_VERSION_DASH}";     if [ "$INSTALL_KV_CONNECTORS" = "true" ]; then         if [ "$CUDA_MAJOR" -ge 13 ]; then             uv pip install --system nixl-cu13;         fi;         uv pip install --system -r /tmp/kv_connectors.txt --no-build || (             apt-get update -y &&             apt-get install -y --no-install-recommends ${BUILD_PKGS} &&             uv pip install --system -r /tmp/kv_connectors.txt --no-build-isolation &&             apt-get purge -y ${BUILD_PKGS} &&             rm -rf /var/lib/apt/lists/*         );     fi # buildkit
                        
# 2026-04-10 13:18:57  0.00B Set environment variable TORCH_CUDA_ARCH_LIST
ENV TORCH_CUDA_ARCH_LIST=7.0 7.5 8.0 8.9 9.0 10.0 12.0

# 2026-04-10 13:18:57  0.00B Define a build argument
ARG torch_cuda_arch_list=7.0 7.5 8.0 8.9 9.0 10.0 12.0

# 2026-04-10 13:18:57  0.00B Set environment variable UV_HTTP_TIMEOUT
ENV UV_HTTP_TIMEOUT=500

# 2026-04-10 13:18:57  0.00B Define build arguments
ARG PIP_EXTRA_INDEX_URL UV_EXTRA_INDEX_URL

# 2026-04-10 13:18:57  0.00B Define build arguments
ARG PIP_INDEX_URL UV_INDEX_URL

# 2026-04-10 13:18:57  0.00B Define a build argument
ARG CUDA_VERSION

# 2026-04-10 13:18:57  0.00B Define a build argument
ARG INSTALL_KV_CONNECTORS=false

# 2026-04-10 13:18:57  0.00B Define a build argument
ARG TARGETPLATFORM

# 2026-04-10 13:18:57  27.84KB Copy new files or directories into the container
COPY ./vllm/collect_env.py . # buildkit

# 2026-04-10 13:18:57  958.90KB Copy new files or directories into the container
COPY benchmarks benchmarks # buildkit

# 2026-04-10 13:18:57  1.21MB Copy new files or directories into the container
COPY examples examples # buildkit

# 2026-04-10 13:18:57  0.00B Set environment variable LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64

# 2026-04-10 13:18:57  30.39MB Run a command and create a new image layer
RUN |22 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/amd64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c uv pip install --system ep_kernels/dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.') # buildkit
                        
# 2026-04-10 13:18:56  0.00B Set environment variable LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/cuda/lib64:/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64

# 2026-04-10 13:18:56  50.01MB Run a command and create a new image layer
RUN |22 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/amd64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c sh -c 'if ls /tmp/deepgemm/dist/*.whl >/dev/null 2>&1; then               uv pip install --system /tmp/deepgemm/dist/*.whl;            else               echo "No DeepGEMM wheels to install; skipping.";            fi' # buildkit
                        
# 2026-04-10 13:18:55  44.49MB Run a command and create a new image layer
RUN |22 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/amd64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c uv pip install --system "transformers==5.5.0" # buildkit
                        
# 2026-04-10 13:18:53  0.00B Run a command and create a new image layer
RUN |22 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/amd64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c . /etc/environment && uv pip list # buildkit
                        
# 2026-04-10 13:18:52  502.55MB Run a command and create a new image layer
RUN |22 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/amd64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c if [ "${PYTORCH_NIGHTLY}" = "1" ]; then         echo "Installing torch nightly..."         && uv pip install --system $(cat torch_lib_versions.txt | xargs) --pre         --index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/nightly/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.')         && echo "Installing vLLM..."         && uv pip install --system dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/nightly/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.');     else         echo "Installing vLLM..."         && uv pip install --system dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.');     fi # buildkit
                        
# 2026-04-10 13:10:56  71.00B Copy new files or directories into the container
COPY /workspace/torch_lib_versions.txt torch_lib_versions.txt # buildkit

# 2026-04-10 13:10:56  0.00B Define a build argument
ARG PYTORCH_NIGHTLY

# 2026-04-10 13:10:56  0.00B Define build arguments
ARG PIP_KEYRING_PROVIDER UV_KEYRING_PROVIDER

# 2026-04-10 13:10:56  0.00B Define a build argument
ARG PYTORCH_CUDA_INDEX_BASE_URL

# 2026-04-10 13:10:56  0.00B Define build arguments
ARG PIP_EXTRA_INDEX_URL UV_EXTRA_INDEX_URL

# 2026-04-10 13:10:56  0.00B Define build arguments
ARG PIP_INDEX_URL UV_INDEX_URL

# 2026-04-10 13:10:56  315.62MB Run a command and create a new image layer
RUN |14 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/amd64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 /bin/sh -c if [ "$TARGETPLATFORM" = "linux/arm64" ]; then         BITSANDBYTES_VERSION="${BITSANDBYTES_VERSION_ARM64}";     else         BITSANDBYTES_VERSION="${BITSANDBYTES_VERSION_X86}";     fi;     uv pip install --system accelerate hf_transfer modelscope         "bitsandbytes>=${BITSANDBYTES_VERSION}" "timm${TIMM_VERSION}" "runai-model-streamer[s3,gcs,azure]${RUNAI_MODEL_STREAMER_VERSION}" # buildkit
                        
# 2026-04-10 13:10:47  0.00B Define a build argument
ARG RUNAI_MODEL_STREAMER_VERSION=>=0.15.7

# 2026-04-10 13:10:47  0.00B Define a build argument
ARG TIMM_VERSION=>=1.0.17

# 2026-04-10 13:10:47  0.00B Define a build argument
ARG BITSANDBYTES_VERSION_ARM64=0.42.0

# 2026-04-10 13:10:47  0.00B Define a build argument
ARG BITSANDBYTES_VERSION_X86=0.46.1

# 2026-04-10 13:10:47  2.42MB Run a command and create a new image layer
RUN |10 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu22_04 TARGETPLATFORM=linux/amd64 /bin/sh -c set -eux;     case "${TARGETPLATFORM}" in       linux/arm64) UUARCH="aarch64" ;;       linux/amd64) UUARCH="x64" ;;       *) echo "Unsupported TARGETPLATFORM: ${TARGETPLATFORM}" >&2; exit 1 ;;     esac;     /tmp/install_gdrcopy.sh "${GDRCOPY_OS_VERSION}" "${GDRCOPY_CUDA_VERSION}" "${UUARCH}" &&     rm /tmp/install_gdrcopy.sh # buildkit
                        
# 2026-04-10 13:10:40  1.44KB Copy new files or directories into the container
COPY tools/install_gdrcopy.sh /tmp/install_gdrcopy.sh # buildkit

# 2026-04-10 13:10:40  0.00B Define a build argument
ARG TARGETPLATFORM

# 2026-04-10 13:10:40  0.00B Define a build argument
ARG GDRCOPY_OS_VERSION=Ubuntu22_04

# 2026-04-10 13:10:40  0.00B Define a build argument
ARG GDRCOPY_CUDA_VERSION=12.8

# 2026-04-10 13:10:40  319.35KB Run a command and create a new image layer
RUN |7 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 /bin/sh -c python3 <<'PYEOF'
from flashinfer.jit import env as jit_env
from flashinfer.jit.cubin_loader import download_trtllm_headers, get_cubin
from flashinfer.artifacts import ArtifactPath, CheckSumHash

download_trtllm_headers(
    'bmm',
    jit_env.FLASHINFER_CUBIN_DIR / 'flashinfer' / 'trtllm' / 'batched_gemm' / 'trtllmGen_bmm_export',
    f'{ArtifactPath.TRTLLM_GEN_BMM}/include/trtllmGen_bmm_export',
    ArtifactPath.TRTLLM_GEN_BMM,
    get_cubin(f'{ArtifactPath.TRTLLM_GEN_BMM}/checksums.txt', CheckSumHash.TRTLLM_GEN_BMM),
)

print('FlashInfer TRTLLM BMM headers downloaded successfully')
PYEOF # buildkit
                        
# 2026-04-10 13:10:36  7.41GB Run a command and create a new image layer
RUN |7 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.6 /bin/sh -c uv pip install --system flashinfer-jit-cache==${FLASHINFER_VERSION}         --extra-index-url https://flashinfer.ai/whl/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.')     && flashinfer show-config # buildkit
                        
# 2026-04-10 13:08:12  0.00B Define a build argument
ARG FLASHINFER_VERSION=0.6.6

# 2026-04-10 13:08:12  7.07GB Run a command and create a new image layer
RUN |6 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl /bin/sh -c uv pip install --system -r /tmp/requirements-cuda.txt         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.') &&     rm /tmp/requirements-cuda.txt /tmp/common.txt # buildkit
                        
# 2026-04-10 13:07:08  702.00B Copy new files or directories into the container
COPY requirements/cuda.txt /tmp/requirements-cuda.txt # buildkit

# 2026-04-10 13:07:08  2.92KB Copy new files or directories into the container
COPY requirements/common.txt /tmp/common.txt # buildkit

# 2026-04-10 13:07:05  0.00B Define a build argument
ARG PYTORCH_CUDA_INDEX_BASE_URL

# 2026-04-10 13:07:05  0.00B Set environment variable VLLM_ENABLE_CUDA_COMPATIBILITY
ENV VLLM_ENABLE_CUDA_COMPATIBILITY=0

# 2026-04-10 13:07:05  0.00B Set environment variable UV_LINK_MODE
ENV UV_LINK_MODE=copy

# 2026-04-10 13:07:05  0.00B Set environment variable UV_INDEX_STRATEGY
ENV UV_INDEX_STRATEGY=unsafe-best-match

# 2026-04-10 13:07:05  0.00B Set environment variable UV_HTTP_TIMEOUT
ENV UV_HTTP_TIMEOUT=500

# 2026-04-10 13:07:05  86.84MB Run a command and create a new image layer
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c python3 -m pip install uv # buildkit
                        
# 2026-04-10 13:06:59  2.23GB Run a command and create a new image layer
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c CUDA_VERSION_DASH=$(echo $CUDA_VERSION | cut -d. -f1,2 | tr '.' '-') &&     apt-get update -y &&     apt-get install -y --no-install-recommends         cuda-nvcc-${CUDA_VERSION_DASH}         cuda-cudart-${CUDA_VERSION_DASH}         cuda-nvrtc-${CUDA_VERSION_DASH}         cuda-cuobjdump-${CUDA_VERSION_DASH}         libcurand-dev-${CUDA_VERSION_DASH}         libcublas-${CUDA_VERSION_DASH}         libnccl-dev &&     rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2026-04-10 13:05:50  646.96MB Run a command and create a new image layer
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c apt-get update -y     && apt-get install -y --no-install-recommends         software-properties-common         curl         sudo         ffmpeg         libsm6         libxext6         libgl1     && if [ ! -z ${DEADSNAKES_MIRROR_URL} ] ; then         if [ ! -z "${DEADSNAKES_GPGKEY_URL}" ] ; then             mkdir -p -m 0755 /etc/apt/keyrings ;             curl -L ${DEADSNAKES_GPGKEY_URL} | gpg --dearmor > /etc/apt/keyrings/deadsnakes.gpg ;             sudo chmod 644 /etc/apt/keyrings/deadsnakes.gpg ;             echo "deb [signed-by=/etc/apt/keyrings/deadsnakes.gpg] ${DEADSNAKES_MIRROR_URL} $(lsb_release -cs) main" > /etc/apt/sources.list.d/deadsnakes.list ;         fi ;     else         for i in 1 2 3; do             add-apt-repository -y ppa:deadsnakes/ppa && break ||             { echo "Attempt $i failed, retrying in 5s..."; sleep 5; };         done ;     fi     && apt-get update -y     && apt-get install -y --no-install-recommends         python${PYTHON_VERSION}         python${PYTHON_VERSION}-dev         python${PYTHON_VERSION}-venv         libibverbs-dev     && rm -rf /var/lib/apt/lists/*     && update-alternatives --install /usr/bin/python3 python3 /usr/bin/python${PYTHON_VERSION} 1     && update-alternatives --set python3 /usr/bin/python${PYTHON_VERSION}     && ln -sf /usr/bin/python${PYTHON_VERSION}-config /usr/bin/python3-config     && rm -f /usr/lib/python${PYTHON_VERSION}/EXTERNALLY-MANAGED     && curl -sS ${GET_PIP_URL} | python${PYTHON_VERSION}     && python3 --version && python3 -m pip --version # buildkit
                        
# 2026-04-10 13:02:30  136.00B Run a command and create a new image layer
RUN |5 CUDA_VERSION=13.0.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c PYTHON_VERSION_STR=$(echo ${PYTHON_VERSION} | sed 's/\.//g') &&     echo "export PYTHON_VERSION_STR=${PYTHON_VERSION_STR}" >> /etc/environment # buildkit
                        
# 2026-04-10 13:02:06  0.00B Set the working directory to /vllm-workspace
WORKDIR /vllm-workspace

# 2026-04-10 13:02:06  0.00B Set environment variable DEBIAN_FRONTEND
ENV DEBIAN_FRONTEND=noninteractive

# 2026-04-10 13:02:06  0.00B Define a build argument
ARG GET_PIP_URL

# 2026-04-10 13:02:06  0.00B Define a build argument
ARG DEADSNAKES_GPGKEY_URL

# 2026-04-10 13:02:06  0.00B Define a build argument
ARG DEADSNAKES_MIRROR_URL

# 2026-04-10 13:02:06  0.00B Define a build argument
ARG PYTHON_VERSION

# 2026-04-10 13:02:06  0.00B Define a build argument
ARG CUDA_VERSION

# 2025-09-09 01:23:07  0.00B Set environment variable NVIDIA_DRIVER_CAPABILITIES
ENV NVIDIA_DRIVER_CAPABILITIES=compute,utility

# 2025-09-09 01:23:07  0.00B Set environment variable NVIDIA_VISIBLE_DEVICES
ENV NVIDIA_VISIBLE_DEVICES=all

# 2025-09-09 01:23:07  17.29KB Copy new files or directories into the container
COPY NGC-DL-CONTAINER-LICENSE / # buildkit

# 2025-09-09 01:23:07  0.00B Set environment variable LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64

# 2025-09-09 01:23:07  0.00B Set environment variable PATH
ENV PATH=/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin

# 2025-09-09 01:23:07  22.00B Run a command and create a new image layer
RUN |1 TARGETARCH=amd64 /bin/sh -c echo "/usr/local/cuda/lib64" >> /etc/ld.so.conf.d/nvidia.conf # buildkit
                        
# 2025-09-09 01:23:07  322.88MB Run a command and create a new image layer
RUN |1 TARGETARCH=amd64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     cuda-cudart-13-0=${NV_CUDA_CUDART_VERSION}     cuda-compat-13-0     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-09-09 01:23:07  0.00B Set environment variable CUDA_VERSION
ENV CUDA_VERSION=13.0.1

# 2025-09-09 01:23:07  10.60MB Run a command and create a new image layer
RUN |1 TARGETARCH=amd64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     gnupg2 curl ca-certificates &&     curl -fsSLO https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/${NVARCH}/cuda-keyring_1.1-1_all.deb &&     dpkg -i cuda-keyring_1.1-1_all.deb &&     apt-get purge --autoremove -y curl     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-09-09 01:23:07  0.00B Add a metadata label
LABEL maintainer="NVIDIA CORPORATION <cudatools@nvidia.com>"

# 2025-09-09 01:23:07  0.00B Define a build argument
ARG TARGETARCH

# 2025-09-09 01:23:07  0.00B Set environment variable NV_CUDA_CUDART_VERSION
ENV NV_CUDA_CUDART_VERSION=13.0.88-1
                        
# 2025-09-09 01:23:07  0.00B Set environment variable NVIDIA_REQUIRE_CUDA
ENV NVIDIA_REQUIRE_CUDA=cuda>=13.0 brand=unknown,driver>=535,driver<536 brand=grid,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=vapps,driver>=535,driver<536 brand=vpc,driver>=535,driver<536 brand=vcs,driver>=535,driver<536 brand=vws,driver>=535,driver<536 brand=cloudgaming,driver>=535,driver<536 brand=unknown,driver>=550,driver<551 brand=grid,driver>=550,driver<551 brand=tesla,driver>=550,driver<551 brand=nvidia,driver>=550,driver<551 brand=quadro,driver>=550,driver<551 brand=quadrortx,driver>=550,driver<551 brand=nvidiartx,driver>=550,driver<551 brand=vapps,driver>=550,driver<551 brand=vpc,driver>=550,driver<551 brand=vcs,driver>=550,driver<551 brand=vws,driver>=550,driver<551 brand=cloudgaming,driver>=550,driver<551 brand=unknown,driver>=565,driver<566 brand=grid,driver>=565,driver<566 brand=tesla,driver>=565,driver<566 brand=nvidia,driver>=565,driver<566 brand=quadro,driver>=565,driver<566 brand=quadrortx,driver>=565,driver<566 brand=nvidiartx,driver>=565,driver<566 brand=vapps,driver>=565,driver<566 brand=vpc,driver>=565,driver<566 brand=vcs,driver>=565,driver<566 brand=vws,driver>=565,driver<566 brand=cloudgaming,driver>=565,driver<566 brand=unknown,driver>=570,driver<571 brand=grid,driver>=570,driver<571 brand=tesla,driver>=570,driver<571 brand=nvidia,driver>=570,driver<571 brand=quadro,driver>=570,driver<571 brand=quadrortx,driver>=570,driver<571 brand=nvidiartx,driver>=570,driver<571 brand=vapps,driver>=570,driver<571 brand=vpc,driver>=570,driver<571 brand=vcs,driver>=570,driver<571 brand=vws,driver>=570,driver<571 brand=cloudgaming,driver>=570,driver<571 brand=unknown,driver>=575,driver<576 brand=grid,driver>=575,driver<576 brand=tesla,driver>=575,driver<576 brand=nvidia,driver>=575,driver<576 brand=quadro,driver>=575,driver<576 brand=quadrortx,driver>=575,driver<576 brand=nvidiartx,driver>=575,driver<576 brand=vapps,driver>=575,driver<576 brand=vpc,driver>=575,driver<576 brand=vcs,driver>=575,driver<576 brand=vws,driver>=575,driver<576 brand=cloudgaming,driver>=575,driver<576
                        
# 2025-09-09 01:23:07  0.00B Set environment variable NVARCH
ENV NVARCH=x86_64
                        
# 2025-08-20 01:17:10  0.00B 
/bin/sh -c #(nop)  CMD ["/bin/bash"]
                        
# 2025-08-20 01:17:10  77.87MB 
/bin/sh -c #(nop) ADD file:9303cc1f788d2a9a8f909b154339f7c637b2a53c75c0e7f3da62eb1fefe371b1 in / 
                        
# 2025-08-20 01:17:08  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.version=22.04
                        
# 2025-08-20 01:17:08  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.ref.name=ubuntu
                        
# 2025-08-20 01:17:08  0.00B 
/bin/sh -c #(nop)  ARG LAUNCHPAD_BUILD_ARCH
                        
# 2025-08-20 01:17:08  0.00B 
/bin/sh -c #(nop)  ARG RELEASE
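
Several RUN steps in the history above derive short strings from CUDA_VERSION: the major version (used to decide whether to install nixl-cu13), a dashed form for apt package names such as cuda-nvcc-13-0, and a cuNNN suffix for the PyTorch and FlashInfer wheel index URLs. The sketch below isolates those three transformations using the same shell constructs that appear in the build steps. The function names are illustrative and do not come from the Dockerfile.

```shell
#!/bin/sh
# 13.0.1 -> 13   (parameter expansion: strip everything from the first dot)
cuda_major() { echo "${1%%.*}"; }

# 13.0.1 -> 13-0 (keep major.minor, then turn the dot into a dash)
cuda_dash() { echo "$1" | cut -d. -f1,2 | tr '.' '-'; }

# 13.0.1 -> cu130 (keep major.minor, then drop the dot)
cuda_suffix() { echo "cu$(echo "$1" | cut -d. -f1,2 | tr -d '.')"; }

CUDA_VERSION=13.0.1
echo "major:  $(cuda_major "$CUDA_VERSION")"
echo "dash:   $(cuda_dash "$CUDA_VERSION")"
echo "suffix: $(cuda_suffix "$CUDA_VERSION")"
```

For this image the suffix works out to cu130, which matches the cu130 at the end of the image tag.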
                        
                    

Image info

{
    "Id": "sha256:bda20f266f801e6b7079b2ec074bf172b6d10280b9df109e21b51e0bbc12dbfd",
    "RepoTags": [
        "vllm/vllm-openai:gemma4-0409-x86_64-cu130",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130"
    ],
    "RepoDigests": [
        "vllm/vllm-openai@sha256:cf35c7ceb0a1cda82571d3df0a0efde2e736fab9c00e6191ebf2da6322edd8fb",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai@sha256:3f1134e5d5ffe83c2c707be09a1572e467f74b72ff9dccc25d25e80ab567c95f"
    ],
    "Parent": "",
    "Comment": "buildkit.dockerfile.v0",
    "Created": "2026-04-10T05:19:15.821299241Z",
    "Container": "",
    "ContainerConfig": null,
    "DockerVersion": "",
    "Author": "",
    "Config": {
        "Hostname": "",
        "Domainname": "",
        "User": "",
        "AttachStdin": false,
        "AttachStdout": false,
        "AttachStderr": false,
        "Tty": false,
        "OpenStdin": false,
        "StdinOnce": false,
        "Env": [
            "PATH=/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
            "NVARCH=x86_64",
            "NVIDIA_REQUIRE_CUDA=cuda\u003e=13.0 brand=unknown,driver\u003e=535,driver\u003c536 brand=grid,driver\u003e=535,driver\u003c536 brand=tesla,driver\u003e=535,driver\u003c536 brand=nvidia,driver\u003e=535,driver\u003c536 brand=quadro,driver\u003e=535,driver\u003c536 brand=quadrortx,driver\u003e=535,driver\u003c536 brand=nvidiartx,driver\u003e=535,driver\u003c536 brand=vapps,driver\u003e=535,driver\u003c536 brand=vpc,driver\u003e=535,driver\u003c536 brand=vcs,driver\u003e=535,driver\u003c536 brand=vws,driver\u003e=535,driver\u003c536 brand=cloudgaming,driver\u003e=535,driver\u003c536 brand=unknown,driver\u003e=550,driver\u003c551 brand=grid,driver\u003e=550,driver\u003c551 brand=tesla,driver\u003e=550,driver\u003c551 brand=nvidia,driver\u003e=550,driver\u003c551 brand=quadro,driver\u003e=550,driver\u003c551 brand=quadrortx,driver\u003e=550,driver\u003c551 brand=nvidiartx,driver\u003e=550,driver\u003c551 brand=vapps,driver\u003e=550,driver\u003c551 brand=vpc,driver\u003e=550,driver\u003c551 brand=vcs,driver\u003e=550,driver\u003c551 brand=vws,driver\u003e=550,driver\u003c551 brand=cloudgaming,driver\u003e=550,driver\u003c551 brand=unknown,driver\u003e=565,driver\u003c566 brand=grid,driver\u003e=565,driver\u003c566 brand=tesla,driver\u003e=565,driver\u003c566 brand=nvidia,driver\u003e=565,driver\u003c566 brand=quadro,driver\u003e=565,driver\u003c566 brand=quadrortx,driver\u003e=565,driver\u003c566 brand=nvidiartx,driver\u003e=565,driver\u003c566 brand=vapps,driver\u003e=565,driver\u003c566 brand=vpc,driver\u003e=565,driver\u003c566 brand=vcs,driver\u003e=565,driver\u003c566 brand=vws,driver\u003e=565,driver\u003c566 brand=cloudgaming,driver\u003e=565,driver\u003c566 brand=unknown,driver\u003e=570,driver\u003c571 brand=grid,driver\u003e=570,driver\u003c571 brand=tesla,driver\u003e=570,driver\u003c571 brand=nvidia,driver\u003e=570,driver\u003c571 brand=quadro,driver\u003e=570,driver\u003c571 brand=quadrortx,driver\u003e=570,driver\u003c571 brand=nvidiartx,driver\u003e=570,driver\u003c571 brand=vapps,driver\u003e=570,driver\u003c571 brand=vpc,driver\u003e=570,driver\u003c571 brand=vcs,driver\u003e=570,driver\u003c571 brand=vws,driver\u003e=570,driver\u003c571 brand=cloudgaming,driver\u003e=570,driver\u003c571 brand=unknown,driver\u003e=575,driver\u003c576 brand=grid,driver\u003e=575,driver\u003c576 brand=tesla,driver\u003e=575,driver\u003c576 brand=nvidia,driver\u003e=575,driver\u003c576 brand=quadro,driver\u003e=575,driver\u003c576 brand=quadrortx,driver\u003e=575,driver\u003c576 brand=nvidiartx,driver\u003e=575,driver\u003c576 brand=vapps,driver\u003e=575,driver\u003c576 brand=vpc,driver\u003e=575,driver\u003c576 brand=vcs,driver\u003e=575,driver\u003c576 brand=vws,driver\u003e=575,driver\u003c576 brand=cloudgaming,driver\u003e=575,driver\u003c576",
            "NV_CUDA_CUDART_VERSION=13.0.88-1",
            "CUDA_VERSION=13.0.1",
            "LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/local/cuda/lib64",
            "NVIDIA_VISIBLE_DEVICES=all",
            "NVIDIA_DRIVER_CAPABILITIES=compute,utility",
            "DEBIAN_FRONTEND=noninteractive",
            "UV_HTTP_TIMEOUT=500",
            "UV_INDEX_STRATEGY=unsafe-best-match",
            "UV_LINK_MODE=copy",
            "VLLM_ENABLE_CUDA_COMPATIBILITY=0",
            "TORCH_CUDA_ARCH_LIST=7.0 7.5 8.0 8.9 9.0 10.0 12.0",
            "VLLM_USAGE_SOURCE=production-docker-image"
        ],
        "Cmd": null,
        "Image": "",
        "Volumes": null,
        "WorkingDir": "/vllm-workspace",
        "Entrypoint": [
            "vllm",
            "serve"
        ],
        "OnBuild": null,
        "Labels": {
            "maintainer": "NVIDIA CORPORATION \u003ccudatools@nvidia.com\u003e",
            "org.opencontainers.image.ref.name": "ubuntu",
            "org.opencontainers.image.version": "22.04"
        }
    },
    "Architecture": "amd64",
    "Os": "linux",
    "Size": 19718263849,
    "GraphDriver": {
        "Data": {
            "LowerDir": "/var/lib/docker/overlay2/e16795c9d5831f50066752b7d8ea618605844380436e5c69b70c5bc8ff7e2e1c/diff:/var/lib/docker/overlay2/1b415a2fbefc20d4cf203b0e4ff264abe7bf2b43ce8feb2d14ce7da314697800/diff:/var/lib/docker/overlay2/a6d1e8d49b9251ee0a80be807aabcb1ab1c48f048c3f5ff98974587c6c9b480f/diff:/var/lib/docker/overlay2/a6110b25a7b865c5150b7c0d6866faaeaab79ec1b2a76feea207f2a22e0fbc35/diff:/var/lib/docker/overlay2/81fc4543d690c95c75592bbd4a8f7d0429e240b388a4af6f67f03264736e5305/diff:/var/lib/docker/overlay2/761bad47ff7391ffc2ca9178f712981b8fd33490ca8be7d2696a23c70899cf35/diff:/var/lib/docker/overlay2/27f113537321c107da29b465e5a2cd28880b72578df229deed4fcf5353d06526/diff:/var/lib/docker/overlay2/a8a15e946ebcaed14cd20fe1d2623aeff4b7d0a7406162b4292fe360b420ae52/diff:/var/lib/docker/overlay2/845d76fd217c7ac8b52331a3527aaefa1363bba63073d3b06cd4416475d01695/diff:/var/lib/docker/overlay2/1891a5ea3a3b4def1f590ab6513805f110cb78d421c86b6e790adf44e82ca801/diff:/var/lib/docker/overlay2/1f26f3dfee5830d9684770287e0031816d690db5fcf37878bafac427ae35367d/diff:/var/lib/docker/overlay2/66ddf56367e8005c9c7f4fbff3589ba2ad614b06ad408d0740995361c18de558/diff:/var/lib/docker/overlay2/f36d15fd1afb53ec1f06ffb061f02f6ce21c10c23fba987b221c61acbb9df276/diff:/var/lib/docker/overlay2/93995094cc2e70729a8f90b5dbed3c7b36b37f34f4192c32b561aaa901dd042f/diff:/var/lib/docker/overlay2/4de31a2d06a55ec10d45b40b992c495e84cfb89c8d5eae11c77440b4f02938a7/diff:/var/lib/docker/overlay2/65baa60c261b34ddf177d26d585320a0e216f8fa8ff6b74508ec652d8e148d9f/diff:/var/lib/docker/overlay2/d2bc21ca1d477ecd297f17940c7dbc57d86c24c6d73d484f5eaffabef0e63968/diff:/var/lib/docker/overlay2/6c8886057cd99d3c345e1cf9fd1af59ad7a561b22e0454489c1661d5ba2398ab/diff:/var/lib/docker/overlay2/e7777bb2b57ff082be7af9d19b1ab650b0fe418db1e2213c48529003ff7beed5/diff:/var/lib/docker/overlay2/bd0fb959b91ea0f66c798e24cd81b95362708e621ad3bd9e0b8761d21ab7076b/diff:/var/lib/docker/overlay2/27c4eae79934c9e18d5c5fb1ea53a58b4b90bea8f9182e5dfd
2658af43b015b4/diff:/var/lib/docker/overlay2/8c07236ed03f6b17eff3e1efb2fb5cdae1baeba2757a44ddede8db3851425724/diff:/var/lib/docker/overlay2/0fa933afe9bd85116c6d5343c6e08ced0d011a32e849ea44fce72b4d2d055aab/diff:/var/lib/docker/overlay2/102923de7412a6c0cdd5b5c2805dc4a46ce60d60e2265c618938af8fd8c49813/diff:/var/lib/docker/overlay2/63128cc8bca9ac3d61128c7871193d0c381afcc09d41f6063f572c5f55816e1c/diff:/var/lib/docker/overlay2/1d81ee2f19601333e631d7ffc6da6873a462cee4a95077ca9ea444a455285fec/diff:/var/lib/docker/overlay2/2c915c9261c285fb82e970469a3cf7ce891d888b5bb4afe0f4af3a4eef4c33a2/diff",
            "MergedDir": "/var/lib/docker/overlay2/513f3a903c6ccdb646273b7fdce3486d522ae145ed7975b19d2180edc57fa65b/merged",
            "UpperDir": "/var/lib/docker/overlay2/513f3a903c6ccdb646273b7fdce3486d522ae145ed7975b19d2180edc57fa65b/diff",
            "WorkDir": "/var/lib/docker/overlay2/513f3a903c6ccdb646273b7fdce3486d522ae145ed7975b19d2180edc57fa65b/work"
        },
        "Name": "overlay2"
    },
    "RootFS": {
        "Type": "layers",
        "Layers": [
            "sha256:dc6eb6dad5f9e332f00af553440e857b1467db1be43dd910cdb6830ba0898d50",
            "sha256:e2a50251472b1dae8bd3e8f3c7610b60293462494a6089ce12ec200d0532390e",
            "sha256:3e1159f70fa8cd9cf23103a830dd3fa91fd5dc5da0c209c43bf4846576cec32e",
            "sha256:7482e185693edb89761eb19d8f1a24e01c178ead3e5cac251253943f6c1d113e",
            "sha256:495bbaf4cc5ca53d8286fe305d2fc2fd3ecb2c8791cfdd4473e984d2eb0f3ea1",
            "sha256:a7cc26faa536dd51a3f0c44ba9587e4ec4ecdbb2bb48048291f7ea8bbf04d269",
            "sha256:ec52bdc6f70925716847140b5d3ff61a8d4af1027597f0b0a1ae63cd37e401e4",
            "sha256:715582d4da833d8960a14a1b22f866848638d74d8b8b80f51a93a2a6e80bf721",
            "sha256:1f73deec5dddefa70d103b5053e0804954d15eb77b93df8887d6d8bfe3f6baf7",
            "sha256:e5c80d3201277587feb3924e48ef9485c52ab190ed03b893550a11bc7b8d47a1",
            "sha256:51572b278ad23836eac69464b7905f67e5f005453382e99138bcadb525a0221a",
            "sha256:19c8c653bcfb6857914e8a56dc2d774dbb6d75805dda3eb10588ea6e0e32f4a7",
            "sha256:3ca594337b4cf2a5b0f1f11393ba29d836c8d08fc150dd1a711f35cf6c0f7342",
            "sha256:1a4d75adf5e76f0cc354de82040ed17d038827141f76c510d184fc6b85cad2be",
            "sha256:a276a4fb7d9130cf66f3f017685c920c3fb03a8b674e46e8f3a17e446029a000",
            "sha256:2049e34fb1b88170429602200d342fe71c4adfeb91178c0046dfb5e1c9dee903",
            "sha256:909411c6bd69e71c4a2da992e58398914e3a0ae1a0e084a5b0252ab8a1aca62b",
            "sha256:d065aff61b5754c95074c48e14b1f9f6f3f0b7044180c6679cee51dcaec3494d",
            "sha256:a82f8658eaa92d6c4244bdc1da2938084854080b279e1a8119e1f8290d924798",
            "sha256:3ba13f87d8d4f08ea3b4c66fb291a1b0468339ff77511d244d3cdc58310e7717",
            "sha256:df4ce098315659458d8f06113eae1f77c8eb1d8ebc75b01fb95f83b49d369b7b",
            "sha256:a4fa8d7e6d9a3119037d1c027de80a5180cd9a99e7f5a249e464dc1818f3e38d",
            "sha256:7bbcd762661849184cdf7b7fb9daceddd2d435dc802801eaf1e162c826692d63",
            "sha256:0e5067f08ef9e378c09f88b9e6c3c31384ab5b44513e57128810a3d52360f4ec",
            "sha256:3f18c912aaafc80e0c3d574d95cc366ba2516be10940b597b0e6db170f4244dc",
            "sha256:1037896a6cf86355152dfdc7e9f1c3b961b6cf2aa5dc0696a42adeab8f15f71f",
            "sha256:c94486336e137baf740bdc71269e408c9f7979d742a7121c3fb209be481ae89a",
            "sha256:391dd9613e106e1f436bd41a952fc0d99e159a3fcf80d8a19f58c2e0b7af60ad"
        ]
    },
    "Metadata": {
        "LastTagTime": "2026-04-12T00:46:00.872403542+08:00"
    }
}
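
The inspect output above can be read programmatically. The following is a minimal sketch, not part of the original page: it embeds a trimmed copy of the fields shown above (the `NVIDIA_REQUIRE_CUDA` value is shortened to two driver groups) and assumes the `Env`, `WorkingDir` and `Entrypoint` fields sit under a `Config` key, as in standard `docker image inspect` output. The helper names `env_dict`, `parse_require` and `driver_branches` are hypothetical. The reading of `NVIDIA_REQUIRE_CUDA` (space-separated groups are alternatives, comma-separated terms must all hold) follows the NVIDIA container runtime convention as commonly documented and should be checked against the toolkit version in use.

```python
import json

# Trimmed copy of the inspect fields shown above. The "Config" wrapper key is
# an assumption; the NVIDIA_REQUIRE_CUDA value is shortened for readability.
INSPECT_JSON = r'''
{
  "Config": {
    "Env": [
      "NVIDIA_REQUIRE_CUDA=cuda\u003e=13.0 brand=tesla,driver\u003e=535,driver\u003c536 brand=tesla,driver\u003e=575,driver\u003c576",
      "CUDA_VERSION=13.0.1",
      "TORCH_CUDA_ARCH_LIST=7.0 7.5 8.0 8.9 9.0 10.0 12.0"
    ],
    "WorkingDir": "/vllm-workspace",
    "Entrypoint": ["vllm", "serve"]
  },
  "Size": 19718263849
}
'''


def env_dict(env_list):
    """Turn ["KEY=value", ...] into a dict, splitting on the first '=' only."""
    return dict(item.split("=", 1) for item in env_list)


def parse_require(value):
    """Split a requirement string into groups of comma-separated terms."""
    return [group.split(",") for group in value.split()]


def driver_branches(groups):
    """Collect the driver major versions named by 'driver>=N' terms."""
    branches = set()
    for group in groups:
        for term in group:
            if term.startswith("driver>="):
                branches.add(int(term[len("driver>="):]))
    return sorted(branches)


# json.loads decodes the \u003e and \u003c escapes to '>' and '<'.
image = json.loads(INSPECT_JSON)
env = env_dict(image["Config"]["Env"])
groups = parse_require(env["NVIDIA_REQUIRE_CUDA"])
arch_list = env["TORCH_CUDA_ARCH_LIST"].split()
size_gb = round(image["Size"] / 1e9, 2)  # decimal gigabytes

print("entrypoint:", " ".join(image["Config"]["Entrypoint"]))
print("CUDA requirement:", groups[0][0])
print("driver branches:", driver_branches(groups))
print("compute capabilities:", arch_list)
print("size (GB):", size_gb)
```

Dividing `Size` by 10^9 reproduces the 19.72GB figure shown for this image, which suggests the page reports decimal gigabytes.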

More versions

docker.io/vllm/vllm-openai:v0.5.4
linux/amd64 | docker.io | 9.90GB | 2024-09-07 06:20 | 3188 views

docker.io/vllm/vllm-openai:v0.6.0
linux/amd64 | docker.io | 9.72GB | 2024-09-11 01:51 | 1806 views

docker.io/vllm/vllm-openai:v0.6.1.post2
linux/amd64 | docker.io | 9.81GB | 2024-09-24 01:43 | 1379 views

docker.io/vllm/vllm-openai:latest
linux/amd64 | docker.io | 10.24GB | 2024-10-11 00:43 | 8700 views

docker.io/vllm/vllm-openai:v0.6.4.post1
linux/amd64 | docker.io | 10.64GB | 2024-11-19 00:42 | 1466 views

docker.io/vllm/vllm-openai:v0.6.4
linux/amd64 | docker.io | 10.64GB | 2024-12-11 02:08 | 1123 views

docker.io/vllm/vllm-openai:v0.6.3
linux/amd64 | docker.io | 10.43GB | 2024-12-12 02:41 | 1430 views

docker.io/vllm/vllm-openai:v0.6.6
linux/amd64 | docker.io | 10.23GB | 2025-01-04 00:37 | 1786 views

docker.io/vllm/vllm-openai:v0.6.6.post1
linux/amd64 | docker.io | 10.23GB | 2025-01-24 00:21 | 1514 views

docker.io/vllm/vllm-openai:v0.7.1
linux/amd64 | docker.io | 16.53GB | 2025-02-08 02:05 | 1398 views

docker.io/vllm/vllm-openai:v0.7.2
linux/amd64 | docker.io | 16.53GB | 2025-02-09 00:28 | 3193 views

docker.io/vllm/vllm-openai:v0.7.3
linux/amd64 | docker.io | 16.43GB | 2025-02-24 00:50 | 3991 views

docker.io/vllm/vllm-openai:v0.8.0
linux/amd64 | docker.io | 16.62GB | 2025-03-20 00:23 | 1654 views

docker.io/vllm/vllm-openai:v0.8.1
linux/amd64 | docker.io | 16.62GB | 2025-03-21 00:28 | 1364 views

docker.io/vllm/vllm-openai:v0.8.2
linux/amd64 | docker.io | 16.92GB | 2025-03-27 01:12 | 1590 views

docker.io/vllm/vllm-openai:v0.8.3
linux/amd64 | docker.io | 17.13GB | 2025-04-08 00:58 | 1627 views

docker.io/vllm/vllm-openai:v0.8.4
linux/amd64 | docker.io | 17.16GB | 2025-04-17 01:16 | 2109 views

docker.io/vllm/vllm-openai:v0.8.5
linux/amd64 | docker.io | 17.30GB | 2025-04-30 02:45 | 3712 views

docker.io/vllm/vllm-openai:v0.8.5.post1
linux/amd64 | docker.io | 17.30GB | 2025-05-07 02:06 | 3824 views

docker.io/vllm/vllm-openai:v0.9.0.1
linux/amd64 | docker.io | 20.81GB | 2025-06-05 01:12 | 2559 views

docker.io/vllm/vllm-openai:v0.9.1
linux/amd64 | docker.io | 20.85GB | 2025-06-12 01:29 | 3222 views

docker.io/vllm/vllm-openai:v0.9.2
linux/amd64 | docker.io | 20.76GB | 2025-07-09 03:00 | 7850 views

docker.io/vllm/vllm-openai:v0.10.0
linux/amd64 | docker.io | 26.13GB | 2025-07-26 03:15 | 2137 views

docker.io/vllm/vllm-openai:gptoss
linux/amd64 | docker.io | 33.86GB | 2025-08-07 01:52 | 1557 views

docker.io/vllm/vllm-openai:v0.10.1
linux/amd64 | docker.io | 20.25GB | 2025-08-20 03:05 | 1494 views

docker.io/vllm/vllm-openai:v0.10.1.1
linux/amd64 | docker.io | 20.26GB | 2025-08-23 01:43 | 2500 views

docker.io/vllm/vllm-openai:v0.10.2
linux/amd64 | docker.io | 22.49GB | 2025-09-16 03:40 | 1885 views

docker.io/vllm/vllm-openai:v0.2.7
linux/amd64 | docker.io | 6.34GB | 2025-10-01 01:07 | 555 views

docker.io/vllm/vllm-openai:v0.11.0-x86_64
linux/amd64 | docker.io | 25.86GB | 2025-10-09 02:14 | 2607 views

docker.io/vllm/vllm-openai:v0.10.2-x86_64
linux/amd64 | docker.io | 22.49GB | 2025-10-09 02:22 | 682 views

docker.io/vllm/vllm-openai:v0.11.0
linux/amd64 | docker.io | 25.86GB | 2025-10-09 11:24 | 2666 views

docker.io/vllm/vllm-openai:v0.11.0
linux/arm64 | docker.io | 24.17GB | 2025-10-30 00:47 | 1149 views

docker.io/vllm/vllm-openai:v0.3.3
linux/amd64 | docker.io | 9.13GB | 2025-11-18 01:01 | 510 views

docker.io/vllm/vllm-openai:v0.11.1
linux/amd64 | docker.io | 28.72GB | 2025-11-21 01:03 | 902 views

docker.io/vllm/vllm-openai:v0.11.2
linux/amd64 | docker.io | 28.82GB | 2025-11-22 00:46 | 1929 views

docker.io/vllm/vllm-openai:v0.11.1
linux/arm64 | docker.io | 26.54GB | 2025-11-22 01:23 | 518 views

docker.io/vllm/vllm-openai:v0.4.0
linux/amd64 | docker.io | 9.88GB | 2025-11-22 01:58 | 515 views

docker.io/vllm/vllm-openai:v0.11.2
linux/arm64 | docker.io | 26.54GB | 2025-11-22 04:06 | 718 views

docker.io/vllm/vllm-openai:nightly
linux/amd64 | docker.io | 18.74GB | 2025-12-03 02:43 | 2154 views

docker.io/vllm/vllm-openai:v0.12.0-aarch64
linux/arm64 | docker.io | 17.89GB | 2025-12-05 03:12 | 865 views

docker.io/vllm/vllm-openai:v0.12.0
linux/amd64 | docker.io | 19.47GB | 2025-12-05 03:59 | 2689 views

docker.io/vllm/vllm-openai:v0.13.0
linux/amd64 | docker.io | 19.51GB | 2026-01-22 01:41 | 1422 views

docker.io/vllm/vllm-openai:v0.14.0
linux/amd64 | docker.io | 19.66GB | 2026-01-22 03:16 | 745 views

docker.io/vllm/vllm-openai:v0.14.1
linux/amd64 | docker.io | 19.69GB | 2026-01-27 01:52 | 905 views

docker.io/vllm/vllm-openai:v0.15.0
linux/amd64 | docker.io | 20.13GB | 2026-01-31 00:51 | 1113 views
625

docker.io/vllm/vllm-openai:v0.15.1
linux/amd64 | docker.io | 20.14GB | 2026-02-06 01:14 | 1009 views

docker.io/vllm/vllm-openai:v0.15.1-cu130
linux/amd64 | docker.io | 18.77GB | 2026-02-07 00:39 | 861 views

docker.io/vllm/vllm-openai:latest
linux/arm64 | docker.io | 20.65GB | 2026-02-08 00:59 | 613 views

docker.io/vllm/vllm-openai:v0.15.1-aarch64-cu130
linux/arm64 | docker.io | 19.60GB | 2026-02-10 00:44 | 693 views

docker.io/vllm/vllm-openai:glm5
linux/amd64 | docker.io | 20.27GB | 2026-02-14 00:56 | 574 views

docker.io/vllm/vllm-openai:qwen3_5
linux/amd64 | docker.io | 20.93GB | 2026-02-25 01:18 | 1581 views

docker.io/vllm/vllm-openai:qwen3_5-x86_64-cu129
linux/amd64 | docker.io | 20.93GB | 2026-02-27 00:33 | 658 views
452

docker.io/vllm/vllm-openai:v0.16.0
linux/amd64 | docker.io | 20.37GB | 2026-02-28 01:04 | 848 views

docker.io/vllm/vllm-openai:v0.16.0-cu130
linux/amd64 | docker.io | 19.01GB | 2026-02-28 02:55 | 571 views
431

docker.io/vllm/vllm-openai:v0.5.1
linux/amd64 | docker.io | 10.40GB | 2026-03-06 01:01 | 240 views

docker.io/vllm/vllm-openai:v0.4.0.post1
linux/amd64 | docker.io | 9.88GB | 2026-03-06 01:29 | 311 views

docker.io/vllm/vllm-openai:cu130-nightly
linux/amd64 | docker.io | 19.55GB | 2026-03-06 02:04 | 732 views
386

docker.io/vllm/vllm-openai:v0.17.0
linux/amd64 | docker.io | 20.75GB | 2026-03-08 02:27 | 1657 views

docker.io/vllm/vllm-openai:v0.17.0-cu130
linux/amd64 | docker.io | 19.55GB | 2026-03-10 01:39 | 438 views

docker.io/vllm/vllm-openai:v0.17.0
linux/arm64 | docker.io | 21.50GB | 2026-03-11 01:42 | 484 views

docker.io/vllm/vllm-openai:v0.4.3
linux/amd64 | docker.io | 7.86GB | 2026-03-11 02:04 | 266 views

docker.io/vllm/vllm-openai:v0.13.0
linux/arm64 | docker.io | 17.98GB | 2026-03-11 03:52 | 210 views

docker.io/vllm/vllm-openai:v0.17.1-cu130
linux/amd64 | docker.io | 19.55GB | 2026-03-13 02:22 | 782 views

docker.io/vllm/vllm-openai:v0.17.1
linux/amd64 | docker.io | 20.75GB | 2026-03-14 02:34 | 832 views

docker.io/vllm/vllm-openai:v0.14.0
linux/arm64 | docker.io | 20.19GB | 2026-03-17 02:20 | 257 views

docker.io/vllm/vllm-openai:v0.18.0
linux/amd64 | docker.io | 22.40GB | 2026-03-23 01:43 | 906 views

docker.io/vllm/vllm-openai:v0.18.0-cu130
linux/amd64 | docker.io | 19.65GB | 2026-03-24 01:29 | 674 views

docker.io/vllm/vllm-openai-cpu:latest-x86_64
linux/amd64 | docker.io | 3.48GB | 2026-03-28 10:31 | 278 views

docker.io/vllm/vllm-openai-cpu:latest-arm64
linux/arm64 | docker.io | 2.23GB | 2026-03-28 10:34 | 201 views

docker.io/vllm/vllm-openai-rocm:latest
linux/amd64 | docker.io | 24.49GB | 2026-03-30 00:46 | 319 views

docker.io/vllm/vllm-openai:latest-cu130
linux/amd64 | docker.io | 19.65GB | 2026-03-31 00:47 | 764 views

docker.io/vllm/vllm-openai:v0.18.1-cu130
linux/amd64 | docker.io | 19.66GB | 2026-04-02 00:39 | 273 views

docker.io/vllm/vllm-openai:v0.18.1
linux/amd64 | docker.io | 22.41GB | 2026-04-02 01:12 | 424 views

docker.io/vllm/vllm-openai:v0.19.0-ubuntu2404
linux/amd64 | docker.io | 22.35GB | 2026-04-04 02:18 | 449 views

docker.io/vllm/vllm-openai:gemma4-cu130
linux/amd64 | docker.io | 21.18GB | 2026-04-04 03:13 | 1327 views

docker.io/vllm/vllm-openai:v0.19.0
linux/amd64 | docker.io | 22.41GB | 2026-04-04 03:47 | 964 views

docker.io/vllm/vllm-openai-cpu:v0.19.0
linux/amd64 | docker.io | 3.49GB | 2026-04-06 07:48 | 381 views

docker.io/vllm/vllm-openai-cpu:v0.19.0
linux/arm64 | docker.io | 2.25GB | 2026-04-06 07:51 | 222 views

docker.io/vllm/vllm-openai:v0.19.0-cu130-ubuntu2404
linux/amd64 | docker.io | 19.74GB | 2026-04-08 03:21 | 462 views

docker.io/vllm/vllm-openai:gemma4
linux/amd64 | docker.io | 23.92GB | 2026-04-09 00:39 | 595 views
315

docker.io/vllm/vllm-openai:qwen3_5-cu130
linux/amd64 | docker.io | 19.60GB | 2026-04-11 00:51 | 368 views

docker.io/vllm/vllm-openai:v0.19.0-cu130
linux/amd64 | docker.io | 19.66GB | 2026-04-12 00:29 | 305 views

docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130
linux/amd64 | docker.io | 19.72GB | 2026-04-12 01:06 | 375 views

docker.io/vllm/vllm-openai:gemma4-cu130
linux/arm64 | docker.io | 20.43GB | 2026-04-13 01:49 | 427 views

docker.io/vllm/vllm-openai:v0.19.0-cu130
linux/arm64 | docker.io | 20.37GB | 2026-04-13 02:18 | 327 views

docker.io/vllm/vllm-openai:gemma4
linux/arm64 | docker.io | 23.20GB | 2026-04-13 03:21 | 284 views

docker.io/vllm/vllm-openai:cu130-nightly-aarch64
linux/arm64 | docker.io | 21.48GB | 2026-04-14 00:35 | 310 views

docker.io/vllm/vllm-openai:minimax27
linux/amd64 | docker.io | 24.06GB | 2026-04-14 01:09 | 334 views

docker.io/vllm/vllm-openai:glm51
linux/amd64 | docker.io | 22.46GB | 2026-04-16 03:22 | 354 views

docker.io/vllm/vllm-openai:glm51
linux/arm64 | docker.io | 23.19GB | 2026-04-17 01:16 | 151 views

docker.io/vllm/vllm-openai:glm51-cu130
linux/amd64 | docker.io | 19.71GB | 2026-04-17 01:41 | 320 views

docker.io/vllm/vllm-openai-cpu:v0.19.0-x86_64
linux/amd64 | docker.io | 3.49GB | 2026-04-17 23:45 | 219 views
140

docker.io/vllm/vllm-openai:v0.19.1-cu130
linux/amd64 | docker.io | 19.60GB | 2026-04-20 00:28 | 958 views
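
The tags in this list follow a loose naming pattern (a version or model label, optionally followed by an architecture, a CUDA variant such as `cu130`, or an Ubuntu release). The sketch below is a heuristic illustration only, not an official vLLM or mirror-site convention. `describe_tag` and `mirror_ref` are hypothetical helper names, and the mirror prefix is copied from the mirror address shown for `gemma4-0409-x86_64-cu130`; whether the same prefix applies to every tag listed here is an assumption.

```python
import re

# Prefix taken from the mirror address shown for this image. Assumed, not
# verified, to be the same for the other tags in the list.
MIRROR_PREFIX = "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/"


def mirror_ref(source_ref: str) -> str:
    """Prepend the mirror prefix to a full docker.io image reference."""
    return MIRROR_PREFIX + source_ref


def describe_tag(tag: str) -> dict:
    """Heuristically split a tag into label, arch, CUDA and Ubuntu parts."""
    info = {"label": [], "arch": None, "cuda": None, "ubuntu": None}
    # "x86_64" uses an underscore, so splitting on "-" keeps it intact.
    for part in tag.split("-"):
        if re.fullmatch(r"cu\d+", part):
            info["cuda"] = part
        elif part in ("x86_64", "aarch64", "arm64"):
            info["arch"] = part
        elif re.fullmatch(r"ubuntu\d+", part):
            info["ubuntu"] = part
        else:
            info["label"].append(part)
    info["label"] = "-".join(info["label"])
    return info


if __name__ == "__main__":
    ref = "docker.io/vllm/vllm-openai:gemma4-0409-x86_64-cu130"
    print(mirror_ref(ref))
    print(describe_tag(ref.rsplit(":", 1)[1]))
    print(describe_tag("v0.19.0-cu130-ubuntu2404"))
```

Tags that carry no architecture suffix (for example `v0.11.0`) appear under both `linux/amd64` and `linux/arm64` in the list, so the platform column remains the reliable source for architecture.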