广告图片

docker.io/gpustack/runner:cuda12.9-vllmv0.20.1 linux/amd64

docker.io/gpustack/runner:cuda12.9-vllmv0.20.1 - 国内下载镜像源 浏览次数:7

docker.io/gpustack/runner是一个用于在GPU环境中执行任务的容器镜像,通常集成了GPU驱动、运行时依赖及相关工具组件,支持在容器化场景下高效利用GPU资源运行计算任务(如机器学习训练、推理等GPU加速应用)。

源镜像 docker.io/gpustack/runner:cuda12.9-vllmv0.20.1
国内镜像 swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/gpustack/runner:cuda12.9-vllmv0.20.1
镜像ID sha256:811925dbebbb080875b2097f672bf2537956592251bd67439a11fb8565f06235
镜像TAG cuda12.9-vllmv0.20.1
大小 27.44GB
镜像源 docker.io
项目信息 Docker-Hub主页 🚀项目TAG 🚀
CMD
启动入口 tini --
工作目录 /
OS/平台 linux/amd64
浏览量 7 次
贡献者
镜像创建 2026-05-07T03:13:37.009171821Z
同步时间 2026-05-09 02:28
环境变量
PATH=/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin NVARCH=x86_64 NVIDIA_REQUIRE_CUDA=cuda>=12.9 brand=unknown,driver>=535,driver<536 brand=grid,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=vapps,driver>=535,driver<536 brand=vpc,driver>=535,driver<536 brand=vcs,driver>=535,driver<536 brand=vws,driver>=535,driver<536 brand=cloudgaming,driver>=535,driver<536 brand=unknown,driver>=550,driver<551 brand=grid,driver>=550,driver<551 brand=tesla,driver>=550,driver<551 brand=nvidia,driver>=550,driver<551 brand=quadro,driver>=550,driver<551 brand=quadrortx,driver>=550,driver<551 brand=nvidiartx,driver>=550,driver<551 brand=vapps,driver>=550,driver<551 brand=vpc,driver>=550,driver<551 brand=vcs,driver>=550,driver<551 brand=vws,driver>=550,driver<551 brand=cloudgaming,driver>=550,driver<551 brand=unknown,driver>=560,driver<561 brand=grid,driver>=560,driver<561 brand=tesla,driver>=560,driver<561 brand=nvidia,driver>=560,driver<561 brand=quadro,driver>=560,driver<561 brand=quadrortx,driver>=560,driver<561 brand=nvidiartx,driver>=560,driver<561 brand=vapps,driver>=560,driver<561 brand=vpc,driver>=560,driver<561 brand=vcs,driver>=560,driver<561 brand=vws,driver>=560,driver<561 brand=cloudgaming,driver>=560,driver<561 brand=unknown,driver>=565,driver<566 brand=grid,driver>=565,driver<566 brand=tesla,driver>=565,driver<566 brand=nvidia,driver>=565,driver<566 brand=quadro,driver>=565,driver<566 brand=quadrortx,driver>=565,driver<566 brand=nvidiartx,driver>=565,driver<566 brand=vapps,driver>=565,driver<566 brand=vpc,driver>=565,driver<566 brand=vcs,driver>=565,driver<566 brand=vws,driver>=565,driver<566 brand=cloudgaming,driver>=565,driver<566 brand=unknown,driver>=570,driver<571 brand=grid,driver>=570,driver<571 brand=tesla,driver>=570,driver<571 brand=nvidia,driver>=570,driver<571 brand=quadro,driver>=570,driver<571 brand=quadrortx,driver>=570,driver<571 brand=nvidiartx,driver>=570,driver<571 brand=vapps,driver>=570,driver<571 brand=vpc,driver>=570,driver<571 brand=vcs,driver>=570,driver<571 brand=vws,driver>=570,driver<571 brand=cloudgaming,driver>=570,driver<571 NV_CUDA_CUDART_VERSION=12.9.79-1 CUDA_VERSION=12.9.1 LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64 NVIDIA_VISIBLE_DEVICES=all NVIDIA_DRIVER_CAPABILITIES=compute,utility DEBIAN_FRONTEND=noninteractive UV_HTTP_TIMEOUT=500 UV_INDEX_STRATEGY=unsafe-best-match UV_LINK_MODE=copy VLLM_ENABLE_CUDA_COMPATIBILITY=0 TORCH_CUDA_ARCH_LIST=7.5 8.0 8.6 8.9 9.0 10.0 12.0 VLLM_USAGE_SOURCE=production-docker-image LANG=en_US.UTF-8 LANGUAGE=en_US:en LC_ALL=en_US.UTF-8 PYTHON_VERSION=3.12 UV_SYSTEM_PYTHON=1 UV_PRERELEASE=allow UV_SKIP_WHEEL_FILENAME_CHECK=1 VLLM_VERSION=v0.20.1 VLLM_TORCH_VERSION=2.12.0 VLLM_TORCH_CUDA_VERSION=12.9.1 RAY_EXPERIMENTAL_NOSET_CUDA_VISIBLE_DEVICES=1 SAFETENSORS_FAST_GPU=1 RUNAI_STREAMER_MEMORY_LIMIT=0 RUNAI_STREAMER_LOG_TO_STDERR=1 RUNAI_STREAMER_LOG_LEVEL=INFO
镜像标签
NVIDIA CORPORATION <cudatools@nvidia.com>: maintainer ubuntu: org.opencontainers.image.ref.name 24.04: org.opencontainers.image.version

Docker拉取命令

docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/gpustack/runner:cuda12.9-vllmv0.20.1
docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/gpustack/runner:cuda12.9-vllmv0.20.1  docker.io/gpustack/runner:cuda12.9-vllmv0.20.1

Containerd拉取命令

ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/gpustack/runner:cuda12.9-vllmv0.20.1
ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/gpustack/runner:cuda12.9-vllmv0.20.1  docker.io/gpustack/runner:cuda12.9-vllmv0.20.1

Shell快速替换命令

sed -i 's#gpustack/runner:cuda12.9-vllmv0.20.1#swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/gpustack/runner:cuda12.9-vllmv0.20.1#' deployment.yaml

Ansible快速分发-Docker

#ansible k8s -m shell -a 'docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/gpustack/runner:cuda12.9-vllmv0.20.1 && docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/gpustack/runner:cuda12.9-vllmv0.20.1  docker.io/gpustack/runner:cuda12.9-vllmv0.20.1'

Ansible快速分发-Containerd

#ansible k8s -m shell -a 'ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/gpustack/runner:cuda12.9-vllmv0.20.1 && ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/gpustack/runner:cuda12.9-vllmv0.20.1  docker.io/gpustack/runner:cuda12.9-vllmv0.20.1'

镜像构建历史


# 2026-05-07 11:13:37  0.00B 配置容器启动时运行的命令
ENTRYPOINT ["tini" "--"]
                        
# 2026-05-07 11:13:37  0.00B 设置工作目录为/
WORKDIR /
                        
# 2026-05-07 11:13:37  0.00B 设置环境变量 RAY_EXPERIMENTAL_NOSET_CUDA_VISIBLE_DEVICES SAFETENSORS_FAST_GPU RUNAI_STREAMER_MEMORY_LIMIT RUNAI_STREAMER_LOG_TO_STDERR RUNAI_STREAMER_LOG_LEVEL
ENV RAY_EXPERIMENTAL_NOSET_CUDA_VISIBLE_DEVICES=1 SAFETENSORS_FAST_GPU=1 RUNAI_STREAMER_MEMORY_LIMIT=0 RUNAI_STREAMER_LOG_TO_STDERR=1 RUNAI_STREAMER_LOG_LEVEL=INFO
                        
# 2026-05-07 11:13:37  216.87KB 执行命令并创建新的镜像层
RUN |7 TARGETPLATFORM=linux/amd64 TARGETOS=linux TARGETARCH=amd64 PYTHON_VERSION=3.12 VLLM_VERSION=v0.20.1 VLLM_TORCH_VERSION=2.12.0 VLLM_TORCH_CUDA_VERSION=12.9.1 /bin/bash -eo pipefail -c     # Patch

    tree -hs /workspace/patches
    pushd $(pip show vllm | grep Location: | cut -d" " -f 2) \
        && patch -p1 < /workspace/patches/vllm/*.patch
    if pip show vllm_omni > /dev/null 2>&1; then \
        pushd $(pip show vllm_omni | grep Location: | cut -d" " -f 2) \
            && patch -p1 < /workspace/patches/vllm_omni/*.patch; \
    fi
 # buildkit
                        
# 2026-05-07 11:13:36  0.00B 执行命令并创建新的镜像层
RUN |7 TARGETPLATFORM=linux/amd64 TARGETOS=linux TARGETARCH=amd64 PYTHON_VERSION=3.12 VLLM_VERSION=v0.20.1 VLLM_TORCH_VERSION=2.12.0 VLLM_TORCH_CUDA_VERSION=12.9.1 /bin/bash -eo pipefail -c     # Postprocess

    # Review
    uv pip tree \
        --package vllm \
        --package vllm-omni \
        --package flashinfer-python \
        --package torch \
        --package triton \
        --package deep-gemm \
        --package deep-ep \
        --package lmcache \
        --package mooncake-transfer-engine \
        --package ray
 # buildkit
                        
# 2026-05-07 11:13:35  0.00B 执行命令并创建新的镜像层
RUN |7 TARGETPLATFORM=linux/amd64 TARGETOS=linux TARGETARCH=amd64 PYTHON_VERSION=3.12 VLLM_VERSION=v0.20.1 VLLM_TORCH_VERSION=2.12.0 VLLM_TORCH_CUDA_VERSION=12.9.1 /bin/bash -eo pipefail -c     # Omni

    if [[ ! -d /omni/workspace ]]; then
        echo "Skipping vLLM Omni installation for ${TARGETARCH}..."
        exit 0
    fi

    # Install
    uv pip install --no-build-isolation \
        /omni/workspace/*.whl

    # Dependencies
    cat <<EOT >/tmp/requirements.txt
onnxruntime
sox
EOT
    uv pip install \
        -r /tmp/requirements.txt

    # Cleanup
    rm -rf /var/tmp/* \
        && rm -rf /tmp/*
 # buildkit
                        
# 2026-05-07 11:13:03  460.38MB 执行命令并创建新的镜像层
RUN |7 TARGETPLATFORM=linux/amd64 TARGETOS=linux TARGETARCH=amd64 PYTHON_VERSION=3.12 VLLM_VERSION=v0.20.1 VLLM_TORCH_VERSION=2.12.0 VLLM_TORCH_CUDA_VERSION=12.9.1 /bin/bash -eo pipefail -c     # Ray

    # Install Ray Client and Default
    RAY_VERSION=2.55.1
    cat <<EOT >/tmp/requirements.txt
ray[cgraph]==${RAY_VERSION}
ray[client]==${RAY_VERSION}
ray[default]==${RAY_VERSION}
EOT
    uv pip install \
        -r /tmp/requirements.txt

    # Cleanup
    rm -rf /var/tmp/* \
        && rm -rf /tmp/*
 # buildkit
                        
# 2026-05-07 11:04:43  0.00B 定义构建参数
ARG TARGETARCH=amd64
                        
# 2026-05-07 11:04:43  0.00B 定义构建参数
ARG TARGETOS=linux
                        
# 2026-05-07 11:04:43  0.00B 定义构建参数
ARG TARGETPLATFORM=linux/amd64
                        
# 2026-05-07 11:04:43  0.00B 
SHELL [/bin/bash -eo pipefail -c]
                        
# 2026-05-07 11:04:43  0.00B 设置环境变量 VLLM_VERSION VLLM_TORCH_VERSION VLLM_TORCH_CUDA_VERSION
ENV VLLM_VERSION=v0.20.1 VLLM_TORCH_VERSION=2.12.0 VLLM_TORCH_CUDA_VERSION=12.9.1
                        
# 2026-05-07 11:04:43  0.00B 定义构建参数
ARG VLLM_TORCH_CUDA_VERSION=12.9.1
                        
# 2026-05-07 11:04:43  0.00B 定义构建参数
ARG VLLM_TORCH_VERSION=2.12.0
                        
# 2026-05-07 11:04:43  0.00B 定义构建参数
ARG VLLM_VERSION=v0.20.1
                        
# 2026-05-07 11:04:43  0.00B 设置环境变量 UV_SYSTEM_PYTHON UV_PRERELEASE UV_SKIP_WHEEL_FILENAME_CHECK
ENV UV_SYSTEM_PYTHON=1 UV_PRERELEASE=allow UV_SKIP_WHEEL_FILENAME_CHECK=1
                        
# 2026-05-07 11:04:43  14.12KB 执行命令并创建新的镜像层
RUN |4 TARGETPLATFORM=linux/amd64 TARGETOS=linux TARGETARCH=amd64 PYTHON_VERSION=3.12 /bin/bash -eo pipefail -c     # Python

    # Update alternatives
    if [[ -f /etc/alternatives/python ]]; then update-alternatives --remove-all python; fi; update-alternatives --install /usr/bin/python python /usr/bin/python${PYTHON_VERSION} 1

 # buildkit
                        
# 2026-05-07 11:04:43  0.00B 设置环境变量 PYTHON_VERSION
ENV PYTHON_VERSION=3.12
                        
# 2026-05-07 11:04:43  0.00B 定义构建参数
ARG PYTHON_VERSION=3.12
                        
# 2026-05-07 11:04:43  312.97MB 执行命令并创建新的镜像层
RUN |3 TARGETPLATFORM=linux/amd64 TARGETOS=linux TARGETARCH=amd64 /bin/bash -eo pipefail -c     # Tools

    # Refresh
    apt-get update -y && apt-get install -y --no-install-recommends \
        software-properties-common apt-transport-https \
        ca-certificates gnupg2 lsb-release gnupg-agent \
      && apt-get update -y

    # Install
    apt-get install -y --no-install-recommends \
        ca-certificates build-essential binutils bash openssl \
        curl wget aria2 \
        git git-lfs \
        unzip xz-utils \
        tzdata locales \
        iproute2 iputils-ping ifstat net-tools dnsutils pciutils ipmitool \
        rdma-core rdmacm-utils infiniband-diags \
        procps sysstat htop \
        tini vim jq bc tree

    # Update locale
    localedef -i en_US -c -f UTF-8 -A /usr/share/locale/locale.alias en_US.UTF-8

    # Update timezone
    rm -f /etc/localtime \
        && ln -sf /usr/share/zoneinfo/Asia/Shanghai /etc/localtime \
        && echo "Asia/Shanghai" > /etc/timezone \
        && dpkg-reconfigure --frontend noninteractive tzdata

    # Cleanup
    rm -rf /var/tmp/* \
        && rm -rf /tmp/* \
        && rm -rf /var/cache/apt
 # buildkit
                        
# 2026-05-07 11:04:43  0.00B 设置环境变量 DEBIAN_FRONTEND LANG LANGUAGE LC_ALL
ENV DEBIAN_FRONTEND=noninteractive LANG=en_US.UTF-8 LANGUAGE=en_US:en LC_ALL=en_US.UTF-8
                        
# 2026-05-07 11:04:43  0.00B 定义构建参数
ARG TARGETARCH=amd64
                        
# 2026-05-07 11:04:43  0.00B 定义构建参数
ARG TARGETOS=linux
                        
# 2026-05-07 11:04:43  0.00B 定义构建参数
ARG TARGETPLATFORM=linux/amd64
                        
# 2026-05-07 11:04:43  0.00B 
SHELL [/bin/bash -eo pipefail -c]
                        
# 2026-05-04 17:21:48  0.00B 配置容器启动时运行的命令
ENTRYPOINT ["vllm" "serve"]
                        
# 2026-05-04 17:21:48  0.00B 设置环境变量 VLLM_USAGE_SOURCE
ENV VLLM_USAGE_SOURCE=production-docker-image
                        
# 2026-05-04 17:21:48  994.34MB 执行命令并创建新的镜像层
RUN |8 TARGETPLATFORM=linux/amd64 INSTALL_KV_CONNECTORS=true CUDA_VERSION=12.9.1 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= torch_cuda_arch_list=7.5 8.0 8.6 8.9 9.0 10.0 12.0 /bin/sh -c CUDA_MAJOR="${CUDA_VERSION%%.*}";     CUDA_VERSION_DASH=$(echo $CUDA_VERSION | cut -d. -f1,2 | tr '.' '-');     CUDA_HOME=/usr/local/cuda;     BUILD_PKGS="libcusparse-dev-${CUDA_VERSION_DASH}                 libcublas-dev-${CUDA_VERSION_DASH}                 libcusolver-dev-${CUDA_VERSION_DASH}";     if [ "$INSTALL_KV_CONNECTORS" = "true" ]; then         if [ "$CUDA_MAJOR" -ge 13 ]; then             uv pip install --system nixl-cu13;         fi;         uv pip install --system -r /tmp/kv_connectors.txt --no-build || (             apt-get update -y &&             apt-get install -y --no-install-recommends --allow-change-held-packages ${BUILD_PKGS} &&             uv pip install --system -r /tmp/kv_connectors.txt --no-build-isolation &&             apt-get purge -y ${BUILD_PKGS} &&             rm -rf /var/lib/apt/lists/*         );     fi # buildkit
                        
# 2026-05-04 17:21:41  0.00B 设置环境变量 TORCH_CUDA_ARCH_LIST
ENV TORCH_CUDA_ARCH_LIST=7.5 8.0 8.6 8.9 9.0 10.0 12.0
                        
# 2026-05-04 17:21:41  0.00B 定义构建参数
ARG torch_cuda_arch_list=7.5 8.0 8.6 8.9 9.0 10.0 11.0 12.0+PTX
                        
# 2026-05-04 17:21:41  0.00B 设置环境变量 UV_HTTP_TIMEOUT
ENV UV_HTTP_TIMEOUT=500
                        
# 2026-05-04 17:21:41  0.00B 定义构建参数
ARG PIP_EXTRA_INDEX_URL UV_EXTRA_INDEX_URL
                        
# 2026-05-04 17:21:41  0.00B 定义构建参数
ARG PIP_INDEX_URL UV_INDEX_URL
                        
# 2026-05-04 17:21:41  0.00B 定义构建参数
ARG CUDA_VERSION
                        
# 2026-05-04 17:21:41  0.00B 定义构建参数
ARG INSTALL_KV_CONNECTORS=false
                        
# 2026-05-04 17:21:41  0.00B 定义构建参数
ARG TARGETPLATFORM
                        
# 2026-05-04 17:21:41  35.09KB 复制新文件或目录到容器中
COPY ./vllm/collect_env.py . # buildkit
                        
# 2026-05-04 17:21:41  986.54KB 复制新文件或目录到容器中
COPY benchmarks benchmarks # buildkit
                        
# 2026-05-04 17:21:41  1.25MB 复制新文件或目录到容器中
COPY examples examples # buildkit
                        
# 2026-05-04 17:21:41  0.00B 设置环境变量 LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64
                        
# 2026-05-04 17:21:41  40.47MB 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.8.post1 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu24_04 TARGETPLATFORM=linux/amd64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c uv pip install --system ep_kernels/dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.') # buildkit
                        
# 2026-05-04 17:21:40  0.00B 设置环境变量 LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/cuda/lib64:/usr/local/cuda/lib64
                        
# 2026-05-04 17:21:40  0.00B 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.8.post1 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu24_04 TARGETPLATFORM=linux/amd64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c . /etc/environment && uv pip list # buildkit
                        
# 2026-05-04 17:21:40  1.28GB 执行命令并创建新的镜像层
RUN |22 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.8.post1 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu24_04 TARGETPLATFORM=linux/amd64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 PIP_INDEX_URL= UV_INDEX_URL= PIP_EXTRA_INDEX_URL= UV_EXTRA_INDEX_URL= PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl PIP_KEYRING_PROVIDER=disabled UV_KEYRING_PROVIDER=disabled PYTORCH_NIGHTLY= /bin/sh -c if [ "${PYTORCH_NIGHTLY}" = "1" ]; then         echo "Installing torch nightly..."         && uv pip install --system $(cat torch_lib_versions.txt | xargs) --pre         --index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/nightly/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.')         && echo "Installing vLLM..."         && uv pip install --system dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/nightly/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.');     else         echo "Installing vLLM..."         && uv pip install --system dist/*.whl --verbose         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.');     fi # buildkit
                        
# 2026-05-04 17:18:37  71.00B 复制新文件或目录到容器中
COPY /workspace/torch_lib_versions.txt torch_lib_versions.txt # buildkit
                        
# 2026-05-04 17:18:37  0.00B 定义构建参数
ARG PYTORCH_NIGHTLY
                        
# 2026-05-04 17:18:37  0.00B 定义构建参数
ARG PIP_KEYRING_PROVIDER UV_KEYRING_PROVIDER
                        
# 2026-05-04 17:18:37  0.00B 定义构建参数
ARG PYTORCH_CUDA_INDEX_BASE_URL
                        
# 2026-05-04 17:18:37  0.00B 定义构建参数
ARG PIP_EXTRA_INDEX_URL UV_EXTRA_INDEX_URL
                        
# 2026-05-04 17:18:37  0.00B 定义构建参数
ARG PIP_INDEX_URL UV_INDEX_URL
                        
# 2026-05-04 17:18:37  388.23MB 执行命令并创建新的镜像层
RUN |14 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.8.post1 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu24_04 TARGETPLATFORM=linux/amd64 BITSANDBYTES_VERSION_X86=0.46.1 BITSANDBYTES_VERSION_ARM64=0.42.0 TIMM_VERSION=>=1.0.17 RUNAI_MODEL_STREAMER_VERSION=>=0.15.7 /bin/sh -c if [ "$TARGETPLATFORM" = "linux/arm64" ]; then         BITSANDBYTES_VERSION="${BITSANDBYTES_VERSION_ARM64}";     else         BITSANDBYTES_VERSION="${BITSANDBYTES_VERSION_X86}";     fi;     uv pip install --system accelerate modelscope         "bitsandbytes>=${BITSANDBYTES_VERSION}" "timm${TIMM_VERSION}" "runai-model-streamer[s3,gcs,azure]${RUNAI_MODEL_STREAMER_VERSION}" # buildkit
                        
# 2026-05-04 17:18:33  0.00B 定义构建参数
ARG RUNAI_MODEL_STREAMER_VERSION=>=0.15.7
                        
# 2026-05-04 17:18:33  0.00B 定义构建参数
ARG TIMM_VERSION=>=1.0.17
                        
# 2026-05-04 17:18:33  0.00B 定义构建参数
ARG BITSANDBYTES_VERSION_ARM64=0.42.0
                        
# 2026-05-04 17:18:33  0.00B 定义构建参数
ARG BITSANDBYTES_VERSION_X86=0.46.1
                        
# 2026-05-04 17:18:33  2.34MB 执行命令并创建新的镜像层
RUN |10 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.8.post1 GDRCOPY_CUDA_VERSION=12.8 GDRCOPY_OS_VERSION=Ubuntu24_04 TARGETPLATFORM=linux/amd64 /bin/sh -c set -eux;     case "${TARGETPLATFORM}" in       linux/arm64) UUARCH="aarch64" ;;       linux/amd64) UUARCH="x64" ;;       *) echo "Unsupported TARGETPLATFORM: ${TARGETPLATFORM}" >&2; exit 1 ;;     esac;     /tmp/install_gdrcopy.sh "${GDRCOPY_OS_VERSION}" "${GDRCOPY_CUDA_VERSION}" "${UUARCH}" &&     rm /tmp/install_gdrcopy.sh # buildkit
                        
# 2026-05-04 17:18:28  1.44KB 复制新文件或目录到容器中
COPY tools/install_gdrcopy.sh /tmp/install_gdrcopy.sh # buildkit
                        
# 2026-05-04 17:18:28  0.00B 定义构建参数
ARG TARGETPLATFORM
                        
# 2026-05-04 17:18:28  0.00B 定义构建参数
ARG GDRCOPY_OS_VERSION=Ubuntu22_04
                        
# 2026-05-04 17:18:28  0.00B 定义构建参数
ARG GDRCOPY_CUDA_VERSION=12.8
                        
# 2026-05-04 17:18:28  9.40GB 执行命令并创建新的镜像层
RUN |7 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl FLASHINFER_VERSION=0.6.8.post1 /bin/sh -c uv pip install --system flashinfer-jit-cache==${FLASHINFER_VERSION}         --extra-index-url https://flashinfer.ai/whl/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.')     && flashinfer show-config     && flashinfer download-cubin # buildkit
                        
# 2026-05-04 17:08:09  0.00B 定义构建参数
ARG FLASHINFER_VERSION=0.6.8.post1
                        
# 2026-05-04 17:08:09  10.16GB 执行命令并创建新的镜像层
RUN |6 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py PYTORCH_CUDA_INDEX_BASE_URL=https://download.pytorch.org/whl /bin/sh -c uv pip install --system -r /tmp/requirements-cuda.txt         --extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.') &&     rm /tmp/requirements-cuda.txt /tmp/common.txt # buildkit
                        
# 2026-05-04 17:07:45  821.00B 复制新文件或目录到容器中
COPY requirements/cuda.txt /tmp/requirements-cuda.txt # buildkit
                        
# 2026-05-04 17:07:45  2.94KB 复制新文件或目录到容器中
COPY requirements/common.txt /tmp/common.txt # buildkit
                        
# 2026-05-04 17:07:45  0.00B 定义构建参数
ARG PYTORCH_CUDA_INDEX_BASE_URL
                        
# 2026-05-04 17:07:45  0.00B 设置环境变量 VLLM_ENABLE_CUDA_COMPATIBILITY
ENV VLLM_ENABLE_CUDA_COMPATIBILITY=0
                        
# 2026-05-04 17:07:45  0.00B 设置环境变量 UV_LINK_MODE
ENV UV_LINK_MODE=copy
                        
# 2026-05-04 17:07:45  0.00B 设置环境变量 UV_INDEX_STRATEGY
ENV UV_INDEX_STRATEGY=unsafe-best-match
                        
# 2026-05-04 17:07:45  0.00B 设置环境变量 UV_HTTP_TIMEOUT
ENV UV_HTTP_TIMEOUT=500
                        
# 2026-05-04 17:07:45  86.40MB 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c python3 -m pip install uv # buildkit
                        
# 2026-05-04 17:07:43  2.69GB 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c CUDA_VERSION_DASH=$(echo $CUDA_VERSION | cut -d. -f1,2 | tr '.' '-') &&     CUDA_VERSION_SHORT=$(echo $CUDA_VERSION | cut -d. -f1,2) &&     apt-get update -y &&     apt-get install -y --no-install-recommends --allow-change-held-packages         cuda-nvcc-${CUDA_VERSION_DASH}         cuda-cudart-${CUDA_VERSION_DASH}         cuda-nvrtc-${CUDA_VERSION_DASH}         cuda-cuobjdump-${CUDA_VERSION_DASH}         libcurand-dev-${CUDA_VERSION_DASH}         libcublas-${CUDA_VERSION_DASH}         libnuma-dev         numactl &&     NCCL_VER=$(apt-cache madison libnccl-dev | grep "+cuda${CUDA_VERSION_SHORT}" | head -1 | awk -F'|' '{gsub(/^ +| +$/, "", $2); print $2}') &&     apt-get install -y --no-install-recommends --allow-change-held-packages libnccl-dev=${NCCL_VER} libnccl2=${NCCL_VER} &&     rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2026-05-04 17:07:13  1.21GB 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c apt-get update -y     && apt-get install -y --no-install-recommends         curl         sudo         ffmpeg         libsm6         libxext6         libgl1         libibverbs-dev         build-essential         libssl-dev         libffi-dev         zlib1g-dev         libbz2-dev         libreadline-dev         libsqlite3-dev         libncurses-dev         liblzma-dev         libgdbm-dev         uuid-dev         tk-dev     && PYTHON_MAJOR_MINOR=${PYTHON_VERSION}     && PYTHON_FULL_VERSION=$(curl -s https://www.python.org/ftp/python/         | grep -oE "${PYTHON_MAJOR_MINOR}\.[0-9]+"         | sort -t. -k3 -n | uniq         | tail -1)     && echo "Building Python ${PYTHON_FULL_VERSION} from source..."     && curl -fSL https://www.python.org/ftp/python/${PYTHON_FULL_VERSION}/Python-${PYTHON_FULL_VERSION}.tgz -o /tmp/python.tgz     && tar -xzf /tmp/python.tgz -C /tmp     && cd /tmp/Python-${PYTHON_FULL_VERSION}     && ./configure --enable-optimizations --with-ensurepip=install --prefix=/usr/local     && make -j$(nproc)     && make install     && cd /     && rm -rf /tmp/python.tgz /tmp/Python-${PYTHON_FULL_VERSION}     && ln -sf /usr/local/bin/python${PYTHON_MAJOR_MINOR} /usr/bin/python3     && ln -sf /usr/local/bin/python${PYTHON_MAJOR_MINOR}-config /usr/bin/python3-config     && ln -sf /usr/local/bin/pip${PYTHON_MAJOR_MINOR} /usr/bin/pip     && rm -rf /var/lib/apt/lists/*     && python3 --version && python3 -m pip --version # buildkit
                        
# 2026-05-04 17:03:50  136.00B 执行命令并创建新的镜像层
RUN |5 CUDA_VERSION=12.9.1 PYTHON_VERSION=3.12 DEADSNAKES_MIRROR_URL= DEADSNAKES_GPGKEY_URL= GET_PIP_URL=https://bootstrap.pypa.io/get-pip.py /bin/sh -c PYTHON_VERSION_STR=$(echo ${PYTHON_VERSION} | sed 's/\.//g') &&     echo "export PYTHON_VERSION_STR=${PYTHON_VERSION_STR}" >> /etc/environment # buildkit
                        
# 2026-05-04 17:03:48  0.00B 设置工作目录为/vllm-workspace
WORKDIR /vllm-workspace
                        
# 2026-05-04 17:03:48  0.00B 设置环境变量 DEBIAN_FRONTEND
ENV DEBIAN_FRONTEND=noninteractive
                        
# 2026-05-04 17:03:48  0.00B 定义构建参数
ARG GET_PIP_URL
                        
# 2026-05-04 17:03:48  0.00B 定义构建参数
ARG DEADSNAKES_GPGKEY_URL
                        
# 2026-05-04 17:03:48  0.00B 定义构建参数
ARG DEADSNAKES_MIRROR_URL
                        
# 2026-05-04 17:03:48  0.00B 定义构建参数
ARG PYTHON_VERSION
                        
# 2026-05-04 17:03:48  0.00B 定义构建参数
ARG CUDA_VERSION
                        
# 2025-07-19 04:11:18  0.00B 设置环境变量 NVIDIA_DRIVER_CAPABILITIES
ENV NVIDIA_DRIVER_CAPABILITIES=compute,utility
                        
# 2025-07-19 04:11:18  0.00B 设置环境变量 NVIDIA_VISIBLE_DEVICES
ENV NVIDIA_VISIBLE_DEVICES=all
                        
# 2025-07-19 04:11:18  17.29KB 复制新文件或目录到容器中
COPY NGC-DL-CONTAINER-LICENSE / # buildkit
                        
# 2025-07-19 04:11:18  0.00B 设置环境变量 LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/cuda/lib64
                        
# 2025-07-19 04:11:18  0.00B 设置环境变量 PATH
ENV PATH=/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
                        
# 2025-07-19 04:11:18  22.00B 执行命令并创建新的镜像层
RUN |1 TARGETARCH=amd64 /bin/sh -c echo "/usr/local/cuda/lib64" >> /etc/ld.so.conf.d/nvidia.conf # buildkit
                        
# 2025-07-19 04:11:18  315.62MB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=amd64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     cuda-cudart-12-9=${NV_CUDA_CUDART_VERSION}     cuda-compat-12-9     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-07-19 04:11:02  0.00B 设置环境变量 CUDA_VERSION
ENV CUDA_VERSION=12.9.1
                        
# 2025-07-19 04:11:02  10.26MB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=amd64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     gnupg2 curl ca-certificates &&     curl -fsSL https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/${NVARCH}/3bf863cc.pub | apt-key add - &&     echo "deb https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/${NVARCH} /" > /etc/apt/sources.list.d/cuda.list &&     apt-get purge --autoremove -y curl     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-07-19 04:11:02  0.00B 添加元数据标签
LABEL maintainer=NVIDIA CORPORATION <cudatools@nvidia.com>
                        
# 2025-07-19 04:11:02  0.00B 定义构建参数
ARG TARGETARCH
                        
# 2025-07-19 04:11:02  0.00B 设置环境变量 NV_CUDA_CUDART_VERSION
ENV NV_CUDA_CUDART_VERSION=12.9.79-1
                        
# 2025-07-19 04:11:02  0.00B 设置环境变量 NVIDIA_REQUIRE_CUDA brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand
ENV NVIDIA_REQUIRE_CUDA=cuda>=12.9 brand=unknown,driver>=535,driver<536 brand=grid,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=vapps,driver>=535,driver<536 brand=vpc,driver>=535,driver<536 brand=vcs,driver>=535,driver<536 brand=vws,driver>=535,driver<536 brand=cloudgaming,driver>=535,driver<536 brand=unknown,driver>=550,driver<551 brand=grid,driver>=550,driver<551 brand=tesla,driver>=550,driver<551 brand=nvidia,driver>=550,driver<551 brand=quadro,driver>=550,driver<551 brand=quadrortx,driver>=550,driver<551 brand=nvidiartx,driver>=550,driver<551 brand=vapps,driver>=550,driver<551 brand=vpc,driver>=550,driver<551 brand=vcs,driver>=550,driver<551 brand=vws,driver>=550,driver<551 brand=cloudgaming,driver>=550,driver<551 brand=unknown,driver>=560,driver<561 brand=grid,driver>=560,driver<561 brand=tesla,driver>=560,driver<561 brand=nvidia,driver>=560,driver<561 brand=quadro,driver>=560,driver<561 brand=quadrortx,driver>=560,driver<561 brand=nvidiartx,driver>=560,driver<561 brand=vapps,driver>=560,driver<561 brand=vpc,driver>=560,driver<561 brand=vcs,driver>=560,driver<561 brand=vws,driver>=560,driver<561 brand=cloudgaming,driver>=560,driver<561 brand=unknown,driver>=565,driver<566 brand=grid,driver>=565,driver<566 brand=tesla,driver>=565,driver<566 brand=nvidia,driver>=565,driver<566 brand=quadro,driver>=565,driver<566 brand=quadrortx,driver>=565,driver<566 brand=nvidiartx,driver>=565,driver<566 brand=vapps,driver>=565,driver<566 brand=vpc,driver>=565,driver<566 brand=vcs,driver>=565,driver<566 brand=vws,driver>=565,driver<566 brand=cloudgaming,driver>=565,driver<566 brand=unknown,driver>=570,driver<571 brand=grid,driver>=570,driver<571 brand=tesla,driver>=570,driver<571 brand=nvidia,driver>=570,driver<571 brand=quadro,driver>=570,driver<571 brand=quadrortx,driver>=570,driver<571 brand=nvidiartx,driver>=570,driver<571 brand=vapps,driver>=570,driver<571 brand=vpc,driver>=570,driver<571 brand=vcs,driver>=570,driver<571 brand=vws,driver>=570,driver<571 brand=cloudgaming,driver>=570,driver<571
                        
# 2025-07-19 04:11:02  0.00B 设置环境变量 NVARCH
ENV NVARCH=x86_64
                        
# 2025-07-15 00:40:23  0.00B 
/bin/sh -c #(nop)  CMD ["/bin/bash"]
                        
# 2025-07-15 00:40:22  78.12MB 
/bin/sh -c #(nop) ADD file:b4619a63cd7829e1338ddaa4995ca17003002dd54b0dfd675a6f54a2b69151a6 in / 
                        
# 2025-07-15 00:40:20  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.version=24.04
                        
# 2025-07-15 00:40:20  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.ref.name=ubuntu
                        
# 2025-07-15 00:40:20  0.00B 
/bin/sh -c #(nop)  ARG LAUNCHPAD_BUILD_ARCH
                        
# 2025-07-15 00:40:20  0.00B 
/bin/sh -c #(nop)  ARG RELEASE
                        
                    

镜像信息

{
    "Id": "sha256:811925dbebbb080875b2097f672bf2537956592251bd67439a11fb8565f06235",
    "RepoTags": [
        "gpustack/runner:cuda12.9-vllmv0.20.1",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/gpustack/runner:cuda12.9-vllmv0.20.1"
    ],
    "RepoDigests": [
        "gpustack/runner@sha256:29f4ab420485a6a4904cc620a3018d19f5e77b13669782f92661f4c6d40df109",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/gpustack/runner@sha256:265b8b46075063bb88ee511086b930cf868992960a7a16e3e647174bec0d33bf"
    ],
    "Parent": "",
    "Comment": "buildkit.dockerfile.v0",
    "Created": "2026-05-07T03:13:37.009171821Z",
    "Container": "",
    "ContainerConfig": null,
    "DockerVersion": "",
    "Author": "",
    "Config": {
        "Hostname": "",
        "Domainname": "",
        "User": "",
        "AttachStdin": false,
        "AttachStdout": false,
        "AttachStderr": false,
        "Tty": false,
        "OpenStdin": false,
        "StdinOnce": false,
        "Env": [
            "PATH=/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
            "NVARCH=x86_64",
            "NVIDIA_REQUIRE_CUDA=cuda\u003e=12.9 brand=unknown,driver\u003e=535,driver\u003c536 brand=grid,driver\u003e=535,driver\u003c536 brand=tesla,driver\u003e=535,driver\u003c536 brand=nvidia,driver\u003e=535,driver\u003c536 brand=quadro,driver\u003e=535,driver\u003c536 brand=quadrortx,driver\u003e=535,driver\u003c536 brand=nvidiartx,driver\u003e=535,driver\u003c536 brand=vapps,driver\u003e=535,driver\u003c536 brand=vpc,driver\u003e=535,driver\u003c536 brand=vcs,driver\u003e=535,driver\u003c536 brand=vws,driver\u003e=535,driver\u003c536 brand=cloudgaming,driver\u003e=535,driver\u003c536 brand=unknown,driver\u003e=550,driver\u003c551 brand=grid,driver\u003e=550,driver\u003c551 brand=tesla,driver\u003e=550,driver\u003c551 brand=nvidia,driver\u003e=550,driver\u003c551 brand=quadro,driver\u003e=550,driver\u003c551 brand=quadrortx,driver\u003e=550,driver\u003c551 brand=nvidiartx,driver\u003e=550,driver\u003c551 brand=vapps,driver\u003e=550,driver\u003c551 brand=vpc,driver\u003e=550,driver\u003c551 brand=vcs,driver\u003e=550,driver\u003c551 brand=vws,driver\u003e=550,driver\u003c551 brand=cloudgaming,driver\u003e=550,driver\u003c551 brand=unknown,driver\u003e=560,driver\u003c561 brand=grid,driver\u003e=560,driver\u003c561 brand=tesla,driver\u003e=560,driver\u003c561 brand=nvidia,driver\u003e=560,driver\u003c561 brand=quadro,driver\u003e=560,driver\u003c561 brand=quadrortx,driver\u003e=560,driver\u003c561 brand=nvidiartx,driver\u003e=560,driver\u003c561 brand=vapps,driver\u003e=560,driver\u003c561 brand=vpc,driver\u003e=560,driver\u003c561 brand=vcs,driver\u003e=560,driver\u003c561 brand=vws,driver\u003e=560,driver\u003c561 brand=cloudgaming,driver\u003e=560,driver\u003c561 brand=unknown,driver\u003e=565,driver\u003c566 brand=grid,driver\u003e=565,driver\u003c566 brand=tesla,driver\u003e=565,driver\u003c566 brand=nvidia,driver\u003e=565,driver\u003c566 brand=quadro,driver\u003e=565,driver\u003c566 brand=quadrortx,driver\u003e=565,driver\u003c566 brand=nvidiartx,driver\u003e=565,driver\u003c566 brand=vapps,driver\u003e=565,driver\u003c566 brand=vpc,driver\u003e=565,driver\u003c566 brand=vcs,driver\u003e=565,driver\u003c566 brand=vws,driver\u003e=565,driver\u003c566 brand=cloudgaming,driver\u003e=565,driver\u003c566 brand=unknown,driver\u003e=570,driver\u003c571 brand=grid,driver\u003e=570,driver\u003c571 brand=tesla,driver\u003e=570,driver\u003c571 brand=nvidia,driver\u003e=570,driver\u003c571 brand=quadro,driver\u003e=570,driver\u003c571 brand=quadrortx,driver\u003e=570,driver\u003c571 brand=nvidiartx,driver\u003e=570,driver\u003c571 brand=vapps,driver\u003e=570,driver\u003c571 brand=vpc,driver\u003e=570,driver\u003c571 brand=vcs,driver\u003e=570,driver\u003c571 brand=vws,driver\u003e=570,driver\u003c571 brand=cloudgaming,driver\u003e=570,driver\u003c571",
            "NV_CUDA_CUDART_VERSION=12.9.79-1",
            "CUDA_VERSION=12.9.1",
            "LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64",
            "NVIDIA_VISIBLE_DEVICES=all",
            "NVIDIA_DRIVER_CAPABILITIES=compute,utility",
            "DEBIAN_FRONTEND=noninteractive",
            "UV_HTTP_TIMEOUT=500",
            "UV_INDEX_STRATEGY=unsafe-best-match",
            "UV_LINK_MODE=copy",
            "VLLM_ENABLE_CUDA_COMPATIBILITY=0",
            "TORCH_CUDA_ARCH_LIST=7.5 8.0 8.6 8.9 9.0 10.0 12.0",
            "VLLM_USAGE_SOURCE=production-docker-image",
            "LANG=en_US.UTF-8",
            "LANGUAGE=en_US:en",
            "LC_ALL=en_US.UTF-8",
            "PYTHON_VERSION=3.12",
            "UV_SYSTEM_PYTHON=1",
            "UV_PRERELEASE=allow",
            "UV_SKIP_WHEEL_FILENAME_CHECK=1",
            "VLLM_VERSION=v0.20.1",
            "VLLM_TORCH_VERSION=2.12.0",
            "VLLM_TORCH_CUDA_VERSION=12.9.1",
            "RAY_EXPERIMENTAL_NOSET_CUDA_VISIBLE_DEVICES=1",
            "SAFETENSORS_FAST_GPU=1",
            "RUNAI_STREAMER_MEMORY_LIMIT=0",
            "RUNAI_STREAMER_LOG_TO_STDERR=1",
            "RUNAI_STREAMER_LOG_LEVEL=INFO"
        ],
        "Cmd": null,
        "Image": "",
        "Volumes": null,
        "WorkingDir": "/",
        "Entrypoint": [
            "tini",
            "--"
        ],
        "OnBuild": null,
        "Labels": {
            "maintainer": "NVIDIA CORPORATION \u003ccudatools@nvidia.com\u003e",
            "org.opencontainers.image.ref.name": "ubuntu",
            "org.opencontainers.image.version": "24.04"
        },
        "Shell": [
            "/bin/bash",
            "-eo",
            "pipefail",
            "-c"
        ]
    },
    "Architecture": "amd64",
    "Os": "linux",
    "Size": 27439456946,
    "GraphDriver": {
        "Data": {
            "LowerDir": "/var/lib/docker/overlay2/c81b4b0414801923c9d818bb9f9e583e5e52a79cd8d486c6e56106f91f6070d5/diff:/var/lib/docker/overlay2/f5d1e5efe466b73a7bfcd29da7c5eb177f86e45af8aef9e3b57eda883acd02e3/diff:/var/lib/docker/overlay2/53be8de4df7d6820db86017898aed40a02e0fe7a95a7b3ba91ee99178d77a6c8/diff:/var/lib/docker/overlay2/b032f5e9b463640ed0e37589045a2f6b9fe5416aa6c989f2391bcf55d156132a/diff:/var/lib/docker/overlay2/ba764829b89c3ec5d7645c42fc5a7bb99a5c6fb795449eb24591f222fd9c8d2b/diff:/var/lib/docker/overlay2/57e932de9ddce617d6eb092bd6b6ec19e2e14769a1d6ab6441ce3d1e708991c4/diff:/var/lib/docker/overlay2/148f47352ed18b1c58d708b9cc645419ecefeacc1a328e13ef403bc475c45a7d/diff:/var/lib/docker/overlay2/3ebbb1eaaa07b1625673e37edb02f916fc4e6e74020b9e65641de65b9c6f33b7/diff:/var/lib/docker/overlay2/ab462e90fc836042deb755d4a78241b794fd737106fdf2e3a64a4ef4b2a33a0a/diff:/var/lib/docker/overlay2/ddfb663231094ad8a2cf3bdc04b2e16efcfca3411d738bbd479a629b69475c3d/diff:/var/lib/docker/overlay2/e86cfb17ed0776d976d2c8373085edc70a79e2b5011fa822701ec6f270edebbc/diff:/var/lib/docker/overlay2/a72a1028455ba0a99767115313c2614432a7183c533f93ff263bae8a8daf28a1/diff:/var/lib/docker/overlay2/2b5330d1ae4d886d79de687ec282aed7bd8aa9b13a6ee6f97b981274496a31d0/diff:/var/lib/docker/overlay2/6ac32b2f1965c8ec3867dbbba2de9fafc235ac8425b725853ac531e23736ba52/diff:/var/lib/docker/overlay2/e9246af0a5e708b2cb0ce8d04fa77179c3dec4834039fff0bb54d7d50c048022/diff:/var/lib/docker/overlay2/f8224461a6c005b7b93ff4dde48cf8c817cdbe540cfcfb9043c80ec276e4fdfa/diff:/var/lib/docker/overlay2/3f54d860ab663ff6704b7782322b127ac876b4f28c4be290b359e8428ddaf68b/diff:/var/lib/docker/overlay2/378fc4a94969ae7303cf9701ccf8e3f6d7f275cea794a59ecdbf84a0edcdd2e4/diff:/var/lib/docker/overlay2/e57f98b28171b5a4e8e45c58719e12ab5c11126d2fa8e51f773e6ac4a1025f33/diff:/var/lib/docker/overlay2/9565debe1f30a8e2eec0dd7044b18323dcfd6575b163a7d3721f727e37154d44/diff:/var/lib/docker/overlay2/4916d579b6a2943e275813c32cc111b8d136fb912fd51995c323ab43a98a67e1/diff:/var/lib/docker/overlay2/a016b35dbab6d558303b50e7ced7b04c930e8a1f71b6d9aa557266f179aea6d2/diff:/var/lib/docker/overlay2/28e58f2cd499c02c6ccd38cb871c22d27abed8fb420665c3e1a9aa3370e051ef/diff:/var/lib/docker/overlay2/3230403b0eb1c4f1a0abdb84449500bb80ab02571d4ad3305deee562496caa7a/diff:/var/lib/docker/overlay2/6eaa3533da50b7e605c85a27505df1b3f958097a462baf7f8075a5b3fce051a7/diff:/var/lib/docker/overlay2/bfe980edb74d9b59db70c24f3170699c622c835a6dcd0bbc3b0d57063fcec902/diff:/var/lib/docker/overlay2/bc2896ea2401897cb5864b178ea5858ec6a886e9eb4ec4d527af5fa6c2855ae5/diff:/var/lib/docker/overlay2/5bbc424cfac7675d19b29b7189d326750e54613baaf160492d9795c2980f96df/diff:/var/lib/docker/overlay2/91fe6b5c878f52b9e01a8aca536a3489984bf1b3a6de561a805a8cacf95f19c6/diff:/var/lib/docker/overlay2/3aadd826d5b7e09d3cf0bd889bc9dba9f1c593ee764dc91ffac0eba6fe052235/diff",
            "MergedDir": "/var/lib/docker/overlay2/f435cb23a677875b49847af14a7af629302ca4f6ca89cc0e5644814130dd270e/merged",
            "UpperDir": "/var/lib/docker/overlay2/f435cb23a677875b49847af14a7af629302ca4f6ca89cc0e5644814130dd270e/diff",
            "WorkDir": "/var/lib/docker/overlay2/f435cb23a677875b49847af14a7af629302ca4f6ca89cc0e5644814130dd270e/work"
        },
        "Name": "overlay2"
    },
    "RootFS": {
        "Type": "layers",
        "Layers": [
            "sha256:107cbdaeec042e6154640c94972c638f4e2fee795902b149e8ce9acbd03d59d7",
            "sha256:d54e805fe9aac70390dd3e3ca1b6fd0f63b2cf5913eb0023bf741bd291d2a746",
            "sha256:35a8bdbbbf277997b1e1efcbdc9ae5321ec3a96ceb81ad194ad58d671ab47bd4",
            "sha256:12f5e36b8e7e83c3fa409e1b34eaec072b9381c8a5e600e9f03c53dbd295bec1",
            "sha256:455bca42f6ec40ca42fea4bea15c6c17b97101af90413cdd25647de2b9d98960",
            "sha256:be1ea5a6d87ed76fe6b3cd593192bb5e588d3e76c3dbee09959a55fcb13f1fa6",
            "sha256:b3f449ad002f4dd979cc74e5acfd9c3dc22f23b42df7e768930a0602bcdb4dfa",
            "sha256:a2e280d9d3759e36b906063950b2ae578d274f2aac6e9c48fac5c17c9dccf870",
            "sha256:f104cb02332db40d078c2c1790f15052f9c5420cfd63e2334f6541d60c7f7212",
            "sha256:47e23f6b4dadaba72a8a043dc85982b6c02a22f3920c58f335071d897816bf50",
            "sha256:7b141371e584f11c1156421847eaca203a629f5b7860801155699eaa823e05b3",
            "sha256:7f47e72be4d5e82235e62bdcb30d2a0910e7308eea766c38bd5658f54eb457fa",
            "sha256:346a8c6ad6d97baa521577f916f7b9172b7ba39348e0a3997b214eab3fe54631",
            "sha256:c772d08c4f6d3fc78455295d43120ff298ae1ab6ee071c7429814e7e07fc4f2b",
            "sha256:0603aba41ebd53112c8f7afa76188fb28939393d3a55795d7bc8c01dcd72ef75",
            "sha256:f7a8d29961c42f0d512d406c7f40b577c1ff3ac1e4bb20e162229fd50073ea26",
            "sha256:488790ae905ce33c0912665c504c593256889dda15bd20fb08532ebcb5d27ba3",
            "sha256:00d5f4243cfb0fc058bd99a282ebe622f2fe57b9e95de46c46ca499c27ae73f8",
            "sha256:0b3b8d513a404a059d189ac976b4dd8cbc03a3bb9e4bbe5df8dd4ebf11c9a82f",
            "sha256:a77810403d4a5e41393ee0ae8168122d6803d90c3a408a064ab17796b284673b",
            "sha256:c0809655ae46f46cef3213b0fbf6b134d574ba0fe3e313520b24485ff0360005",
            "sha256:ea15cc760c6230f2caf4d803ab9b1adfd813daa201bdce4c27224bfd4d2dd533",
            "sha256:52f2d0425ed38e669c5e6cb95fe27e2f1b8c660ee5cd941f0f752725ea0a341c",
            "sha256:eefdaa938e3e7a53bdc37ccadba179927bc13f58235a9729434df9e2b0d4b270",
            "sha256:b50fb8762bb73be8526a8d906e82859c5685791b9f0f38186c525a17a05e6b2b",
            "sha256:2de4af21747abd726d7b683633a0c029db41971884e78ec3c238570ac57cb6e3",
            "sha256:76857e366a00448bc06628dabf762ebef94af16b444f668d3879432c19b46482",
            "sha256:8faef32b82c375f2caf8c66fb491fc5900cbf850c0bf2bd142d70cf3a1155203",
            "sha256:5f70bf18a086007016e948b04aed3b82103a36bea41755b6cddfaf10ace3c6ef",
            "sha256:5f70bf18a086007016e948b04aed3b82103a36bea41755b6cddfaf10ace3c6ef",
            "sha256:616a149b0c83f1f742d4e3dd2f48efb9c8ed3e43dbf64089cf3d30c5f5b53ea4"
        ]
    },
    "Metadata": {
        "LastTagTime": "2026-05-09T01:59:38.667272556+08:00"
    }
}

更多版本

docker.io/gpustack/runner:cann8.2-910b-vllm0.11.0

linux/arm64 docker.io15.96GB2025-11-27 00:53
508

docker.io/gpustack/runner:cuda12.9-vllm0.11.2

linux/amd64 docker.io33.62GB2025-12-03 00:42
539

docker.io/gpustack/runner:cann8.2-910b-sglang0.5.2

linux/amd64 docker.io17.03GB2025-12-05 01:02
303

docker.io/gpustack/runner:cann8.2-910b-mindie2.1.rc2

linux/arm64 docker.io16.02GB2025-12-05 01:13
339

docker.io/gpustack/runner:cann8.2-910b-sglang0.5.2

linux/arm64 docker.io18.27GB2025-12-05 01:28
388

docker.io/gpustack/runner:cuda12.4-vllm0.11.0

linux/amd64 docker.io24.31GB2025-12-05 02:03
608

docker.io/gpustack/runner:cuda12.8-sglang0.5.5

linux/amd64 docker.io32.96GB2025-12-05 02:43
551

docker.io/gpustack/runner:cuda12.4-voxbox0.0.20

linux/amd64 docker.io17.15GB2025-12-05 04:14
240

docker.io/gpustack/runner:cuda12.8-voxbox0.0.21

linux/amd64 docker.io20.57GB2026-03-05 00:39
162

docker.io/gpustack/runner:rocm7.1-vllm0.17.1

linux/amd64 docker.io31.12GB2026-04-22 00:42
62

docker.io/gpustack/runner:cuda13.0-vllmv0.20.1

linux/amd64 docker.io24.18GB2026-05-08 14:23
16

docker.io/gpustack/runner:cuda12.9-vllmv0.20.1

linux/amd64 docker.io27.44GB2026-05-09 02:28
6