ghcr.io/huggingface/text-generation-inference:3.3 linux/amd64

ghcr.io/huggingface/text-generation-inference:3.3 - China mirror download source

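Hugging Face Text Generation Inference image for serving text generation models. It is designed for high-performance text generation, with optimized latency and throughput. The image supports many models and exposes an easy-to-use API that simplifies deployment and scaling.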

Source image ghcr.io/huggingface/text-generation-inference:3.3
China mirror swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-generation-inference:3.3
Image ID sha256:3f235b36e71ac31340335f441d7268aeec5b90c806140ccc6113d49615639787
Image tag 3.3
Size 16.92GB
Registry ghcr.io
CMD (none)
Entrypoint /tgi-entrypoint.sh
Working directory /usr/src
OS/platform linux/amd64
Image created 2025-09-16T23:24:30.780821409Z
Synced 2025-10-15 01:06
Updated 2025-10-15 20:06
Environment variables
PATH=/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/src/.venv/bin/
NVARCH=x86_64
NVIDIA_REQUIRE_CUDA=cuda>=12.4 brand=tesla,driver>=470,driver<471 brand=unknown,driver>=470,driver<471 brand=nvidia,driver>=470,driver<471 brand=nvidiartx,driver>=470,driver<471 brand=geforce,driver>=470,driver<471 brand=geforcertx,driver>=470,driver<471 brand=quadro,driver>=470,driver<471 brand=quadrortx,driver>=470,driver<471 brand=titan,driver>=470,driver<471 brand=titanrtx,driver>=470,driver<471 brand=tesla,driver>=525,driver<526 brand=unknown,driver>=525,driver<526 brand=nvidia,driver>=525,driver<526 brand=nvidiartx,driver>=525,driver<526 brand=geforce,driver>=525,driver<526 brand=geforcertx,driver>=525,driver<526 brand=quadro,driver>=525,driver<526 brand=quadrortx,driver>=525,driver<526 brand=titan,driver>=525,driver<526 brand=titanrtx,driver>=525,driver<526 brand=tesla,driver>=535,driver<536 brand=unknown,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=geforce,driver>=535,driver<536 brand=geforcertx,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=titan,driver>=535,driver<536 brand=titanrtx,driver>=535,driver<536
NV_CUDA_CUDART_VERSION=12.4.99-1
NV_CUDA_COMPAT_PACKAGE=cuda-compat-12-4
CUDA_VERSION=12.4.0
LD_LIBRARY_PATH=/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/root/.local/share/uv/python/cpython-3.11.11-linux-x86_64-gnu/lib/
NVIDIA_VISIBLE_DEVICES=all
NVIDIA_DRIVER_CAPABILITIES=compute,utility
HF_HOME=/data
HF_HUB_ENABLE_HF_TRANSFER=1
PORT=80
PYTHON_VERSION=3.11
VIRTUAL_ENV=/usr/src/.venv/
HF_KERNELS_CACHE=/kernels
EXLLAMA_NO_FLASH_ATTN=1
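The NVIDIA_REQUIRE_CUDA value is hard to read at a glance. As far as the NVIDIA Container Toolkit convention goes, space-separated clauses are alternatives and comma-separated conditions inside a clause must all hold. The sketch below lists the driver branches named in such a value. The `req` string is an abbreviated copy that keeps only the `tesla` clauses, and `driver_branches` is a hypothetical helper name, not part of any tool.

```shell
# Abbreviated copy of the NVIDIA_REQUIRE_CUDA value listed above
# (assumption: only the tesla clauses are kept to keep the example short).
req='cuda>=12.4 brand=tesla,driver>=470,driver<471 brand=tesla,driver>=525,driver<526 brand=tesla,driver>=535,driver<536'

# Clauses are separated by spaces, conditions inside a clause by commas.
# Split on both, keep the lower driver bounds, and de-duplicate them.
driver_branches() {
    printf '%s\n' "$1" | tr ' ,' '\n\n' | sed -n 's/^driver>=\([0-9][0-9]*\)$/\1/p' | sort -u
}

driver_branches "$req"
```

Read this way, the value accepts either a driver that reports CUDA 12.4 or later, or one of the listed older driver branches.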
Image labels
maintainer: NVIDIA CORPORATION <cudatools@nvidia.com>
org.opencontainers.image.created: 2025-09-17T00:49:25.224Z
org.opencontainers.image.description: Large Language Model Text Generation Inference
org.opencontainers.image.licenses: Apache-2.0
org.opencontainers.image.ref.name: ubuntu
org.opencontainers.image.revision: efb94e0d3db6aba9d464bc9a2f83191146203152
org.opencontainers.image.source: https://github.com/huggingface/text-generation-inference
org.opencontainers.image.title: text-generation-inference
org.opencontainers.image.url: https://github.com/huggingface/text-generation-inference
org.opencontainers.image.version: 3.3.6

Docker pull commands

docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-generation-inference:3.3
docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-generation-inference:3.3  ghcr.io/huggingface/text-generation-inference:3.3
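The two commands above follow a fixed pattern: the mirror reference is the upstream reference with the mirror namespace prepended. A minimal sketch of that pattern as a reusable function follows. `mirror_ref`, `pull_via_mirror`, and the `DRY_RUN` switch are hypothetical names introduced for illustration, and it is an assumption that other images on this mirror follow the same prefix scheme.

```shell
# Mirror namespace taken from the commands above.
MIRROR_PREFIX='swr.cn-north-4.myhuaweicloud.com/ddn-k8s'

# Hypothetical helper: map an upstream reference to its mirror reference.
mirror_ref() {
    printf '%s/%s\n' "$MIRROR_PREFIX" "$1"
}

# Pull from the mirror, then retag under the upstream name.
# With DRY_RUN=1 (the default here) the commands are only printed.
pull_via_mirror() {
    src=$1
    dst=$(mirror_ref "$src")
    if [ "${DRY_RUN:-1}" = 1 ]; then
        echo "docker pull $dst"
        echo "docker tag $dst $src"
    else
        docker pull "$dst" && docker tag "$dst" "$src"
    fi
}

pull_via_mirror ghcr.io/huggingface/text-generation-inference:3.3
```

Setting `DRY_RUN=0` runs the real commands; the default only prints them so the mapping can be checked first.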

Containerd pull commands

ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-generation-inference:3.3
ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-generation-inference:3.3  ghcr.io/huggingface/text-generation-inference:3.3

Shell quick-replace command

sed -i 's#ghcr.io/huggingface/text-generation-inference:3.3#swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-generation-inference:3.3#' deployment.yaml
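One caveat with the one-liner above: the mirror reference contains the upstream reference as a substring, so running the command twice on the same file prefixes the image twice. It also rewrites only the first match on each line. A sketch of a variant that can be re-run safely follows. `to_mirror` is a hypothetical helper that filters stdin to stdout rather than editing in place, and the sample input line is made up for the demonstration.

```shell
SRC='ghcr.io/huggingface/text-generation-inference:3.3'
MIRROR='swr.cn-north-4.myhuaweicloud.com/ddn-k8s'

# Rewrite stdin to stdout: first undo any earlier rewrite, then apply it to
# every occurrence (g flag). Dots in SRC are left unescaped, as in the
# one-liner above, so they match any character.
to_mirror() {
    sed -e "s#$MIRROR/$SRC#$SRC#g" -e "s#$SRC#$MIRROR/$SRC#g"
}

# Apply the rewrite once, then again on its own output.
once=$(printf 'image: %s\n' "$SRC" | to_mirror)
twice=$(printf '%s\n' "$once" | to_mirror)

echo "$once"
[ "$once" = "$twice" ] && echo "second pass changed nothing"
```

For a real manifest, something like `to_mirror < deployment.yaml > deployment.mirror.yaml` keeps the original file untouched.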

Ansible quick distribution (Docker)

#ansible k8s -m shell -a 'docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-generation-inference:3.3 && docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-generation-inference:3.3  ghcr.io/huggingface/text-generation-inference:3.3'

Ansible quick distribution (Containerd)

#ansible k8s -m shell -a 'ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-generation-inference:3.3 && ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-generation-inference:3.3  ghcr.io/huggingface/text-generation-inference:3.3'

Image build history


# 2025-09-17 07:24:30  0.00B Configure the command run at container start
ENTRYPOINT ["/tgi-entrypoint.sh"]

# 2025-09-17 07:24:30  0.00B Set environment variable LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/root/.local/share/uv/python/cpython-3.11.11-linux-x86_64-gnu/lib/
                        
# 2025-09-17 07:24:30  0.00B Run a command and create a new image layer
RUN /bin/sh -c chmod +x /tgi-entrypoint.sh # buildkit

# 2025-09-17 07:24:30  170.00B Copy new files or directories into the container
COPY ./tgi-entrypoint.sh /tgi-entrypoint.sh # buildkit

# 2025-09-17 07:24:30  6.98MB Copy new files or directories into the container
COPY /usr/src/target/release-opt/text-generation-launcher /usr/local/bin/text-generation-launcher # buildkit
                        
# 2025-09-17 07:24:30  39.42MB Copy new files or directories into the container
COPY /usr/src/target/release-opt/text-generation-router /usr/local/bin/text-generation-router # buildkit
                        
# 2025-09-17 05:02:48  11.85MB Copy new files or directories into the container
COPY /usr/src/target/release-opt/text-generation-benchmark /usr/local/bin/text-generation-benchmark # buildkit
                        
# 2025-09-09 03:51:58  223.17MB Run a command and create a new image layer
RUN /bin/sh -c apt-get update && DEBIAN_FRONTEND=noninteractive apt-get install -y --no-install-recommends         build-essential         g++         && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-09-09 03:51:47  0.00B Set environment variable EXLLAMA_NO_FLASH_ATTN
ENV EXLLAMA_NO_FLASH_ATTN=1

# 2025-09-09 03:51:47  1.31GB Copy new files or directories into the container
COPY /usr/src/.venv/lib/python3.11/site-packages/flashinfer/ /usr/src/.venv/lib/python3.11/site-packages/flashinfer/ # buildkit
                        
# 2025-09-09 02:44:08  34.18MB Copy new files or directories into the container
COPY /usr/src/causal-conv1d/build/lib.linux-x86_64-cpython-311/ /usr/src/.venv/lib/python3.11/site-packages # buildkit
                        
# 2025-09-09 02:44:08  194.24MB Copy new files or directories into the container
COPY /usr/src/mamba/build/lib.linux-x86_64-cpython-311/ /usr/src/.venv/lib/python3.11/site-packages # buildkit
                        
# 2025-09-09 02:44:08  10.25MB Copy new files or directories into the container
COPY /usr/src/llm-awq/awq/kernels/build/lib.linux-x86_64-cpython-311 /usr/src/.venv/lib/python3.11/site-packages # buildkit
                        
# 2025-09-09 02:44:08  163.26MB Copy new files or directories into the container
COPY /usr/src/exllamav2/build/lib.linux-x86_64-cpython-311 /usr/src/.venv/lib/python3.11/site-packages # buildkit
                        
# 2025-09-09 02:44:08  10.65MB Copy new files or directories into the container
COPY /usr/src/build/lib.linux-x86_64-cpython-311 /usr/src/.venv/lib/python3.11/site-packages # buildkit
                        
# 2025-09-09 02:44:08  3.09MB Copy new files or directories into the container
COPY /usr/src/build/lib.linux-x86_64-cpython-311 /usr/src/.venv/lib/python3.11/site-packages # buildkit
                        
# 2025-09-09 02:44:08  632.67MB Copy new files or directories into the container
COPY /usr/src/.venv/lib/python3.11/site-packages/flash_attn_2_cuda.cpython-311-x86_64-linux-gnu.so /usr/src/.venv/lib/python3.11/site-packages # buildkit
                        
# 2025-09-09 02:08:16  12.47MB Copy new files or directories into the container
COPY /usr/src/flash-attention/csrc/rotary/build/lib.linux-x86_64-cpython-311 /usr/src/.venv/lib/python3.11/site-packages # buildkit
                        
# 2025-09-09 02:08:16  736.53MB Copy new files or directories into the container
COPY /usr/src/flash-attention/csrc/layer_norm/build/lib.linux-x86_64-cpython-311 /usr/src/.venv/lib/python3.11/site-packages # buildkit
                        
# 2025-09-09 02:08:16  165.07MB Copy new files or directories into the container
COPY /usr/src/flash-attention/build/lib.linux-x86_64-cpython-311 /usr/src/.venv/lib/python3.11/site-packages # buildkit
                        
# 2025-09-09 01:58:21  332.21MB Run a command and create a new image layer
RUN /bin/sh -c cd server &&     uv sync --frozen --extra gen --extra bnb --extra accelerate --extra compressed-tensors --extra quantize --extra peft --extra outlines --extra torch --active --python=${PYTHON_VERSION} &&     uv pip install nvidia-nccl-cu12==2.25.1 &&     pwd &&     text-generation-server --help # buildkit
                        
# 2025-09-09 01:58:08  7.15GB Run a command and create a new image layer
RUN /bin/sh -c cd server && 	uv sync --frozen --extra gen --extra bnb --extra accelerate --extra compressed-tensors --extra quantize --extra peft --extra outlines --extra torch --no-install-project --active &&     make gen-server-raw &&     kernels download . # buildkit
                        
# 2025-09-17 07:24:30  0.00B Set environment variable HF_KERNELS_CACHE
ENV HF_KERNELS_CACHE=/kernels

# 2025-09-17 07:24:30  0.00B Copy new files or directories into the container
COPY server/Makefile server/Makefile # buildkit

# 2025-09-09 01:57:07  2.85MB Copy new files or directories into the container
COPY server server # buildkit

# 2025-05-12 19:03:15  13.42KB Copy new files or directories into the container
COPY proto proto # buildkit

# 2025-05-12 19:03:15  0.00B Set environment variable PATH
ENV PATH=/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/src/.venv/bin/
                        
# 2025-05-12 19:03:15  0.00B Set environment variable VIRTUAL_ENV
ENV VIRTUAL_ENV=/usr/src/.venv/

# 2025-05-12 19:03:15  61.55MB Run a command and create a new image layer
RUN /bin/sh -c uv python install ${PYTHON_VERSION} # buildkit

# 2025-05-12 19:03:14  0.00B Set environment variable PYTHON_VERSION
ENV PYTHON_VERSION=3.11

# 2025-05-12 19:03:14  5.46GB Copy new files or directories into the container
COPY /usr/src/.venv /usr/src/.venv # buildkit

# 2025-04-07 19:06:10  37.65MB Copy new files or directories into the container
COPY /uv /uvx /bin/ # buildkit

# 2025-02-14 18:40:32  85.15MB Run a command and create a new image layer
RUN /bin/sh -c apt-get update && DEBIAN_FRONTEND=noninteractive apt-get install -y --no-install-recommends         libssl-dev         ca-certificates         make         curl         git         && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-09-17 07:24:30  0.00B Set working directory to /usr/src
WORKDIR /usr/src

# 2025-09-17 07:24:30  0.00B Set environment variables HF_HOME, HF_HUB_ENABLE_HF_TRANSFER, PORT
ENV HF_HOME=/data HF_HUB_ENABLE_HF_TRANSFER=1 PORT=80

# 2024-04-05 07:36:23  0.00B Set environment variable NVIDIA_DRIVER_CAPABILITIES
ENV NVIDIA_DRIVER_CAPABILITIES=compute,utility

# 2024-04-05 07:36:23  0.00B Set environment variable NVIDIA_VISIBLE_DEVICES
ENV NVIDIA_VISIBLE_DEVICES=all

# 2024-04-05 07:36:23  17.29KB Copy new files or directories into the container
COPY NGC-DL-CONTAINER-LICENSE / # buildkit

# 2024-04-05 07:36:23  0.00B Set environment variable LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/nvidia/lib:/usr/local/nvidia/lib64

# 2024-04-05 07:36:23  0.00B Set environment variable PATH
ENV PATH=/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin

# 2024-04-05 07:36:23  46.00B Run a command and create a new image layer
RUN |1 TARGETARCH=amd64 /bin/sh -c echo "/usr/local/nvidia/lib" >> /etc/ld.so.conf.d/nvidia.conf     && echo "/usr/local/nvidia/lib64" >> /etc/ld.so.conf.d/nvidia.conf # buildkit
                        
# 2024-04-05 07:36:23  155.92MB Run a command and create a new image layer
RUN |1 TARGETARCH=amd64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     cuda-cudart-12-4=${NV_CUDA_CUDART_VERSION}     ${NV_CUDA_COMPAT_PACKAGE}     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2024-04-05 07:36:11  0.00B Set environment variable CUDA_VERSION
ENV CUDA_VERSION=12.4.0

# 2024-04-05 07:36:11  10.56MB Run a command and create a new image layer
RUN |1 TARGETARCH=amd64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     gnupg2 curl ca-certificates &&     curl -fsSLO https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/${NVARCH}/cuda-keyring_1.1-1_all.deb &&     dpkg -i cuda-keyring_1.1-1_all.deb &&     apt-get purge --autoremove -y curl     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2024-04-05 07:36:11  0.00B Add metadata label
LABEL maintainer=NVIDIA CORPORATION <cudatools@nvidia.com>

# 2024-04-05 07:36:11  0.00B Define build argument
ARG TARGETARCH

# 2024-04-05 07:36:11  0.00B Set environment variable NV_CUDA_COMPAT_PACKAGE
ENV NV_CUDA_COMPAT_PACKAGE=cuda-compat-12-4

# 2024-04-05 07:36:11  0.00B Set environment variable NV_CUDA_CUDART_VERSION
ENV NV_CUDA_CUDART_VERSION=12.4.99-1

# 2024-04-05 07:36:11  0.00B Set environment variable NVIDIA_REQUIRE_CUDA
ENV NVIDIA_REQUIRE_CUDA=cuda>=12.4 brand=tesla,driver>=470,driver<471 brand=unknown,driver>=470,driver<471 brand=nvidia,driver>=470,driver<471 brand=nvidiartx,driver>=470,driver<471 brand=geforce,driver>=470,driver<471 brand=geforcertx,driver>=470,driver<471 brand=quadro,driver>=470,driver<471 brand=quadrortx,driver>=470,driver<471 brand=titan,driver>=470,driver<471 brand=titanrtx,driver>=470,driver<471 brand=tesla,driver>=525,driver<526 brand=unknown,driver>=525,driver<526 brand=nvidia,driver>=525,driver<526 brand=nvidiartx,driver>=525,driver<526 brand=geforce,driver>=525,driver<526 brand=geforcertx,driver>=525,driver<526 brand=quadro,driver>=525,driver<526 brand=quadrortx,driver>=525,driver<526 brand=titan,driver>=525,driver<526 brand=titanrtx,driver>=525,driver<526 brand=tesla,driver>=535,driver<536 brand=unknown,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=geforce,driver>=535,driver<536 brand=geforcertx,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=titan,driver>=535,driver<536 brand=titanrtx,driver>=535,driver<536
                        
# 2024-04-05 07:36:11  0.00B Set environment variable NVARCH
ENV NVARCH=x86_64
                        
# 2024-02-28 02:52:59  0.00B 
/bin/sh -c #(nop)  CMD ["/bin/bash"]
                        
# 2024-02-28 02:52:58  77.86MB 
/bin/sh -c #(nop) ADD file:21c2e8d95909bec6f4acdaf4aed55b44ee13603681f93b152e423e3e6a4a207b in / 
                        
# 2024-02-28 02:52:57  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.version=22.04
                        
# 2024-02-28 02:52:57  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.ref.name=ubuntu
                        
# 2024-02-28 02:52:57  0.00B 
/bin/sh -c #(nop)  ARG LAUNCHPAD_BUILD_ARCH
                        
# 2024-02-28 02:52:57  0.00B 
/bin/sh -c #(nop)  ARG RELEASE
                        
                    

Image information

{
    "Id": "sha256:3f235b36e71ac31340335f441d7268aeec5b90c806140ccc6113d49615639787",
    "RepoTags": [
        "ghcr.io/huggingface/text-generation-inference:3.3",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-generation-inference:3.3"
    ],
    "RepoDigests": [
        "ghcr.io/huggingface/text-generation-inference@sha256:07de704f7b26353a008416e78f3d2a575a3e81ce88882263c565c066b446ca73",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-generation-inference@sha256:f17bd2b3a1de583bd44aad3e96e6d1975bd9ba6c3b33bf9f4147b848ddc3e629"
    ],
    "Parent": "",
    "Comment": "buildkit.dockerfile.v0",
    "Created": "2025-09-16T23:24:30.780821409Z",
    "Container": "",
    "ContainerConfig": null,
    "DockerVersion": "",
    "Author": "",
    "Config": {
        "Hostname": "",
        "Domainname": "",
        "User": "",
        "AttachStdin": false,
        "AttachStdout": false,
        "AttachStderr": false,
        "Tty": false,
        "OpenStdin": false,
        "StdinOnce": false,
        "Env": [
            "PATH=/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/src/.venv/bin/",
            "NVARCH=x86_64",
            "NVIDIA_REQUIRE_CUDA=cuda\u003e=12.4 brand=tesla,driver\u003e=470,driver\u003c471 brand=unknown,driver\u003e=470,driver\u003c471 brand=nvidia,driver\u003e=470,driver\u003c471 brand=nvidiartx,driver\u003e=470,driver\u003c471 brand=geforce,driver\u003e=470,driver\u003c471 brand=geforcertx,driver\u003e=470,driver\u003c471 brand=quadro,driver\u003e=470,driver\u003c471 brand=quadrortx,driver\u003e=470,driver\u003c471 brand=titan,driver\u003e=470,driver\u003c471 brand=titanrtx,driver\u003e=470,driver\u003c471 brand=tesla,driver\u003e=525,driver\u003c526 brand=unknown,driver\u003e=525,driver\u003c526 brand=nvidia,driver\u003e=525,driver\u003c526 brand=nvidiartx,driver\u003e=525,driver\u003c526 brand=geforce,driver\u003e=525,driver\u003c526 brand=geforcertx,driver\u003e=525,driver\u003c526 brand=quadro,driver\u003e=525,driver\u003c526 brand=quadrortx,driver\u003e=525,driver\u003c526 brand=titan,driver\u003e=525,driver\u003c526 brand=titanrtx,driver\u003e=525,driver\u003c526 brand=tesla,driver\u003e=535,driver\u003c536 brand=unknown,driver\u003e=535,driver\u003c536 brand=nvidia,driver\u003e=535,driver\u003c536 brand=nvidiartx,driver\u003e=535,driver\u003c536 brand=geforce,driver\u003e=535,driver\u003c536 brand=geforcertx,driver\u003e=535,driver\u003c536 brand=quadro,driver\u003e=535,driver\u003c536 brand=quadrortx,driver\u003e=535,driver\u003c536 brand=titan,driver\u003e=535,driver\u003c536 brand=titanrtx,driver\u003e=535,driver\u003c536",
            "NV_CUDA_CUDART_VERSION=12.4.99-1",
            "NV_CUDA_COMPAT_PACKAGE=cuda-compat-12-4",
            "CUDA_VERSION=12.4.0",
            "LD_LIBRARY_PATH=/usr/local/nvidia/lib:/usr/local/nvidia/lib64:/root/.local/share/uv/python/cpython-3.11.11-linux-x86_64-gnu/lib/",
            "NVIDIA_VISIBLE_DEVICES=all",
            "NVIDIA_DRIVER_CAPABILITIES=compute,utility",
            "HF_HOME=/data",
            "HF_HUB_ENABLE_HF_TRANSFER=1",
            "PORT=80",
            "PYTHON_VERSION=3.11",
            "VIRTUAL_ENV=/usr/src/.venv/",
            "HF_KERNELS_CACHE=/kernels",
            "EXLLAMA_NO_FLASH_ATTN=1"
        ],
        "Cmd": null,
        "Image": "",
        "Volumes": null,
        "WorkingDir": "/usr/src",
        "Entrypoint": [
            "/tgi-entrypoint.sh"
        ],
        "OnBuild": null,
        "Labels": {
            "maintainer": "NVIDIA CORPORATION \u003ccudatools@nvidia.com\u003e",
            "org.opencontainers.image.created": "2025-09-17T00:49:25.224Z",
            "org.opencontainers.image.description": "Large Language Model Text Generation Inference",
            "org.opencontainers.image.licenses": "Apache-2.0",
            "org.opencontainers.image.ref.name": "ubuntu",
            "org.opencontainers.image.revision": "efb94e0d3db6aba9d464bc9a2f83191146203152",
            "org.opencontainers.image.source": "https://github.com/huggingface/text-generation-inference",
            "org.opencontainers.image.title": "text-generation-inference",
            "org.opencontainers.image.url": "https://github.com/huggingface/text-generation-inference",
            "org.opencontainers.image.version": "3.3.6"
        }
    },
    "Architecture": "amd64",
    "Os": "linux",
    "Size": 16923337297,
    "GraphDriver": {
        "Data": {
            "LowerDir": "/var/lib/docker/overlay2/fd7345b3c722ae4eb2a07fb22060858fcc3891155ae5e2a337407692c9458607/diff:/var/lib/docker/overlay2/5f14f15aad5def215bc9fe535ce88dc76b061cf79c7e7576e9ccff385e78385d/diff:/var/lib/docker/overlay2/ae7c478c26720f399c301165b5532012067d122ff9ba13d1bdc59323fb17c226/diff:/var/lib/docker/overlay2/b1b6c3ecd7a091e6476139c971d7f1ddf95668d2fa806bee58a41cead45270ae/diff:/var/lib/docker/overlay2/bfb288f50b2a5c28f960f330713ec7a9eb8699e88fca35d90ec6ffd3732c0155/diff:/var/lib/docker/overlay2/158cb1a636b896f339d3963f7f02cbea1001080e48f04ff8e967a68dc1f8989b/diff:/var/lib/docker/overlay2/a053655fc6bf903a82f5a8841550e197d9ddb1c9a638edde8fac9e4d7b33c1b3/diff:/var/lib/docker/overlay2/cd4fd621d368e617b3f2ea9b05d0307d61334b42522d87432ea3b90ea7b3d41f/diff:/var/lib/docker/overlay2/1fe257d0ef3fc451b6330663004c7e56e390cc67914da4fff0f05ad686bf37cc/diff:/var/lib/docker/overlay2/860378c5d8c8ca6057eb4cd441883c59b1002b3f66b9e2836adc8e0963163295/diff:/var/lib/docker/overlay2/6b4758fc9c34b65e98ca41af59d131dadaa3f781e14be4b29c03220d61aeed07/diff:/var/lib/docker/overlay2/67b1b42409e2cfeae28e1804ae76779ac085eb5ed38ee9b447df02c94598a02e/diff:/var/lib/docker/overlay2/60878c4d0202f0fe4abc48ea73fea663f43eaed13e38c1bfa11a33bbd324d281/diff:/var/lib/docker/overlay2/0b2a1f493a7269d3d7ba67315f3edadfdd3307be43f1502b85f8cda952e817bd/diff:/var/lib/docker/overlay2/efcae2d5b7df04a3eca1e6685330e7f5c55ed2460326790e331863c8e4687d96/diff:/var/lib/docker/overlay2/59c16eb74a8456c86a0e99f95d08e0000ccb0b9d76c828666e9f82bd82a49d11/diff:/var/lib/docker/overlay2/6feb3ae53c0d641f184d5fd41672278d664f56b262e9d35390ed3afd9903b370/diff:/var/lib/docker/overlay2/5ed4136ff1619fb476c1e1c4ae53deaedbacf2b8af14e66fa1b5bd05d94b8671/diff:/var/lib/docker/overlay2/5021c8b29a068a3d5ecdfa3c5b51ba4ca2e7630e5d7349fd3acd110d845c5175/diff:/var/lib/docker/overlay2/699c5f7749c2f058415aadf372cc302e289d67172429bc954561253b5e205b56/diff:/var/lib/docker/overlay2/effdb0602dda890667ba9f5b454a7e0a449076068a6d7ed1afe438594042e1d3/diff:/var/lib/docker/overlay2/e0892c2db12b5e5b3914d1b328b95cd5fbb82630a8733f3c3bcf504de30e4832/diff:/var/lib/docker/overlay2/7f7530b4165722f3433e0cf8b8c53c0d0237ef7406017d0148bc79d45e77af38/diff:/var/lib/docker/overlay2/b5e0dfc9ca9f6a420d08de9b6040e8df827594eecb0e51b8b3192967e807fef7/diff:/var/lib/docker/overlay2/8cbeae013dd5f08e6636f2d4914c3172da1979a04b24a61868568112b005d6ed/diff:/var/lib/docker/overlay2/fe4cdaf49f0422b6d61af230b5be566ff7009bd16ba9115bea8ff1a5740c6c11/diff:/var/lib/docker/overlay2/461ca7fd6e39d861b2311f1e1433d554d7027501f7e27f291d664f440cb033f9/diff:/var/lib/docker/overlay2/627a195d1725748e224ba0c8d299e8d5a862ad6133982c6458052c6b839f5e4a/diff:/var/lib/docker/overlay2/297b258cb3a6b3147e80550dbc6c3bd5998554552abd02cb3248496dde840bb9/diff:/var/lib/docker/overlay2/53590fc1976ffdc026d2b1570b36addf9b59e2c2a5e053cb70ec6daa7177f59c/diff:/var/lib/docker/overlay2/36afb5f3b33b3915eca7fc7803ea8c9687feff5bf1dafe376b931e39494e6100/diff",
            "MergedDir": "/var/lib/docker/overlay2/98ecaff459571511c311c626c1672c199a18b62af11cdd2dd906d53d66e08af1/merged",
            "UpperDir": "/var/lib/docker/overlay2/98ecaff459571511c311c626c1672c199a18b62af11cdd2dd906d53d66e08af1/diff",
            "WorkDir": "/var/lib/docker/overlay2/98ecaff459571511c311c626c1672c199a18b62af11cdd2dd906d53d66e08af1/work"
        },
        "Name": "overlay2"
    },
    "RootFS": {
        "Type": "layers",
        "Layers": [
            "sha256:5498e8c22f6996f25ef193ee58617d5b37e2a96decf22e72de13c3b34e147591",
            "sha256:4cd4079525948900a02a5734090afd1f3e046fc940dc882c55efcaee0a252dd0",
            "sha256:022bf74291b27404b223ba9ee16a7f3fb067253df9c65e23dfb3339800b28dfa",
            "sha256:eeb5315df33c9e700b3b8b8a3cdd1cf11e13c9dd44bfd946e340573478303349",
            "sha256:e942261d196e5e686398e2326c033119112f910191143b0497f13f78c377fa03",
            "sha256:5f70bf18a086007016e948b04aed3b82103a36bea41755b6cddfaf10ace3c6ef",
            "sha256:64ea17e826dc7e4d48595831d2b74eaedcfbe7714a06ae83e8e60a6999723989",
            "sha256:730ed689a6f17634e2884b8307b56581aae5c38844b5b3149926c22c6bfa9b22",
            "sha256:46eef3cbabf51553608e7314ae4c99675984a71013c7d65d293d194ede8954a3",
            "sha256:d0f55793bcfd4de7d7b639739578d3701c6b330a876887f8b86a8378769a6335",
            "sha256:afde7bb1ed9eb1accd934f02059baf995206b82b0d3e85943353b40c17b8c766",
            "sha256:2b155d2b915fa05025957ae0ef6c176fe984d537d08ca810ce44389361ed5a7e",
            "sha256:5f70bf18a086007016e948b04aed3b82103a36bea41755b6cddfaf10ace3c6ef",
            "sha256:e0920b1b4fb8aea05489409bdc34babaf510224a141eda55556cd670c08e2863",
            "sha256:079a106603d35cafb35c28db51ce477f4a7dc8c207e12023f57934a924445bbc",
            "sha256:e71af3b563a9e3c6d790fe0c43766ab9cd10d37fc56412ba27685ffe6185a1a2",
            "sha256:02250945d4bcc249c9e9156bf7e11527d000641d6034e2e2d958a08c8b4a71d6",
            "sha256:11f7719fab5eb728dda8be8c5f4afb82770442435a91ad523603e22d9bbbba64",
            "sha256:0f5f77c832ac15768758ba1ffa1b7cd0e839b0a7912c4e8fec7897f9afaeaa0a",
            "sha256:ddfffa9644375f1b50a3aa100a38cba249d5c3e566ed824953fa11f2bc8d8939",
            "sha256:b0f18c78111f993b0793245c3334adaf4d9e5a1e49d2bcfd9834759e8a3f0816",
            "sha256:f75c0e89a8f5dfc001ad8c8c0984656766047210e9538fab8910443445e33c0d",
            "sha256:cc37464586b90386411b53cfae8be6f3daa969823e13e1a99c208bbb2e21f719",
            "sha256:f85bd9d937b0d70483d42ef3df2e6d334f0176e3ee6a90dad779c1ce40abaf37",
            "sha256:23c00595a576f455c0ef1a708e319f79a32cfeaaafa97779cc262bc11ce04309",
            "sha256:c124e57cea5eee6c504e4699430be74c47b96b9abb20c75747753ad5b9b7d3b6",
            "sha256:4a07db686f9a7274078e303e2b8356ee1d2358efa28a596b3d38d05121f623bc",
            "sha256:a473bc1b654a254ad43acb6e05f50d055890e13334e68c855adc0709cf492b55",
            "sha256:403e9316b276d8d82458d20469a6343caaa7f666a0c9da1b24735239579e63c4",
            "sha256:0cfa1752a413bdc2997f6d097a4729f00596bfb276683a558fd3f2c3d9bc8825",
            "sha256:fbce79b41bde7e21b76dd93dfc60b7b0054df56c0143d359052eb6fd23365f97",
            "sha256:5f70bf18a086007016e948b04aed3b82103a36bea41755b6cddfaf10ace3c6ef"
        ]
    },
    "Metadata": {
        "LastTagTime": "2025-10-15T00:46:22.794540248+08:00"
    }
}
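
The `Size` field above is in bytes, while the page header reports 16.92GB. A small sketch of the conversion, assuming decimal gigabytes (10^9 bytes), which is the reading that reproduces the listed figure. `bytes_to_gb` is a hypothetical helper name.

```shell
# Size field from the inspect output above, in bytes.
size_bytes=16923337297

# Convert to decimal gigabytes (1 GB = 10^9 bytes), two decimal places.
bytes_to_gb() {
    awk -v b="$1" 'BEGIN { printf "%.2f\n", b / 1e9 }'
}

bytes_to_gb "$size_bytes"   # prints 16.92
```

Dividing by 2^30 instead would give a smaller number in GiB, so the unit matters when comparing against free disk space.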

More versions

ghcr.io/huggingface/text-generation-inference:2.1.1

linux/amd64 ghcr.io 10.66GB 2024-09-07 04:52

ghcr.io/huggingface/text-generation-inference:2.2

linux/amd64 ghcr.io 11.37GB 2024-09-07 05:20

ghcr.io/huggingface/text-generation-inference:2.3.0

linux/amd64 ghcr.io 13.75GB 2024-09-23 15:50

ghcr.io/huggingface/text-generation-inference:2.4.0

linux/amd64 ghcr.io 14.11GB 2024-11-08 17:53

ghcr.io/huggingface/text-generation-inference:3.1.0

linux/amd64 ghcr.io 12.24GB 2025-02-11 04:56

ghcr.io/huggingface/text-generation-inference:3.1.1

linux/amd64 ghcr.io 16.21GB 2025-03-10 04:51

ghcr.io/huggingface/text-generation-inference:3.2.1

linux/amd64 ghcr.io 16.22GB 2025-03-25 00:58

ghcr.io/huggingface/text-generation-inference:3.3

linux/amd64 ghcr.io 16.92GB 2025-10-15 01:06