ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2 linux/amd64

ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2 - 国内下载镜像源 浏览次数:8

文本嵌入推断

Hugging Face 提供了一个用于文本嵌入推断的 Docker 镜像。
源镜像 ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2
国内镜像 swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2
镜像ID sha256:a58880345dce36b9a5903d1f69514fad97cfda99f2eb418df1d35a9dc1016dda
镜像TAG cuda-1.9.2
大小 5.17GB
镜像源 ghcr.io
CMD --json-output
启动入口 ./entrypoint.sh
工作目录
OS/平台 linux/amd64
浏览量 8 次
贡献者
镜像创建 2026-02-25T12:05:07.135923289Z
同步时间 2026-03-13 00:40
环境变量
PATH=/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin NVARCH=x86_64 NVIDIA_REQUIRE_CUDA=cuda>=12.9 brand=unknown,driver>=535,driver<536 brand=grid,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=vapps,driver>=535,driver<536 brand=vpc,driver>=535,driver<536 brand=vcs,driver>=535,driver<536 brand=vws,driver>=535,driver<536 brand=cloudgaming,driver>=535,driver<536 brand=unknown,driver>=550,driver<551 brand=grid,driver>=550,driver<551 brand=tesla,driver>=550,driver<551 brand=nvidia,driver>=550,driver<551 brand=quadro,driver>=550,driver<551 brand=quadrortx,driver>=550,driver<551 brand=nvidiartx,driver>=550,driver<551 brand=vapps,driver>=550,driver<551 brand=vpc,driver>=550,driver<551 brand=vcs,driver>=550,driver<551 brand=vws,driver>=550,driver<551 brand=cloudgaming,driver>=550,driver<551 brand=unknown,driver>=560,driver<561 brand=grid,driver>=560,driver<561 brand=tesla,driver>=560,driver<561 brand=nvidia,driver>=560,driver<561 brand=quadro,driver>=560,driver<561 brand=quadrortx,driver>=560,driver<561 brand=nvidiartx,driver>=560,driver<561 brand=vapps,driver>=560,driver<561 brand=vpc,driver>=560,driver<561 brand=vcs,driver>=560,driver<561 brand=vws,driver>=560,driver<561 brand=cloudgaming,driver>=560,driver<561 brand=unknown,driver>=565,driver<566 brand=grid,driver>=565,driver<566 brand=tesla,driver>=565,driver<566 brand=nvidia,driver>=565,driver<566 brand=quadro,driver>=565,driver<566 brand=quadrortx,driver>=565,driver<566 brand=nvidiartx,driver>=565,driver<566 brand=vapps,driver>=565,driver<566 brand=vpc,driver>=565,driver<566 brand=vcs,driver>=565,driver<566 brand=vws,driver>=565,driver<566 brand=cloudgaming,driver>=565,driver<566 brand=unknown,driver>=570,driver<571 brand=grid,driver>=570,driver<571 brand=tesla,driver>=570,driver<571 brand=nvidia,driver>=570,driver<571 brand=quadro,driver>=570,driver<571 brand=quadrortx,driver>=570,driver<571 brand=nvidiartx,driver>=570,driver<571 brand=vapps,driver>=570,driver<571 brand=vpc,driver>=570,driver<571 brand=vcs,driver>=570,driver<571 brand=vws,driver>=570,driver<571 brand=cloudgaming,driver>=570,driver<571 NV_CUDA_CUDART_VERSION=12.9.79-1 CUDA_VERSION=12.9.1 LD_LIBRARY_PATH=/usr/local/cuda/lib64:/usr/local/cuda/lib64 NVIDIA_VISIBLE_DEVICES=all NVIDIA_DRIVER_CAPABILITIES=compute,utility NV_CUDA_LIB_VERSION=12.9.1-1 NV_NVTX_VERSION=12.9.79-1 NV_LIBNPP_VERSION=12.4.1.87-1 NV_LIBNPP_PACKAGE=libnpp-12-9=12.4.1.87-1 NV_LIBCUSPARSE_VERSION=12.5.10.65-1 NV_LIBCUBLAS_PACKAGE_NAME=libcublas-12-9 NV_LIBCUBLAS_VERSION=12.9.1.4-1 NV_LIBCUBLAS_PACKAGE=libcublas-12-9=12.9.1.4-1 NV_LIBNCCL_PACKAGE_NAME=libnccl2 NV_LIBNCCL_PACKAGE_VERSION=2.27.3-1 NCCL_VERSION=2.27.3-1 NV_LIBNCCL_PACKAGE=libnccl2=2.27.3-1+cuda12.9 NVIDIA_PRODUCT_NAME=CUDA HUGGINGFACE_HUB_CACHE=/data PORT=80 USE_FLASH_ATTENTION=True
镜像标签
NVIDIA CORPORATION <cudatools@nvidia.com>: maintainer 2026-02-25T11:18:28.441Z: org.opencontainers.image.created A blazing fast inference solution for text embeddings models: org.opencontainers.image.description Apache-2.0: org.opencontainers.image.licenses ubuntu: org.opencontainers.image.ref.name 1d6ceb4883230aee3a4e53b7d5d6c0b5477a335c: org.opencontainers.image.revision https://github.com/huggingface/text-embeddings-inference: org.opencontainers.image.source text-embeddings-inference: org.opencontainers.image.title https://github.com/huggingface/text-embeddings-inference: org.opencontainers.image.url cuda-1.9.2: org.opencontainers.image.version

Docker拉取命令

docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2
docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2  ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2

Containerd拉取命令

ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2
ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2  ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2

Shell快速替换命令

sed -i 's#ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2#swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2#' deployment.yaml

Ansible快速分发-Docker

#ansible k8s -m shell -a 'docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2 && docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2  ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2'

Ansible快速分发-Containerd

#ansible k8s -m shell -a 'ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2 && ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2  ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2'

镜像构建历史


# 2026-02-25 20:05:07  0.00B 设置默认要执行的命令
CMD ["--json-output"]
                        
# 2026-02-25 20:05:07  0.00B 配置容器启动时运行的命令
ENTRYPOINT ["./entrypoint.sh"]
                        
# 2026-02-25 20:05:07  1.47KB 复制新文件或目录到容器中
COPY --chmod=775 cuda-all-entrypoint.sh entrypoint.sh # buildkit
                        
# 2026-02-25 20:05:07  296.07MB 复制新文件或目录到容器中
COPY /usr/src/target/release/text-embeddings-router-120 /usr/local/bin/text-embeddings-router-120 # buildkit
                        
# 2026-02-25 20:05:07  296.07MB 复制新文件或目录到容器中
COPY /usr/src/target/release/text-embeddings-router-100 /usr/local/bin/text-embeddings-router-100 # buildkit
                        
# 2026-02-25 20:05:07  296.07MB 复制新文件或目录到容器中
COPY /usr/src/target/release/text-embeddings-router-90 /usr/local/bin/text-embeddings-router-90 # buildkit
                        
# 2026-02-25 20:05:07  296.07MB 复制新文件或目录到容器中
COPY /usr/src/target/release/text-embeddings-router-80 /usr/local/bin/text-embeddings-router-80 # buildkit
                        
# 2026-02-25 20:05:07  97.56MB 复制新文件或目录到容器中
COPY /usr/src/target/release/text-embeddings-router-75 /usr/local/bin/text-embeddings-router-75 # buildkit
                        
# 2026-02-18 04:06:33  26.06MB 执行命令并创建新的镜像层
RUN |1 DEFAULT_USE_FLASH_ATTENTION=True /bin/sh -c apt-get update && DEBIAN_FRONTEND=noninteractive apt-get install -y --no-install-recommends     ca-certificates     libssl-dev     curl     cuda-compat-12-9     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2026-02-18 04:06:33  0.00B 设置环境变量 HUGGINGFACE_HUB_CACHE PORT USE_FLASH_ATTENTION LD_LIBRARY_PATH
ENV HUGGINGFACE_HUB_CACHE=/data PORT=80 USE_FLASH_ATTENTION=True LD_LIBRARY_PATH=/usr/local/cuda/lib64:/usr/local/cuda/lib64
                        
# 2026-02-18 04:06:33  0.00B 定义构建参数
ARG DEFAULT_USE_FLASH_ATTENTION=True
                        
# 2025-07-19 04:22:35  0.00B 配置容器启动时运行的命令
ENTRYPOINT ["/opt/nvidia/nvidia_entrypoint.sh"]
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NVIDIA_PRODUCT_NAME
ENV NVIDIA_PRODUCT_NAME=CUDA
                        
# 2025-07-19 04:22:35  2.53KB 复制新文件或目录到容器中
COPY nvidia_entrypoint.sh /opt/nvidia/ # buildkit
                        
# 2025-07-19 04:22:35  3.06KB 复制新文件或目录到容器中
COPY entrypoint.d/ /opt/nvidia/entrypoint.d/ # buildkit
                        
# 2025-07-19 04:22:35  240.07KB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=amd64 /bin/sh -c apt-mark hold ${NV_LIBCUBLAS_PACKAGE_NAME} ${NV_LIBNCCL_PACKAGE_NAME} # buildkit
                        
# 2025-07-19 04:22:35  3.46GB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=amd64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     cuda-libraries-12-9=${NV_CUDA_LIB_VERSION}     ${NV_LIBNPP_PACKAGE}     cuda-nvtx-12-9=${NV_NVTX_VERSION}     libcusparse-12-9=${NV_LIBCUSPARSE_VERSION}     ${NV_LIBCUBLAS_PACKAGE}     ${NV_LIBNCCL_PACKAGE}     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-07-19 04:22:35  0.00B 添加元数据标签
LABEL maintainer=NVIDIA CORPORATION <cudatools@nvidia.com>
                        
# 2025-07-19 04:22:35  0.00B 定义构建参数
ARG TARGETARCH
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBNCCL_PACKAGE
ENV NV_LIBNCCL_PACKAGE=libnccl2=2.27.3-1+cuda12.9
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NCCL_VERSION
ENV NCCL_VERSION=2.27.3-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBNCCL_PACKAGE_VERSION
ENV NV_LIBNCCL_PACKAGE_VERSION=2.27.3-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBNCCL_PACKAGE_NAME
ENV NV_LIBNCCL_PACKAGE_NAME=libnccl2
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBCUBLAS_PACKAGE
ENV NV_LIBCUBLAS_PACKAGE=libcublas-12-9=12.9.1.4-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBCUBLAS_VERSION
ENV NV_LIBCUBLAS_VERSION=12.9.1.4-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBCUBLAS_PACKAGE_NAME
ENV NV_LIBCUBLAS_PACKAGE_NAME=libcublas-12-9
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBCUSPARSE_VERSION
ENV NV_LIBCUSPARSE_VERSION=12.5.10.65-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBNPP_PACKAGE
ENV NV_LIBNPP_PACKAGE=libnpp-12-9=12.4.1.87-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBNPP_VERSION
ENV NV_LIBNPP_VERSION=12.4.1.87-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_NVTX_VERSION
ENV NV_NVTX_VERSION=12.9.79-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_CUDA_LIB_VERSION
ENV NV_CUDA_LIB_VERSION=12.9.1-1
                        
# 2025-07-19 04:11:18  0.00B 设置环境变量 NVIDIA_DRIVER_CAPABILITIES
ENV NVIDIA_DRIVER_CAPABILITIES=compute,utility
                        
# 2025-07-19 04:11:18  0.00B 设置环境变量 NVIDIA_VISIBLE_DEVICES
ENV NVIDIA_VISIBLE_DEVICES=all
                        
# 2025-07-19 04:11:18  17.29KB 复制新文件或目录到容器中
COPY NGC-DL-CONTAINER-LICENSE / # buildkit
                        
# 2025-07-19 04:11:18  0.00B 设置环境变量 LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/cuda/lib64
                        
# 2025-07-19 04:11:18  0.00B 设置环境变量 PATH
ENV PATH=/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
                        
# 2025-07-19 04:11:18  22.00B 执行命令并创建新的镜像层
RUN |1 TARGETARCH=amd64 /bin/sh -c echo "/usr/local/cuda/lib64" >> /etc/ld.so.conf.d/nvidia.conf # buildkit
                        
# 2025-07-19 04:11:18  315.62MB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=amd64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     cuda-cudart-12-9=${NV_CUDA_CUDART_VERSION}     cuda-compat-12-9     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-07-19 04:11:02  0.00B 设置环境变量 CUDA_VERSION
ENV CUDA_VERSION=12.9.1
                        
# 2025-07-19 04:11:02  10.26MB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=amd64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     gnupg2 curl ca-certificates &&     curl -fsSL https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/${NVARCH}/3bf863cc.pub | apt-key add - &&     echo "deb https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/${NVARCH} /" > /etc/apt/sources.list.d/cuda.list &&     apt-get purge --autoremove -y curl     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-07-19 04:11:02  0.00B 添加元数据标签
LABEL maintainer=NVIDIA CORPORATION <cudatools@nvidia.com>
                        
# 2025-07-19 04:11:02  0.00B 定义构建参数
ARG TARGETARCH
                        
# 2025-07-19 04:11:02  0.00B 设置环境变量 NV_CUDA_CUDART_VERSION
ENV NV_CUDA_CUDART_VERSION=12.9.79-1
                        
# 2025-07-19 04:11:02  0.00B 设置环境变量 NVIDIA_REQUIRE_CUDA brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand
ENV NVIDIA_REQUIRE_CUDA=cuda>=12.9 brand=unknown,driver>=535,driver<536 brand=grid,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=vapps,driver>=535,driver<536 brand=vpc,driver>=535,driver<536 brand=vcs,driver>=535,driver<536 brand=vws,driver>=535,driver<536 brand=cloudgaming,driver>=535,driver<536 brand=unknown,driver>=550,driver<551 brand=grid,driver>=550,driver<551 brand=tesla,driver>=550,driver<551 brand=nvidia,driver>=550,driver<551 brand=quadro,driver>=550,driver<551 brand=quadrortx,driver>=550,driver<551 brand=nvidiartx,driver>=550,driver<551 brand=vapps,driver>=550,driver<551 brand=vpc,driver>=550,driver<551 brand=vcs,driver>=550,driver<551 brand=vws,driver>=550,driver<551 brand=cloudgaming,driver>=550,driver<551 brand=unknown,driver>=560,driver<561 brand=grid,driver>=560,driver<561 brand=tesla,driver>=560,driver<561 brand=nvidia,driver>=560,driver<561 brand=quadro,driver>=560,driver<561 brand=quadrortx,driver>=560,driver<561 brand=nvidiartx,driver>=560,driver<561 brand=vapps,driver>=560,driver<561 brand=vpc,driver>=560,driver<561 brand=vcs,driver>=560,driver<561 brand=vws,driver>=560,driver<561 brand=cloudgaming,driver>=560,driver<561 brand=unknown,driver>=565,driver<566 brand=grid,driver>=565,driver<566 brand=tesla,driver>=565,driver<566 brand=nvidia,driver>=565,driver<566 brand=quadro,driver>=565,driver<566 brand=quadrortx,driver>=565,driver<566 brand=nvidiartx,driver>=565,driver<566 brand=vapps,driver>=565,driver<566 brand=vpc,driver>=565,driver<566 brand=vcs,driver>=565,driver<566 brand=vws,driver>=565,driver<566 brand=cloudgaming,driver>=565,driver<566 brand=unknown,driver>=570,driver<571 brand=grid,driver>=570,driver<571 brand=tesla,driver>=570,driver<571 brand=nvidia,driver>=570,driver<571 brand=quadro,driver>=570,driver<571 brand=quadrortx,driver>=570,driver<571 brand=nvidiartx,driver>=570,driver<571 brand=vapps,driver>=570,driver<571 brand=vpc,driver>=570,driver<571 brand=vcs,driver>=570,driver<571 brand=vws,driver>=570,driver<571 brand=cloudgaming,driver>=570,driver<571
                        
# 2025-07-19 04:11:02  0.00B 设置环境变量 NVARCH
ENV NVARCH=x86_64
                        
# 2025-07-15 00:40:23  0.00B 
/bin/sh -c #(nop)  CMD ["/bin/bash"]
                        
# 2025-07-15 00:40:22  78.12MB 
/bin/sh -c #(nop) ADD file:b4619a63cd7829e1338ddaa4995ca17003002dd54b0dfd675a6f54a2b69151a6 in / 
                        
# 2025-07-15 00:40:20  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.version=24.04
                        
# 2025-07-15 00:40:20  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.ref.name=ubuntu
                        
# 2025-07-15 00:40:20  0.00B 
/bin/sh -c #(nop)  ARG LAUNCHPAD_BUILD_ARCH
                        
# 2025-07-15 00:40:20  0.00B 
/bin/sh -c #(nop)  ARG RELEASE
                        
                    

镜像信息

{
    "Id": "sha256:a58880345dce36b9a5903d1f69514fad97cfda99f2eb418df1d35a9dc1016dda",
    "RepoTags": [
        "ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2"
    ],
    "RepoDigests": [
        "ghcr.io/huggingface/text-embeddings-inference@sha256:dfef329df7c6c4620b74c8f0741e8b88d155998a73dec907908dbbd42a20e009",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference@sha256:9a0a3aa50d3302cc4d0d7ddbeae52a1413a570bf8cc2fb6c4ecbb368af1c8d89"
    ],
    "Parent": "",
    "Comment": "buildkit.dockerfile.v0",
    "Created": "2026-02-25T12:05:07.135923289Z",
    "Container": "",
    "ContainerConfig": null,
    "DockerVersion": "",
    "Author": "",
    "Config": {
        "Hostname": "",
        "Domainname": "",
        "User": "",
        "AttachStdin": false,
        "AttachStdout": false,
        "AttachStderr": false,
        "Tty": false,
        "OpenStdin": false,
        "StdinOnce": false,
        "Env": [
            "PATH=/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
            "NVARCH=x86_64",
            "NVIDIA_REQUIRE_CUDA=cuda\u003e=12.9 brand=unknown,driver\u003e=535,driver\u003c536 brand=grid,driver\u003e=535,driver\u003c536 brand=tesla,driver\u003e=535,driver\u003c536 brand=nvidia,driver\u003e=535,driver\u003c536 brand=quadro,driver\u003e=535,driver\u003c536 brand=quadrortx,driver\u003e=535,driver\u003c536 brand=nvidiartx,driver\u003e=535,driver\u003c536 brand=vapps,driver\u003e=535,driver\u003c536 brand=vpc,driver\u003e=535,driver\u003c536 brand=vcs,driver\u003e=535,driver\u003c536 brand=vws,driver\u003e=535,driver\u003c536 brand=cloudgaming,driver\u003e=535,driver\u003c536 brand=unknown,driver\u003e=550,driver\u003c551 brand=grid,driver\u003e=550,driver\u003c551 brand=tesla,driver\u003e=550,driver\u003c551 brand=nvidia,driver\u003e=550,driver\u003c551 brand=quadro,driver\u003e=550,driver\u003c551 brand=quadrortx,driver\u003e=550,driver\u003c551 brand=nvidiartx,driver\u003e=550,driver\u003c551 brand=vapps,driver\u003e=550,driver\u003c551 brand=vpc,driver\u003e=550,driver\u003c551 brand=vcs,driver\u003e=550,driver\u003c551 brand=vws,driver\u003e=550,driver\u003c551 brand=cloudgaming,driver\u003e=550,driver\u003c551 brand=unknown,driver\u003e=560,driver\u003c561 brand=grid,driver\u003e=560,driver\u003c561 brand=tesla,driver\u003e=560,driver\u003c561 brand=nvidia,driver\u003e=560,driver\u003c561 brand=quadro,driver\u003e=560,driver\u003c561 brand=quadrortx,driver\u003e=560,driver\u003c561 brand=nvidiartx,driver\u003e=560,driver\u003c561 brand=vapps,driver\u003e=560,driver\u003c561 brand=vpc,driver\u003e=560,driver\u003c561 brand=vcs,driver\u003e=560,driver\u003c561 brand=vws,driver\u003e=560,driver\u003c561 brand=cloudgaming,driver\u003e=560,driver\u003c561 brand=unknown,driver\u003e=565,driver\u003c566 brand=grid,driver\u003e=565,driver\u003c566 brand=tesla,driver\u003e=565,driver\u003c566 brand=nvidia,driver\u003e=565,driver\u003c566 brand=quadro,driver\u003e=565,driver\u003c566 brand=quadrortx,driver\u003e=565,driver\u003c566 brand=nvidiartx,driver\u003e=565,driver\u003c566 brand=vapps,driver\u003e=565,driver\u003c566 brand=vpc,driver\u003e=565,driver\u003c566 brand=vcs,driver\u003e=565,driver\u003c566 brand=vws,driver\u003e=565,driver\u003c566 brand=cloudgaming,driver\u003e=565,driver\u003c566 brand=unknown,driver\u003e=570,driver\u003c571 brand=grid,driver\u003e=570,driver\u003c571 brand=tesla,driver\u003e=570,driver\u003c571 brand=nvidia,driver\u003e=570,driver\u003c571 brand=quadro,driver\u003e=570,driver\u003c571 brand=quadrortx,driver\u003e=570,driver\u003c571 brand=nvidiartx,driver\u003e=570,driver\u003c571 brand=vapps,driver\u003e=570,driver\u003c571 brand=vpc,driver\u003e=570,driver\u003c571 brand=vcs,driver\u003e=570,driver\u003c571 brand=vws,driver\u003e=570,driver\u003c571 brand=cloudgaming,driver\u003e=570,driver\u003c571",
            "NV_CUDA_CUDART_VERSION=12.9.79-1",
            "CUDA_VERSION=12.9.1",
            "LD_LIBRARY_PATH=/usr/local/cuda/lib64:/usr/local/cuda/lib64",
            "NVIDIA_VISIBLE_DEVICES=all",
            "NVIDIA_DRIVER_CAPABILITIES=compute,utility",
            "NV_CUDA_LIB_VERSION=12.9.1-1",
            "NV_NVTX_VERSION=12.9.79-1",
            "NV_LIBNPP_VERSION=12.4.1.87-1",
            "NV_LIBNPP_PACKAGE=libnpp-12-9=12.4.1.87-1",
            "NV_LIBCUSPARSE_VERSION=12.5.10.65-1",
            "NV_LIBCUBLAS_PACKAGE_NAME=libcublas-12-9",
            "NV_LIBCUBLAS_VERSION=12.9.1.4-1",
            "NV_LIBCUBLAS_PACKAGE=libcublas-12-9=12.9.1.4-1",
            "NV_LIBNCCL_PACKAGE_NAME=libnccl2",
            "NV_LIBNCCL_PACKAGE_VERSION=2.27.3-1",
            "NCCL_VERSION=2.27.3-1",
            "NV_LIBNCCL_PACKAGE=libnccl2=2.27.3-1+cuda12.9",
            "NVIDIA_PRODUCT_NAME=CUDA",
            "HUGGINGFACE_HUB_CACHE=/data",
            "PORT=80",
            "USE_FLASH_ATTENTION=True"
        ],
        "Cmd": [
            "--json-output"
        ],
        "ArgsEscaped": true,
        "Image": "",
        "Volumes": null,
        "WorkingDir": "",
        "Entrypoint": [
            "./entrypoint.sh"
        ],
        "OnBuild": null,
        "Labels": {
            "maintainer": "NVIDIA CORPORATION \u003ccudatools@nvidia.com\u003e",
            "org.opencontainers.image.created": "2026-02-25T11:18:28.441Z",
            "org.opencontainers.image.description": "A blazing fast inference solution for text embeddings models",
            "org.opencontainers.image.licenses": "Apache-2.0",
            "org.opencontainers.image.ref.name": "ubuntu",
            "org.opencontainers.image.revision": "1d6ceb4883230aee3a4e53b7d5d6c0b5477a335c",
            "org.opencontainers.image.source": "https://github.com/huggingface/text-embeddings-inference",
            "org.opencontainers.image.title": "text-embeddings-inference",
            "org.opencontainers.image.url": "https://github.com/huggingface/text-embeddings-inference",
            "org.opencontainers.image.version": "cuda-1.9.2"
        }
    },
    "Architecture": "amd64",
    "Os": "linux",
    "Size": 5167402857,
    "GraphDriver": {
        "Data": {
            "LowerDir": "/var/lib/docker/overlay2/95b1feca6927a17b3318ca6f4a0ab8d4a6814a4286cbccab228b1c4402ee0526/diff:/var/lib/docker/overlay2/d33f161faeedd6cd7ad798e9454bb9460aa531bc68fe5e26d09cc9728f960f10/diff:/var/lib/docker/overlay2/e67aafa6aa1ad844e761d3d4d26177870fc398690de8485e322d0966bcb1d539/diff:/var/lib/docker/overlay2/d35bfab9160272e76a3913b77d28fb61791062c586270904088383e4ffe050f6/diff:/var/lib/docker/overlay2/a76822941b402a3182838dc6256c86a28b202d5a9973bc5a20446d1377f912d3/diff:/var/lib/docker/overlay2/4aeb312fef49cc44b535971d4c90170e1245ae0c34dc9d93f0ea602a361dfcfb/diff:/var/lib/docker/overlay2/4f7ccc8e157c0f186e961391252ffc3f9afed90255ee8e89e175d996ddc3305b/diff:/var/lib/docker/overlay2/b12b06881b4bb9251cd68202a1da22105635eec03fc34885554a0a9dacfcc8dd/diff:/var/lib/docker/overlay2/6d96c186da99f207bac2b2016f3f6077e966700c1ada65379bcacd85ac5b8890/diff:/var/lib/docker/overlay2/d28e0d314c5887e05cfc5f1af7d28399b3b3d07fd988b6910d5c81b031f26c09/diff:/var/lib/docker/overlay2/28aeb583e12af7def9ddfac74e150984cf8b04a379b846cf8731b8ad51b2f170/diff:/var/lib/docker/overlay2/f6d3b3c63e4f327b732b9183d163833e972b4375363bd22e3b4b64cb69447721/diff:/var/lib/docker/overlay2/e058824df8472b0fcd8fe85454f5627d7259c74c33b2ed124a4abfb1dd613c90/diff:/var/lib/docker/overlay2/6971d39ed44508c220cdde4ebf8338bf0f7536a72590bbd8ab8b9b414aee1977/diff:/var/lib/docker/overlay2/c05572b62ddbf5d453493fcc2cce4695232b95c3e0b4006dda66ec8e7216f7d0/diff",
            "MergedDir": "/var/lib/docker/overlay2/1ee1b892d8076178b5578d7b8e88cfeea89f678c194ef9c34fbcce563f719dfd/merged",
            "UpperDir": "/var/lib/docker/overlay2/1ee1b892d8076178b5578d7b8e88cfeea89f678c194ef9c34fbcce563f719dfd/diff",
            "WorkDir": "/var/lib/docker/overlay2/1ee1b892d8076178b5578d7b8e88cfeea89f678c194ef9c34fbcce563f719dfd/work"
        },
        "Name": "overlay2"
    },
    "RootFS": {
        "Type": "layers",
        "Layers": [
            "sha256:107cbdaeec042e6154640c94972c638f4e2fee795902b149e8ce9acbd03d59d7",
            "sha256:d54e805fe9aac70390dd3e3ca1b6fd0f63b2cf5913eb0023bf741bd291d2a746",
            "sha256:35a8bdbbbf277997b1e1efcbdc9ae5321ec3a96ceb81ad194ad58d671ab47bd4",
            "sha256:12f5e36b8e7e83c3fa409e1b34eaec072b9381c8a5e600e9f03c53dbd295bec1",
            "sha256:455bca42f6ec40ca42fea4bea15c6c17b97101af90413cdd25647de2b9d98960",
            "sha256:6e980015d67b5cae225c0adf06e30b516f62600df02350db433029e63968b932",
            "sha256:97486d1882c55d529af69661c72297f97dd38801c0dfa771cbbbe649da358078",
            "sha256:d38a3b5fb295c76c1383bf5f56fa8ede57ce3ad574a6a6918307daab6ae62889",
            "sha256:3cefe4836f3110ecc622b831aafa0c278feabc23c5aeb8a914f9f90727fcbd1a",
            "sha256:4d27fa4325349b8bb3cd04cf99513ab7bbb35d927c4e7d256cb6003589d4f5f1",
            "sha256:7d266c2633588b07419515e2e6043c90255160cb6689d42d315688bda593b25b",
            "sha256:ebed12a55ebda267f83a3214568c56052252c5819152f497abf2210aef1ffecf",
            "sha256:243e202da5d9e35cc78c32ae69750ede7795447b9811672061c308adca8d26c9",
            "sha256:e06105a188fd34001afddfdabd530c95f8d379b397f4efc99b8f82df76b35dd3",
            "sha256:0ca4149573ccd0c734f4dfdde042c02bc6221b46d6a3f2243b924f3fddb71ec1",
            "sha256:66727fb9dcb2167d136ebd0f279f5f28a012dd2dda11ce13eba8e87400941d8a"
        ]
    },
    "Metadata": {
        "LastTagTime": "2026-03-13T00:38:39.479236117+08:00"
    }
}

更多版本

ghcr.io/huggingface/text-embeddings-inference:cpu-1.2

linux/amd64 ghcr.io636.68MB2024-07-25 11:53
1081

ghcr.io/huggingface/text-embeddings-inference:turing-1.5

linux/amd64 ghcr.io900.87MB2024-10-23 11:14
1034

ghcr.io/huggingface/text-embeddings-inference:cpu-latest

linux/amd64 ghcr.io660.31MB2024-12-04 09:13
1241

ghcr.io/huggingface/text-embeddings-inference:1.6

linux/amd64 ghcr.io1.21GB2025-02-25 09:27
1309

ghcr.io/huggingface/text-embeddings-inference:cpu-1.6

linux/amd64 ghcr.io659.95MB2025-02-25 09:47
665

ghcr.io/huggingface/text-embeddings-inference:86-1.6.1

linux/amd64 ghcr.io1.31GB2025-03-31 11:03
466

ghcr.io/huggingface/text-embeddings-inference:86-1.7.0

linux/amd64 ghcr.io1.11GB2025-04-09 09:33
632

ghcr.io/huggingface/text-embeddings-inference:hopper-1.7

linux/amd64 ghcr.io1.11GB2025-04-14 16:27
493

ghcr.io/huggingface/text-embeddings-inference:latest

linux/amd64 ghcr.io1.11GB2025-04-17 14:51
744

ghcr.io/huggingface/text-embeddings-inference:cpu-1.7

linux/amd64 ghcr.io683.64MB2025-04-29 22:28
741

ghcr.io/huggingface/text-embeddings-inference:1.7

linux/amd64 ghcr.io1.11GB2025-05-14 09:19
483

ghcr.io/huggingface/text-embeddings-inference:86-1.7.1

linux/amd64 ghcr.io1.11GB2025-06-09 16:24
352

ghcr.io/huggingface/text-embeddings-inference:hopper-1.7.1

linux/amd64 ghcr.io1.12GB2025-06-11 17:26
441

ghcr.io/huggingface/text-embeddings-inference:1.7.1

linux/amd64 ghcr.io1.11GB2025-06-13 16:46
393

ghcr.io/huggingface/text-embeddings-inference:1.7.4

linux/amd64 ghcr.io1.11GB2025-07-08 09:31
356

ghcr.io/huggingface/text-embeddings-inference:1.8.0

linux/amd64 ghcr.io1.11GB2025-08-15 16:52
291

ghcr.io/huggingface/text-embeddings-inference:1.8

linux/amd64 ghcr.io1.11GB2025-08-15 17:00
629

ghcr.io/huggingface/text-embeddings-inference:cpu-1.8

linux/amd64 ghcr.io684.32MB2025-09-04 10:19
492

ghcr.io/huggingface/text-embeddings-inference:cuda-1.8.1

linux/amd64 ghcr.io2.65GB2025-09-11 17:54
521

ghcr.io/huggingface/text-embeddings-inference:hopper-1.8

linux/amd64 ghcr.io1.12GB2025-09-11 18:36
267

ghcr.io/huggingface/text-embeddings-inference:86-1.8.2

linux/amd64 ghcr.io1.11GB2025-09-18 21:54
304

ghcr.io/huggingface/text-embeddings-inference:cpu-1.7.2

linux/amd64 ghcr.io684.24MB2025-10-14 15:13
253

ghcr.io/huggingface/text-embeddings-inference:1.7.2

linux/amd64 ghcr.io1.11GB2025-10-14 16:56
250

ghcr.io/huggingface/text-embeddings-inference:89-1.8

linux/amd64 ghcr.io1.11GB2025-10-22 18:00
320

ghcr.io/huggingface/text-embeddings-inference:turing-1.8

linux/amd64 ghcr.io930.07MB2025-10-31 18:57
383

ghcr.io/huggingface/text-embeddings-inference:cpu-1.8.2

linux/amd64 ghcr.io685.78MB2025-12-08 15:35
230

ghcr.io/huggingface/text-embeddings-inference:86-1.9

linux/amd64 ghcr.io5.01GB2026-02-25 00:16
76

ghcr.io/huggingface/text-embeddings-inference:cuda-1.9

linux/amd64 ghcr.io5.17GB2026-02-25 09:12
136

ghcr.io/huggingface/text-embeddings-inference:cuda-1.9.2

linux/amd64 ghcr.io5.17GB2026-03-13 00:40
7