ghcr.io/huggingface/text-embeddings-inference:cuda-1.9 linux/amd64

ghcr.io/huggingface/text-embeddings-inference:cuda-1.9 - 国内下载镜像源 浏览次数:14

文本嵌入推断

Hugging Face 提供了一个用于文本嵌入推断的 Docker 镜像。
源镜像 ghcr.io/huggingface/text-embeddings-inference:cuda-1.9
国内镜像 swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9
镜像ID sha256:fe9f2c34844c446e48faf4132650b8e3ab769c5b218e1e9d31912ef6472b3796
镜像TAG cuda-1.9
大小 5.17GB
镜像源 ghcr.io
CMD --json-output
启动入口 ./entrypoint.sh
工作目录
OS/平台 linux/amd64
浏览量 14 次
贡献者
镜像创建 2026-02-17T21:46:20.422845791Z
同步时间 2026-02-25 09:12
环境变量
PATH=/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin NVARCH=x86_64 NVIDIA_REQUIRE_CUDA=cuda>=12.9 brand=unknown,driver>=535,driver<536 brand=grid,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=vapps,driver>=535,driver<536 brand=vpc,driver>=535,driver<536 brand=vcs,driver>=535,driver<536 brand=vws,driver>=535,driver<536 brand=cloudgaming,driver>=535,driver<536 brand=unknown,driver>=550,driver<551 brand=grid,driver>=550,driver<551 brand=tesla,driver>=550,driver<551 brand=nvidia,driver>=550,driver<551 brand=quadro,driver>=550,driver<551 brand=quadrortx,driver>=550,driver<551 brand=nvidiartx,driver>=550,driver<551 brand=vapps,driver>=550,driver<551 brand=vpc,driver>=550,driver<551 brand=vcs,driver>=550,driver<551 brand=vws,driver>=550,driver<551 brand=cloudgaming,driver>=550,driver<551 brand=unknown,driver>=560,driver<561 brand=grid,driver>=560,driver<561 brand=tesla,driver>=560,driver<561 brand=nvidia,driver>=560,driver<561 brand=quadro,driver>=560,driver<561 brand=quadrortx,driver>=560,driver<561 brand=nvidiartx,driver>=560,driver<561 brand=vapps,driver>=560,driver<561 brand=vpc,driver>=560,driver<561 brand=vcs,driver>=560,driver<561 brand=vws,driver>=560,driver<561 brand=cloudgaming,driver>=560,driver<561 brand=unknown,driver>=565,driver<566 brand=grid,driver>=565,driver<566 brand=tesla,driver>=565,driver<566 brand=nvidia,driver>=565,driver<566 brand=quadro,driver>=565,driver<566 brand=quadrortx,driver>=565,driver<566 brand=nvidiartx,driver>=565,driver<566 brand=vapps,driver>=565,driver<566 brand=vpc,driver>=565,driver<566 brand=vcs,driver>=565,driver<566 brand=vws,driver>=565,driver<566 brand=cloudgaming,driver>=565,driver<566 brand=unknown,driver>=570,driver<571 brand=grid,driver>=570,driver<571 brand=tesla,driver>=570,driver<571 brand=nvidia,driver>=570,driver<571 brand=quadro,driver>=570,driver<571 brand=quadrortx,driver>=570,driver<571 brand=nvidiartx,driver>=570,driver<571 brand=vapps,driver>=570,driver<571 brand=vpc,driver>=570,driver<571 brand=vcs,driver>=570,driver<571 brand=vws,driver>=570,driver<571 brand=cloudgaming,driver>=570,driver<571 NV_CUDA_CUDART_VERSION=12.9.79-1 CUDA_VERSION=12.9.1 LD_LIBRARY_PATH=/usr/local/cuda/lib64:/usr/local/cuda/lib64 NVIDIA_VISIBLE_DEVICES=all NVIDIA_DRIVER_CAPABILITIES=compute,utility NV_CUDA_LIB_VERSION=12.9.1-1 NV_NVTX_VERSION=12.9.79-1 NV_LIBNPP_VERSION=12.4.1.87-1 NV_LIBNPP_PACKAGE=libnpp-12-9=12.4.1.87-1 NV_LIBCUSPARSE_VERSION=12.5.10.65-1 NV_LIBCUBLAS_PACKAGE_NAME=libcublas-12-9 NV_LIBCUBLAS_VERSION=12.9.1.4-1 NV_LIBCUBLAS_PACKAGE=libcublas-12-9=12.9.1.4-1 NV_LIBNCCL_PACKAGE_NAME=libnccl2 NV_LIBNCCL_PACKAGE_VERSION=2.27.3-1 NCCL_VERSION=2.27.3-1 NV_LIBNCCL_PACKAGE=libnccl2=2.27.3-1+cuda12.9 NVIDIA_PRODUCT_NAME=CUDA HUGGINGFACE_HUB_CACHE=/data PORT=80 USE_FLASH_ATTENTION=True
镜像标签
NVIDIA CORPORATION <cudatools@nvidia.com>: maintainer 2026-02-17T21:01:20.111Z: org.opencontainers.image.created A blazing fast inference solution for text embeddings models: org.opencontainers.image.description Apache-2.0: org.opencontainers.image.licenses ubuntu: org.opencontainers.image.ref.name b38b8f111a23330831bf1b6ee5b33efb932319f8: org.opencontainers.image.revision https://github.com/huggingface/text-embeddings-inference: org.opencontainers.image.source text-embeddings-inference: org.opencontainers.image.title https://github.com/huggingface/text-embeddings-inference: org.opencontainers.image.url cuda-1.9.1: org.opencontainers.image.version

Docker拉取命令

docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9
docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9  ghcr.io/huggingface/text-embeddings-inference:cuda-1.9

Containerd拉取命令

ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9
ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9  ghcr.io/huggingface/text-embeddings-inference:cuda-1.9

Shell快速替换命令

sed -i 's#ghcr.io/huggingface/text-embeddings-inference:cuda-1.9#swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9#' deployment.yaml

Ansible快速分发-Docker

#ansible k8s -m shell -a 'docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9 && docker tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9  ghcr.io/huggingface/text-embeddings-inference:cuda-1.9'

Ansible快速分发-Containerd

#ansible k8s -m shell -a 'ctr images pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9 && ctr images tag  swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9  ghcr.io/huggingface/text-embeddings-inference:cuda-1.9'

镜像构建历史


# 2026-02-18 05:46:20  0.00B 设置默认要执行的命令
CMD ["--json-output"]
                        
# 2026-02-18 05:46:20  0.00B 配置容器启动时运行的命令
ENTRYPOINT ["./entrypoint.sh"]
                        
# 2026-02-18 05:46:20  1.47KB 复制新文件或目录到容器中
COPY --chmod=775 cuda-all-entrypoint.sh entrypoint.sh # buildkit
                        
# 2026-02-18 05:46:20  296.03MB 复制新文件或目录到容器中
COPY /usr/src/target/release/text-embeddings-router-120 /usr/local/bin/text-embeddings-router-120 # buildkit
                        
# 2026-02-18 05:46:20  296.03MB 复制新文件或目录到容器中
COPY /usr/src/target/release/text-embeddings-router-100 /usr/local/bin/text-embeddings-router-100 # buildkit
                        
# 2026-02-18 05:46:20  296.03MB 复制新文件或目录到容器中
COPY /usr/src/target/release/text-embeddings-router-90 /usr/local/bin/text-embeddings-router-90 # buildkit
                        
# 2026-02-18 05:46:20  296.03MB 复制新文件或目录到容器中
COPY /usr/src/target/release/text-embeddings-router-80 /usr/local/bin/text-embeddings-router-80 # buildkit
                        
# 2026-02-18 05:46:20  97.52MB 复制新文件或目录到容器中
COPY /usr/src/target/release/text-embeddings-router-75 /usr/local/bin/text-embeddings-router-75 # buildkit
                        
# 2026-02-18 04:06:33  26.06MB 执行命令并创建新的镜像层
RUN |1 DEFAULT_USE_FLASH_ATTENTION=True /bin/sh -c apt-get update && DEBIAN_FRONTEND=noninteractive apt-get install -y --no-install-recommends     ca-certificates     libssl-dev     curl     cuda-compat-12-9     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2026-02-18 04:06:33  0.00B 设置环境变量 HUGGINGFACE_HUB_CACHE PORT USE_FLASH_ATTENTION LD_LIBRARY_PATH
ENV HUGGINGFACE_HUB_CACHE=/data PORT=80 USE_FLASH_ATTENTION=True LD_LIBRARY_PATH=/usr/local/cuda/lib64:/usr/local/cuda/lib64
                        
# 2026-02-18 04:06:33  0.00B 定义构建参数
ARG DEFAULT_USE_FLASH_ATTENTION=True
                        
# 2025-07-19 04:22:35  0.00B 配置容器启动时运行的命令
ENTRYPOINT ["/opt/nvidia/nvidia_entrypoint.sh"]
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NVIDIA_PRODUCT_NAME
ENV NVIDIA_PRODUCT_NAME=CUDA
                        
# 2025-07-19 04:22:35  2.53KB 复制新文件或目录到容器中
COPY nvidia_entrypoint.sh /opt/nvidia/ # buildkit
                        
# 2025-07-19 04:22:35  3.06KB 复制新文件或目录到容器中
COPY entrypoint.d/ /opt/nvidia/entrypoint.d/ # buildkit
                        
# 2025-07-19 04:22:35  240.07KB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=amd64 /bin/sh -c apt-mark hold ${NV_LIBCUBLAS_PACKAGE_NAME} ${NV_LIBNCCL_PACKAGE_NAME} # buildkit
                        
# 2025-07-19 04:22:35  3.46GB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=amd64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     cuda-libraries-12-9=${NV_CUDA_LIB_VERSION}     ${NV_LIBNPP_PACKAGE}     cuda-nvtx-12-9=${NV_NVTX_VERSION}     libcusparse-12-9=${NV_LIBCUSPARSE_VERSION}     ${NV_LIBCUBLAS_PACKAGE}     ${NV_LIBNCCL_PACKAGE}     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-07-19 04:22:35  0.00B 添加元数据标签
LABEL maintainer=NVIDIA CORPORATION <cudatools@nvidia.com>
                        
# 2025-07-19 04:22:35  0.00B 定义构建参数
ARG TARGETARCH
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBNCCL_PACKAGE
ENV NV_LIBNCCL_PACKAGE=libnccl2=2.27.3-1+cuda12.9
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NCCL_VERSION
ENV NCCL_VERSION=2.27.3-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBNCCL_PACKAGE_VERSION
ENV NV_LIBNCCL_PACKAGE_VERSION=2.27.3-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBNCCL_PACKAGE_NAME
ENV NV_LIBNCCL_PACKAGE_NAME=libnccl2
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBCUBLAS_PACKAGE
ENV NV_LIBCUBLAS_PACKAGE=libcublas-12-9=12.9.1.4-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBCUBLAS_VERSION
ENV NV_LIBCUBLAS_VERSION=12.9.1.4-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBCUBLAS_PACKAGE_NAME
ENV NV_LIBCUBLAS_PACKAGE_NAME=libcublas-12-9
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBCUSPARSE_VERSION
ENV NV_LIBCUSPARSE_VERSION=12.5.10.65-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBNPP_PACKAGE
ENV NV_LIBNPP_PACKAGE=libnpp-12-9=12.4.1.87-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_LIBNPP_VERSION
ENV NV_LIBNPP_VERSION=12.4.1.87-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_NVTX_VERSION
ENV NV_NVTX_VERSION=12.9.79-1
                        
# 2025-07-19 04:22:35  0.00B 设置环境变量 NV_CUDA_LIB_VERSION
ENV NV_CUDA_LIB_VERSION=12.9.1-1
                        
# 2025-07-19 04:11:18  0.00B 设置环境变量 NVIDIA_DRIVER_CAPABILITIES
ENV NVIDIA_DRIVER_CAPABILITIES=compute,utility
                        
# 2025-07-19 04:11:18  0.00B 设置环境变量 NVIDIA_VISIBLE_DEVICES
ENV NVIDIA_VISIBLE_DEVICES=all
                        
# 2025-07-19 04:11:18  17.29KB 复制新文件或目录到容器中
COPY NGC-DL-CONTAINER-LICENSE / # buildkit
                        
# 2025-07-19 04:11:18  0.00B 设置环境变量 LD_LIBRARY_PATH
ENV LD_LIBRARY_PATH=/usr/local/cuda/lib64
                        
# 2025-07-19 04:11:18  0.00B 设置环境变量 PATH
ENV PATH=/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
                        
# 2025-07-19 04:11:18  22.00B 执行命令并创建新的镜像层
RUN |1 TARGETARCH=amd64 /bin/sh -c echo "/usr/local/cuda/lib64" >> /etc/ld.so.conf.d/nvidia.conf # buildkit
                        
# 2025-07-19 04:11:18  315.62MB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=amd64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     cuda-cudart-12-9=${NV_CUDA_CUDART_VERSION}     cuda-compat-12-9     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-07-19 04:11:02  0.00B 设置环境变量 CUDA_VERSION
ENV CUDA_VERSION=12.9.1
                        
# 2025-07-19 04:11:02  10.26MB 执行命令并创建新的镜像层
RUN |1 TARGETARCH=amd64 /bin/sh -c apt-get update && apt-get install -y --no-install-recommends     gnupg2 curl ca-certificates &&     curl -fsSL https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/${NVARCH}/3bf863cc.pub | apt-key add - &&     echo "deb https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/${NVARCH} /" > /etc/apt/sources.list.d/cuda.list &&     apt-get purge --autoremove -y curl     && rm -rf /var/lib/apt/lists/* # buildkit
                        
# 2025-07-19 04:11:02  0.00B 添加元数据标签
LABEL maintainer=NVIDIA CORPORATION <cudatools@nvidia.com>
                        
# 2025-07-19 04:11:02  0.00B 定义构建参数
ARG TARGETARCH
                        
# 2025-07-19 04:11:02  0.00B 设置环境变量 NV_CUDA_CUDART_VERSION
ENV NV_CUDA_CUDART_VERSION=12.9.79-1
                        
# 2025-07-19 04:11:02  0.00B 设置环境变量 NVIDIA_REQUIRE_CUDA brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand brand
ENV NVIDIA_REQUIRE_CUDA=cuda>=12.9 brand=unknown,driver>=535,driver<536 brand=grid,driver>=535,driver<536 brand=tesla,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=vapps,driver>=535,driver<536 brand=vpc,driver>=535,driver<536 brand=vcs,driver>=535,driver<536 brand=vws,driver>=535,driver<536 brand=cloudgaming,driver>=535,driver<536 brand=unknown,driver>=550,driver<551 brand=grid,driver>=550,driver<551 brand=tesla,driver>=550,driver<551 brand=nvidia,driver>=550,driver<551 brand=quadro,driver>=550,driver<551 brand=quadrortx,driver>=550,driver<551 brand=nvidiartx,driver>=550,driver<551 brand=vapps,driver>=550,driver<551 brand=vpc,driver>=550,driver<551 brand=vcs,driver>=550,driver<551 brand=vws,driver>=550,driver<551 brand=cloudgaming,driver>=550,driver<551 brand=unknown,driver>=560,driver<561 brand=grid,driver>=560,driver<561 brand=tesla,driver>=560,driver<561 brand=nvidia,driver>=560,driver<561 brand=quadro,driver>=560,driver<561 brand=quadrortx,driver>=560,driver<561 brand=nvidiartx,driver>=560,driver<561 brand=vapps,driver>=560,driver<561 brand=vpc,driver>=560,driver<561 brand=vcs,driver>=560,driver<561 brand=vws,driver>=560,driver<561 brand=cloudgaming,driver>=560,driver<561 brand=unknown,driver>=565,driver<566 brand=grid,driver>=565,driver<566 brand=tesla,driver>=565,driver<566 brand=nvidia,driver>=565,driver<566 brand=quadro,driver>=565,driver<566 brand=quadrortx,driver>=565,driver<566 brand=nvidiartx,driver>=565,driver<566 brand=vapps,driver>=565,driver<566 brand=vpc,driver>=565,driver<566 brand=vcs,driver>=565,driver<566 brand=vws,driver>=565,driver<566 brand=cloudgaming,driver>=565,driver<566 brand=unknown,driver>=570,driver<571 brand=grid,driver>=570,driver<571 brand=tesla,driver>=570,driver<571 brand=nvidia,driver>=570,driver<571 brand=quadro,driver>=570,driver<571 brand=quadrortx,driver>=570,driver<571 brand=nvidiartx,driver>=570,driver<571 brand=vapps,driver>=570,driver<571 brand=vpc,driver>=570,driver<571 brand=vcs,driver>=570,driver<571 brand=vws,driver>=570,driver<571 brand=cloudgaming,driver>=570,driver<571
                        
# 2025-07-19 04:11:02  0.00B 设置环境变量 NVARCH
ENV NVARCH=x86_64
                        
# 2025-07-15 00:40:23  0.00B 
/bin/sh -c #(nop)  CMD ["/bin/bash"]
                        
# 2025-07-15 00:40:22  78.12MB 
/bin/sh -c #(nop) ADD file:b4619a63cd7829e1338ddaa4995ca17003002dd54b0dfd675a6f54a2b69151a6 in / 
                        
# 2025-07-15 00:40:20  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.version=24.04
                        
# 2025-07-15 00:40:20  0.00B 
/bin/sh -c #(nop)  LABEL org.opencontainers.image.ref.name=ubuntu
                        
# 2025-07-15 00:40:20  0.00B 
/bin/sh -c #(nop)  ARG LAUNCHPAD_BUILD_ARCH
                        
# 2025-07-15 00:40:20  0.00B 
/bin/sh -c #(nop)  ARG RELEASE
                        
                    

镜像信息

{
    "Id": "sha256:fe9f2c34844c446e48faf4132650b8e3ab769c5b218e1e9d31912ef6472b3796",
    "RepoTags": [
        "ghcr.io/huggingface/text-embeddings-inference:cuda-1.9",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference:cuda-1.9"
    ],
    "RepoDigests": [
        "ghcr.io/huggingface/text-embeddings-inference@sha256:64bfb8bdd79ec1a2ef40c8fd297102b19cdeac38adf770eb9d815f4f41e4df00",
        "swr.cn-north-4.myhuaweicloud.com/ddn-k8s/ghcr.io/huggingface/text-embeddings-inference@sha256:375d3916a16406d8c4fb59cc4aed29b55b8518c22f450fd8cf2252e44266f788"
    ],
    "Parent": "",
    "Comment": "buildkit.dockerfile.v0",
    "Created": "2026-02-17T21:46:20.422845791Z",
    "Container": "",
    "ContainerConfig": null,
    "DockerVersion": "",
    "Author": "",
    "Config": {
        "Hostname": "",
        "Domainname": "",
        "User": "",
        "AttachStdin": false,
        "AttachStdout": false,
        "AttachStderr": false,
        "Tty": false,
        "OpenStdin": false,
        "StdinOnce": false,
        "Env": [
            "PATH=/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
            "NVARCH=x86_64",
            "NVIDIA_REQUIRE_CUDA=cuda\u003e=12.9 brand=unknown,driver\u003e=535,driver\u003c536 brand=grid,driver\u003e=535,driver\u003c536 brand=tesla,driver\u003e=535,driver\u003c536 brand=nvidia,driver\u003e=535,driver\u003c536 brand=quadro,driver\u003e=535,driver\u003c536 brand=quadrortx,driver\u003e=535,driver\u003c536 brand=nvidiartx,driver\u003e=535,driver\u003c536 brand=vapps,driver\u003e=535,driver\u003c536 brand=vpc,driver\u003e=535,driver\u003c536 brand=vcs,driver\u003e=535,driver\u003c536 brand=vws,driver\u003e=535,driver\u003c536 brand=cloudgaming,driver\u003e=535,driver\u003c536 brand=unknown,driver\u003e=550,driver\u003c551 brand=grid,driver\u003e=550,driver\u003c551 brand=tesla,driver\u003e=550,driver\u003c551 brand=nvidia,driver\u003e=550,driver\u003c551 brand=quadro,driver\u003e=550,driver\u003c551 brand=quadrortx,driver\u003e=550,driver\u003c551 brand=nvidiartx,driver\u003e=550,driver\u003c551 brand=vapps,driver\u003e=550,driver\u003c551 brand=vpc,driver\u003e=550,driver\u003c551 brand=vcs,driver\u003e=550,driver\u003c551 brand=vws,driver\u003e=550,driver\u003c551 brand=cloudgaming,driver\u003e=550,driver\u003c551 brand=unknown,driver\u003e=560,driver\u003c561 brand=grid,driver\u003e=560,driver\u003c561 brand=tesla,driver\u003e=560,driver\u003c561 brand=nvidia,driver\u003e=560,driver\u003c561 brand=quadro,driver\u003e=560,driver\u003c561 brand=quadrortx,driver\u003e=560,driver\u003c561 brand=nvidiartx,driver\u003e=560,driver\u003c561 brand=vapps,driver\u003e=560,driver\u003c561 brand=vpc,driver\u003e=560,driver\u003c561 brand=vcs,driver\u003e=560,driver\u003c561 brand=vws,driver\u003e=560,driver\u003c561 brand=cloudgaming,driver\u003e=560,driver\u003c561 brand=unknown,driver\u003e=565,driver\u003c566 brand=grid,driver\u003e=565,driver\u003c566 brand=tesla,driver\u003e=565,driver\u003c566 brand=nvidia,driver\u003e=565,driver\u003c566 brand=quadro,driver\u003e=565,driver\u003c566 brand=quadrortx,driver\u003e=565,driver\u003c566 brand=nvidiartx,driver\u003e=565,driver\u003c566 brand=vapps,driver\u003e=565,driver\u003c566 brand=vpc,driver\u003e=565,driver\u003c566 brand=vcs,driver\u003e=565,driver\u003c566 brand=vws,driver\u003e=565,driver\u003c566 brand=cloudgaming,driver\u003e=565,driver\u003c566 brand=unknown,driver\u003e=570,driver\u003c571 brand=grid,driver\u003e=570,driver\u003c571 brand=tesla,driver\u003e=570,driver\u003c571 brand=nvidia,driver\u003e=570,driver\u003c571 brand=quadro,driver\u003e=570,driver\u003c571 brand=quadrortx,driver\u003e=570,driver\u003c571 brand=nvidiartx,driver\u003e=570,driver\u003c571 brand=vapps,driver\u003e=570,driver\u003c571 brand=vpc,driver\u003e=570,driver\u003c571 brand=vcs,driver\u003e=570,driver\u003c571 brand=vws,driver\u003e=570,driver\u003c571 brand=cloudgaming,driver\u003e=570,driver\u003c571",
            "NV_CUDA_CUDART_VERSION=12.9.79-1",
            "CUDA_VERSION=12.9.1",
            "LD_LIBRARY_PATH=/usr/local/cuda/lib64:/usr/local/cuda/lib64",
            "NVIDIA_VISIBLE_DEVICES=all",
            "NVIDIA_DRIVER_CAPABILITIES=compute,utility",
            "NV_CUDA_LIB_VERSION=12.9.1-1",
            "NV_NVTX_VERSION=12.9.79-1",
            "NV_LIBNPP_VERSION=12.4.1.87-1",
            "NV_LIBNPP_PACKAGE=libnpp-12-9=12.4.1.87-1",
            "NV_LIBCUSPARSE_VERSION=12.5.10.65-1",
            "NV_LIBCUBLAS_PACKAGE_NAME=libcublas-12-9",
            "NV_LIBCUBLAS_VERSION=12.9.1.4-1",
            "NV_LIBCUBLAS_PACKAGE=libcublas-12-9=12.9.1.4-1",
            "NV_LIBNCCL_PACKAGE_NAME=libnccl2",
            "NV_LIBNCCL_PACKAGE_VERSION=2.27.3-1",
            "NCCL_VERSION=2.27.3-1",
            "NV_LIBNCCL_PACKAGE=libnccl2=2.27.3-1+cuda12.9",
            "NVIDIA_PRODUCT_NAME=CUDA",
            "HUGGINGFACE_HUB_CACHE=/data",
            "PORT=80",
            "USE_FLASH_ATTENTION=True"
        ],
        "Cmd": [
            "--json-output"
        ],
        "ArgsEscaped": true,
        "Image": "",
        "Volumes": null,
        "WorkingDir": "",
        "Entrypoint": [
            "./entrypoint.sh"
        ],
        "OnBuild": null,
        "Labels": {
            "maintainer": "NVIDIA CORPORATION \u003ccudatools@nvidia.com\u003e",
            "org.opencontainers.image.created": "2026-02-17T21:01:20.111Z",
            "org.opencontainers.image.description": "A blazing fast inference solution for text embeddings models",
            "org.opencontainers.image.licenses": "Apache-2.0",
            "org.opencontainers.image.ref.name": "ubuntu",
            "org.opencontainers.image.revision": "b38b8f111a23330831bf1b6ee5b33efb932319f8",
            "org.opencontainers.image.source": "https://github.com/huggingface/text-embeddings-inference",
            "org.opencontainers.image.title": "text-embeddings-inference",
            "org.opencontainers.image.url": "https://github.com/huggingface/text-embeddings-inference",
            "org.opencontainers.image.version": "cuda-1.9.1"
        }
    },
    "Architecture": "amd64",
    "Os": "linux",
    "Size": 5167193353,
    "GraphDriver": {
        "Data": {
            "LowerDir": "/var/lib/docker/overlay2/4439e1ad4bd680638adb40f6fcf0450b42a7797a9d1f2a82fc0e543087027e27/diff:/var/lib/docker/overlay2/d032cd16c7a6f576cc320ed0eb6de4f0f858b0258c5b824b5b7d5a0a6bcfd2fb/diff:/var/lib/docker/overlay2/d402fc2704565b4ac97c9ff96028974bbb0e129c057794a9f77c12bebd7cfa99/diff:/var/lib/docker/overlay2/1c6a8a473704b5b265ebbf2b09741ef76a65bc7e184cf425959a153d87fcce09/diff:/var/lib/docker/overlay2/b07c35a86ca8a89ae4a9262d3014c76dd63878b7eb641aaf8ecbd53a4eada3d7/diff:/var/lib/docker/overlay2/9e6854236efe6e625f4d231b24b616585d0919931634e292a8150f8625fc8cde/diff:/var/lib/docker/overlay2/26bfd17d906cbd790f23b241064870668c50ad39f4203bc1f7ea7e521e736f43/diff:/var/lib/docker/overlay2/4b7d0e7086c8a6a086f4f2920bc7d9557e86e87d27041dbbb2ab128790b90d48/diff:/var/lib/docker/overlay2/691775865ecc403f993e8d78eba94afdf425b43d7dc4d46969aeef104ea50291/diff:/var/lib/docker/overlay2/cc66c0fd07daa631d3fa6e8191bd7eb4f6afffe2491a4d075d642b5291f136e1/diff:/var/lib/docker/overlay2/15ee5b5f6fd6dcb4adf8b9b5abf236611153bc62007b27df59b22908f5e8b0ca/diff:/var/lib/docker/overlay2/023fa147fcd778e74de19d88d76f4ba15a72f304607f9969603e753f2a73be2a/diff:/var/lib/docker/overlay2/6fb862a4b0c2b1579a15c1957e4966c1f74b60e903f4602de12d3e6f91db6c3d/diff:/var/lib/docker/overlay2/720249e91413753ae11a68bdff9b42c9091f31597069406e8fe83da1e94c0d73/diff:/var/lib/docker/overlay2/fcbe3f4688fc3cae828ede94d6875f94c50c8ecacff772370b963b3795e0af81/diff",
            "MergedDir": "/var/lib/docker/overlay2/3a64ef1b051fa877b124f491cf74f507f2964bfbefef23938f33ed40acf2256c/merged",
            "UpperDir": "/var/lib/docker/overlay2/3a64ef1b051fa877b124f491cf74f507f2964bfbefef23938f33ed40acf2256c/diff",
            "WorkDir": "/var/lib/docker/overlay2/3a64ef1b051fa877b124f491cf74f507f2964bfbefef23938f33ed40acf2256c/work"
        },
        "Name": "overlay2"
    },
    "RootFS": {
        "Type": "layers",
        "Layers": [
            "sha256:107cbdaeec042e6154640c94972c638f4e2fee795902b149e8ce9acbd03d59d7",
            "sha256:d54e805fe9aac70390dd3e3ca1b6fd0f63b2cf5913eb0023bf741bd291d2a746",
            "sha256:35a8bdbbbf277997b1e1efcbdc9ae5321ec3a96ceb81ad194ad58d671ab47bd4",
            "sha256:12f5e36b8e7e83c3fa409e1b34eaec072b9381c8a5e600e9f03c53dbd295bec1",
            "sha256:455bca42f6ec40ca42fea4bea15c6c17b97101af90413cdd25647de2b9d98960",
            "sha256:6e980015d67b5cae225c0adf06e30b516f62600df02350db433029e63968b932",
            "sha256:97486d1882c55d529af69661c72297f97dd38801c0dfa771cbbbe649da358078",
            "sha256:d38a3b5fb295c76c1383bf5f56fa8ede57ce3ad574a6a6918307daab6ae62889",
            "sha256:3cefe4836f3110ecc622b831aafa0c278feabc23c5aeb8a914f9f90727fcbd1a",
            "sha256:4d27fa4325349b8bb3cd04cf99513ab7bbb35d927c4e7d256cb6003589d4f5f1",
            "sha256:5a12579ff4760bd13b2b3fd10536a8616b48ae36f1fefe96dbf955120ba84aa7",
            "sha256:9a1f07a603a04f2ee8e13c344ba526364e7da1cf4f5873a911826d2f6722ca49",
            "sha256:ef974232f04d5ed0c2749f05afbe878f85ecd0ae03bfbf52f1053c36d13616dd",
            "sha256:cd63d019935b03c4eb432d290208769ba3b648365c07c497a2492bc52c0eb63a",
            "sha256:d6fab079757a40d9ab95107f9e56d09589380479b304e13fb3c7ba8429c71d48",
            "sha256:5446ae1aa2679cbb9ec8143527284f972e421b0d106100bbbd513a56fa6f80bf"
        ]
    },
    "Metadata": {
        "LastTagTime": "2026-02-25T09:10:47.283808545+08:00"
    }
}

更多版本

ghcr.io/huggingface/text-embeddings-inference:cpu-1.2

linux/amd64 ghcr.io636.68MB2024-07-25 11:53
1057

ghcr.io/huggingface/text-embeddings-inference:turing-1.5

linux/amd64 ghcr.io900.87MB2024-10-23 11:14
1001

ghcr.io/huggingface/text-embeddings-inference:cpu-latest

linux/amd64 ghcr.io660.31MB2024-12-04 09:13
1183

ghcr.io/huggingface/text-embeddings-inference:1.6

linux/amd64 ghcr.io1.21GB2025-02-25 09:27
1269

ghcr.io/huggingface/text-embeddings-inference:cpu-1.6

linux/amd64 ghcr.io659.95MB2025-02-25 09:47
642

ghcr.io/huggingface/text-embeddings-inference:86-1.6.1

linux/amd64 ghcr.io1.31GB2025-03-31 11:03
439

ghcr.io/huggingface/text-embeddings-inference:86-1.7.0

linux/amd64 ghcr.io1.11GB2025-04-09 09:33
596

ghcr.io/huggingface/text-embeddings-inference:hopper-1.7

linux/amd64 ghcr.io1.11GB2025-04-14 16:27
483

ghcr.io/huggingface/text-embeddings-inference:latest

linux/amd64 ghcr.io1.11GB2025-04-17 14:51
707

ghcr.io/huggingface/text-embeddings-inference:cpu-1.7

linux/amd64 ghcr.io683.64MB2025-04-29 22:28
717

ghcr.io/huggingface/text-embeddings-inference:1.7

linux/amd64 ghcr.io1.11GB2025-05-14 09:19
456

ghcr.io/huggingface/text-embeddings-inference:86-1.7.1

linux/amd64 ghcr.io1.11GB2025-06-09 16:24
332

ghcr.io/huggingface/text-embeddings-inference:hopper-1.7.1

linux/amd64 ghcr.io1.12GB2025-06-11 17:26
425

ghcr.io/huggingface/text-embeddings-inference:1.7.1

linux/amd64 ghcr.io1.11GB2025-06-13 16:46
376

ghcr.io/huggingface/text-embeddings-inference:1.7.4

linux/amd64 ghcr.io1.11GB2025-07-08 09:31
349

ghcr.io/huggingface/text-embeddings-inference:1.8.0

linux/amd64 ghcr.io1.11GB2025-08-15 16:52
279

ghcr.io/huggingface/text-embeddings-inference:1.8

linux/amd64 ghcr.io1.11GB2025-08-15 17:00
603

ghcr.io/huggingface/text-embeddings-inference:cpu-1.8

linux/amd64 ghcr.io684.32MB2025-09-04 10:19
466

ghcr.io/huggingface/text-embeddings-inference:cuda-1.8.1

linux/amd64 ghcr.io2.65GB2025-09-11 17:54
495

ghcr.io/huggingface/text-embeddings-inference:hopper-1.8

linux/amd64 ghcr.io1.12GB2025-09-11 18:36
250

ghcr.io/huggingface/text-embeddings-inference:86-1.8.2

linux/amd64 ghcr.io1.11GB2025-09-18 21:54
293

ghcr.io/huggingface/text-embeddings-inference:cpu-1.7.2

linux/amd64 ghcr.io684.24MB2025-10-14 15:13
236

ghcr.io/huggingface/text-embeddings-inference:1.7.2

linux/amd64 ghcr.io1.11GB2025-10-14 16:56
227

ghcr.io/huggingface/text-embeddings-inference:89-1.8

linux/amd64 ghcr.io1.11GB2025-10-22 18:00
301

ghcr.io/huggingface/text-embeddings-inference:turing-1.8

linux/amd64 ghcr.io930.07MB2025-10-31 18:57
363

ghcr.io/huggingface/text-embeddings-inference:cpu-1.8.2

linux/amd64 ghcr.io685.78MB2025-12-08 15:35
204

ghcr.io/huggingface/text-embeddings-inference:86-1.9

linux/amd64 ghcr.io5.01GB2026-02-25 00:16
20

ghcr.io/huggingface/text-embeddings-inference:cuda-1.9

linux/amd64 ghcr.io5.17GB2026-02-25 09:12
13