Skip to content

vctrmn/zig-text-embeddings-inference

Repository files navigation

Zig Text Embeddings Inference

A text embeddings inference server built with Zig, leveraging the power of Zig, ZML, and Zap for blazing fast HTTP serving capabilities.

Getting Started

Prerequisites

The project uses bazel as its build system, primarily because ZML is built with Bazel. This ensures compatibility and reproducible builds.

MacOS installation :
brew install bazelisk
Linux installation (amd64) :
curl -L -o /usr/local/bin/bazel 'https://github.com/bazelbuild/bazelisk/releases/download/v1.25.0/bazelisk-linux-amd64'
chmod +x /usr/local/bin/bazel

Build and run the server

bazel run -c opt //:ModernBERT-large
bazel run -c opt --@zml//runtimes:cuda=true //:ModernBERT-large

Available Models

Model ID Description Status
ModernBERT-large Answer.AI's ModernBERT large model ✅ Available
nomic-ai/modernbert-embed-base A ModernBERT-based embedding model from Nomic AI ⚠️ In Progress
Alibaba-NLP/gte-modernbert-base An improved GTE (General Text Embeddings) variant based on ModernBERT ⚠️ In Progress

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published