embed-server

Tiny OpenAI-compatible /v1/embeddings HTTP API, backed by fastembed. Drop-in for any client that already speaks the OpenAI embeddings API; runs the model locally with no per-call cost.

Install + run

uv sync
uv run embed-server   # listens on 127.0.0.1:8001

Environment variables: EMBED_HOST (default 127.0.0.1), EMBED_PORT (default 8001).
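If a client runs on the same host as the server, it can derive its base URL from the same two variables instead of hard-coding the address. The following is a minimal sketch; `base_url_from_env` is a hypothetical helper, not part of embed-server, and it assumes the defaults stated above.

```python
import os


def base_url_from_env(env=os.environ):
    """Build a client base URL from EMBED_HOST / EMBED_PORT.

    Hypothetical helper: falls back to the documented defaults
    (127.0.0.1 and 8001) when a variable is unset.
    """
    host = env.get("EMBED_HOST", "127.0.0.1")
    port = int(env.get("EMBED_PORT", "8001"))
    return f"http://{host}:{port}/v1"


if __name__ == "__main__":
    print(base_url_from_env())
```

Note that if EMBED_HOST is set to a wildcard bind address such as 0.0.0.0, a client should connect to a concrete address of the host rather than reuse that value.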

Use it

import openai
client = openai.OpenAI(base_url="http://127.0.0.1:8001/v1", api_key="local")
r = client.embeddings.create(model="jinaai/jina-embeddings-v2-base-es", input="hola")
print(len(r.data[0].embedding))  # 768

Or with curl:

curl -s http://127.0.0.1:8001/v1/embeddings \
  -H 'Content-Type: application/json' \
  -d '{"model":"jinaai/jina-embeddings-v2-base-es","input":"hola"}' | jq '.data[0].embedding | length'
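The same request can be sent without the openai package, using only the Python standard library. This is a sketch that mirrors the curl call above; the function names are illustrative, and the response shape (`data[0].embedding`) is assumed to follow the OpenAI embeddings format, as the curl example implies.

```python
import json
import urllib.request


def build_embeddings_request(base_url, model, text):
    """Build a POST request equivalent to the curl example."""
    payload = json.dumps({"model": model, "input": text}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/embeddings",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def embedding_from_response(body):
    """Extract the first embedding vector from a JSON response body."""
    return json.loads(body)["data"][0]["embedding"]


def embed(base_url, model, text, timeout=120):
    """Send the request and return the embedding as a list of floats."""
    req = build_embeddings_request(base_url, model, text)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return embedding_from_response(resp.read())


if __name__ == "__main__":
    vec = embed(
        "http://127.0.0.1:8001/v1",
        "jinaai/jina-embeddings-v2-base-es",
        "hola",
    )
    print(len(vec))
```

The generous default timeout is a deliberate choice: the first request for a model may have to wait for the download to finish.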

Models are loaded on the first request that names them and then cached in memory. The first call for a given model downloads it into ~/.cache/fastembed, so expect it to be slow; later calls reuse the already loaded model.
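The load-on-first-request behavior can be pictured as a small lazy cache. The sketch below illustrates the pattern only and is not the server's actual code; `ModelCache` and the `loader` callable are hypothetical names.

```python
import threading


class ModelCache:
    """Load each model once, on first use, and keep it in memory."""

    def __init__(self, loader):
        self._loader = loader      # callable: model name -> model object
        self._models = {}          # model name -> loaded model
        self._lock = threading.Lock()

    def get(self, name):
        # The lock keeps two concurrent first requests for the same
        # model from triggering two loads.
        with self._lock:
            if name not in self._models:
                self._models[name] = self._loader(name)
            return self._models[name]

    def loaded(self):
        """Names of the models currently held in memory, sorted."""
        with self._lock:
            return sorted(self._models)
```

With fastembed, the loader could plausibly be something like `lambda name: TextEmbedding(model_name=name)`, though how embed-server wires this up internally is not shown here. Holding a single lock during the load is the simplest option; it also blocks requests for other models while one is loading.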

See the fastembed supported models list for valid model values.

Endpoints

| Method | Path | Notes |
| --- | --- | --- |
| `POST` | `/v1/embeddings` | OpenAI-compatible request/response |
| `GET` | `/health` | Returns `{"status":"ok","models_loaded":[...],"ts":...}` |
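The /health endpoint is handy for waiting until the server is up before sending embedding requests, for example in a startup script. A minimal sketch, assuming the response body has the shape shown in the table; `is_healthy` and `wait_until_healthy` are illustrative names, not part of embed-server.

```python
import json
import time
import urllib.error
import urllib.request


def is_healthy(body):
    """Return True if a /health response body reports status "ok"."""
    try:
        return json.loads(body).get("status") == "ok"
    except (ValueError, AttributeError):
        # Not JSON, or JSON that is not an object.
        return False


def wait_until_healthy(url="http://127.0.0.1:8001/health",
                       timeout=30.0, interval=0.5):
    """Poll the health endpoint until it reports ok or the timeout expires."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(url, timeout=interval + 1) as resp:
                if is_healthy(resp.read()):
                    return True
        except (urllib.error.URLError, OSError):
            pass  # server not reachable yet; retry after a short sleep
        time.sleep(interval)
    return False


if __name__ == "__main__":
    print("ready" if wait_until_healthy() else "not ready")
```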

Deploy as a systemd service

See deploy/embed-server.service. Install:

sudo cp deploy/embed-server.service /etc/systemd/system/
sudo systemctl daemon-reload
sudo systemctl enable --now embed-server

Edit WorkingDirectory, User, and the Environment=PATH in the unit to match the host. The unit assumes uv is on PATH and the repo is checked out at WorkingDirectory.

License

MIT
