ncro (pronounced Necro) is a lightweight HTTP proxy, inspired by Squid and
several other projects in the same domain, optimized for Nix binary cache
routing. It routes narinfo requests to the fastest available upstream using EMA
latency tracking, persists routing decisions in SQLite and optionally gossips
routes to peer nodes over a mesh network. How cool is that!
Unlike ncps, ncro does not store NARs on disk. It streams NAR data directly from upstreams with zero local storage. The tradeoff is simple: repeated downloads of the same NAR always hit an upstream, but routing decisions (which upstream to use) are cached and reused. Though, this is desirable for what ncro aims to be. The optimization goal is extremely domain-specific.
During a Nix build, binaries are downloaded from configured substituters, also known as binary caches. When multiple caches serve the same paths or you have multiple caches configured in your Nix setup, there is additional wait time and overhead to every build. ncro solves this by acting as an intelligent local proxy that measures upstream latency in real time and routes each request to the fastest responder. To keep ncro small and lightweight, routing metadata is persisted on disk; NAR content is streamed through with zero local storage. This keeps the proxy stateless on the data path and eliminates cache-invalidation complexity.
For a deeper look at the system design, see the architechture document.
flowchart TD
A[Nix client] --> B[ncro proxy :8080]
B --> C[/hash.narinfo request/]
B --> D[/nar/*.nar request/]
C --> E[Parallel HEAD race]
E --> F[Fastest upstream wins]
F --> G[Result cached in SQLite TTL]
D --> H[Try upstreams in latency order]
H --> I{404?}
I -- yes --> J[Fallback to next upstream]
I -- no --> K[Zero copy stream to client]
J --> H
K --> A
The request flow follows two distinct paths depending on the request type:
- Nix requests
/<hash>.narinfo - ncro checks the SQLite route cache; on a hit, it re-fetches from the cached upstream without probing others
- On a miss, it races HEAD requests to all configured upstreams in parallel
- The fastest upstream wins; the full narinfo body is fetched from that upstream and returned to the client
- The winning route is persisted with a configurable TTL; subsequent requests for the same hash use the cached route directly
- Nix requests
/nar/<hash>.nar - ncro looks up the route for the corresponding narinfo hash; if no route is found (e.g. the narinfo was requested directly from an upstream), it tries upstreams in latency order
- The NAR body is streamed chunk-by-chunk from the selected upstream to the client with zero buffering on disk
- If the upstream returns 404, ncro falls through to the next upstream in latency order
- After all upstreams are exhausted with no success, a 404 is returned
Background probes (HEAD /nix-cache-info) run every 30 seconds to keep latency
measurements current and detect unhealthy upstreams. System design is covered
further in the architechture document.
GET /nix-cache-info: proxy capability advertisement used by NixGET /<hash>.narinfo: route lookup and upstream selectionGET /nar/<path>.nar: streamed NAR content from the chosen upstreamGET /metrics: Prometheus metricsGET /health: JSON health summary of configured upstreams
- Route cache decisions are stored in SQLite and reused until their TTL expires
(or they are evicted by the LRU policy when
max_entriesis reached). - Latency is tracked using an Exponentially Weighted Moving Average (EMA) with a
configurable smoothing factor (
cache.latency_alpha, default 0.3). Higher alpha values react faster to changes; lower values filter out measurement noise. - Lower latency wins the race. When two upstreams are within 10% of each other,
the lower
priorityvalue acts as a tiebreaker. - Background probes (
HEAD /nix-cache-info) update latency estimates every 30 seconds even when no client traffic is flowing, ensuring warm routing data. - On a cache miss, ncro races all configured upstreams in parallel and returns the first successful response. Unhealthy upstreams (detected by consecutive probe failures) are excluded from the race until they recover.
# Run with defaults (upstreams: cache.nixos.org, listen: :8080)
$ ncro
# Point at a config file
$ ncro --config /etc/ncro/config.toml
# Tell Nix to use it
$ nix-shell -p hello --substituters http://localhost:8080Deployment instructions are in installation document.
Tip
If you are testing locally, point only a single Nix client at ncro first. That makes it easier to see cache behavior and upstream selection in logs.
Default config is embedded; create a TOML file to override any field.
[server]
listen = ":8080"
read_timeout = "30s"
write_timeout = "30s"
[[upstreams]]
url = "https://cache.nixos.org"
priority = 10 # lower = preferred on latency ties (within 10%)
[[upstreams]]
url = "https://nix-community.cachix.org"
priority = 20
[cache]
db_path = "/var/lib/ncro/routes.db"
max_entries = 100000 # LRU eviction above this
ttl = "1h" # how long a routing decision is trusted
latency_alpha = 0.3 # EMA smoothing factor (0 < alpha < 1)
[logging]
level = "info" # debug | info | warn | error
format = "json" # json | text
[mesh]
enabled = false
bind_addr = "0.0.0.0:7946"
peers = [] # list of {addr, public_key} peer entries
private_key = "" # path to ed25519 key file; empty = ephemeral
gossip_interval = "30s"| Variable | Config field |
|---|---|
NCRO_LISTEN |
server.listen |
NCRO_DB_PATH |
cache.db_path |
NCRO_LOG_LEVEL |
logging.level |
Environment overrides are useful for containerized or Systemd deployments where you want a fixed config file but still need to tweak one or two settings.
{
services.ncro = {
enable = true;
settings = {
upstreams = [
{ url = "https://cache.nixos.org"; priority = 10; }
{ url = "https://nix-community.cachix.org"; priority = 20; }
];
};
};
# Point Nix at the proxy
nix.settings.substituters = [ "http://localhost:8080" ];
}Alternatively, if you're not using NixOS, create a Systemd service similar to
this. You'll also want to harden this, but for the sake of brevity I will not
cover that here. Make sure you have ncro in your PATH, and then write the
Systemd service:
[Unit]
Description=Nix Cache Route Optimizer
[Service]
ExecStart=ncro --config /etc/ncro/config.toml
DynamicUser=true
StateDirectory=ncro
Restart=on-failure
[Install]
WantedBy=multi-user.targetPlace it in /etc/systemd/system/ and enable the service with
systemctl enable. In the case you want to test out first, run the binary with
a sample configuration instead.
When mesh.enabled = true, ncro creates an ed25519 identity, binds a UDP socket
on bind_addr, and gossips recent route decisions to configured peers on
gossip_interval. Messages are signed with the node's ed25519 private key and
serialized with msgpack. Received routes are merged into an in-memory store
using a lower-latency-wins / newer-timestamp-on-tie conflict resolution policy.
Each peer entry takes an address and an optional ed25519 public key. When a public key is provided, incoming gossip packets are verified against it; packets from unlisted senders or with invalid signatures are silently dropped.
If mesh.private_key is left empty, ncro generates an ephemeral identity on
startup. That is fine for testing, but persistent gossip requires a stable key
so peers can recognize the node across restarts.
[mesh]
enabled = true
private_key = "/var/lib/ncro/node.key"
[[mesh.peers]]
addr = "100.64.1.2:7946"
public_key = "a1b2c3..." # hex-encoded ed25519 public key (32 bytes)
[[mesh.peers]]
addr = "100.64.1.3:7946"
public_key = "d4e5f6..."The node logs its public key on startup (mesh node identity log line). You can
share it with peers so they can add it to their config.
Tip
Keep mesh traffic on a private network. The gossip protocol is signed, but it is still meant for trusted peers. ncro's mesh network feature was designed with Tailscale in mind.
Prometheus metrics are available at /metrics.
| Metric | Type | Description |
|---|---|---|
ncro_narinfo_cache_hits_total |
counter | Narinfo requests served from route cache |
ncro_narinfo_cache_misses_total |
counter | Narinfo requests requiring upstream race |
ncro_narinfo_requests_total{status} |
counter | Narinfo requests by status (200/error) |
ncro_nar_requests_total |
counter | NAR streaming requests |
ncro_upstream_race_wins_total{upstream} |
counter | Race wins per upstream |
ncro_upstream_latency_seconds{upstream} |
histogram | Race latency per upstream |
ncro_route_entries |
gauge | Current route entries in SQLite |
Tip
If you are tuning upstreams, watch ncro_upstream_latency_seconds and
ncro_upstream_race_wins_total together. The first shows raw response timing;
the second shows which cache host is actually being chosen.
- Use
priorityto break ties between similarly fast caches, not to override a clearly slower upstream. - Put
db_pathon persistent storage if you want routing decisions to survive restarts. - Use a small
ttlwhile testing and a larger one in production to reduce upstream probing. - Keep
cache.nix.organd any private caches in the upstream list, with the most trusted cache first. - If you run behind a firewall or container network, make sure the listen port is reachable from your Nix clients.
This project is built with NixOS in mind and naturally the primary means of
working on this project is using Nix for a reproducible developer environment.
Use nix develop to enter a development shell, or direnv allow to use the
provided .envrc if you use Direnv.
# With Nix (recommended)
$ nix build
# With Cargo directly
$ cargo build --release
# Development shell
$ nix develop
$ cargo testncro is made available under Mozilla Public License (MPL) version 2.0. See LICENSE for more details on the exact conditions. An online copy is provided here.