Unmetered Tokens Café — Experimental Brews

A live-sentiment-analysis demo for a fictional café's experimental menu. Customer reviews stream in by the hundred, and a transformer model classifies each one as positive, neutral, or negative — entirely in the browser, on the device's NPU, GPU, or CPU, with no cloud round-trips.

Built on Windows ML via WebNN and ONNX Runtime Web.

What it shows

A real transformer (RoBERTa, fine-tuned for sentiment) running locally in the browser.
The same JavaScript code routed to NPU / GPU / CPU by changing a single line.
Live latency, throughput, and a rolling sentiment chart so you can see the hardware acceleration.
No API calls. No server inference. No data leaving the device.

How it works

Tokenize — text is converted to BPE token IDs with the model's tokenizer.
Tensorize — tokens are packed into fixed [1, 512] int32 buffers with an attention mask for padding.
Inference — ort.InferenceSession.run() dispatches to the WebNN execution provider on the configured device.
Decode — logits → softmax → argmax to pick the predicted class.
Render — the message lands in the feed with its sentiment badge and inference latency.

The whole inference path lives in js/sentimentAnalyzer.js, in about 40 lines.

Switching the device

Open js/sentimentAnalyzer.js and change one constant:

// ▶️ Device the model runs on. Change to 'cpu' or 'gpu' to compare silicon.
const DEVICE_TYPE = 'npu';

Reload the page. The badge in the bottom right reflects the active device, and the latency stat will jump (or drop) accordingly.

Running locally

cd Sentiment-Analysis-WinML-WebNN
npm install
npm start

Then open http://localhost:8888.

The model files (model/model_opt.onnx, model/model_opt.onnx.data, model/vocab.json, model/merges.txt) are gitignored due to size (~478 MB). Place them in the model/ folder before running.

Requirements

A browser with WebNN enabled — see Enabling WebNN flags below.
Optional but recommended: a Copilot+ PC with an NPU for the most dramatic latency numbers.

Enabling WebNN flags

WebNN is currently behind experimental flags. In Edge or Chrome, open edge://flags (or chrome://flags), search for WebNN, and enable the following three flags:

Flag	Setting
Enables WebNN API (`#web-machine-learning-neural-network`)	Enabled
Enables experimental WebNN API features (`#experimental-web-machine-learning-neural-network`)	Enabled
ONNX Runtime backend for WebNN (`#webnn-onnxruntime`)	Enabled

Restart the browser after enabling.

Tech stack

onnxruntime-web — ONNX Runtime in the browser, with the WebNN execution provider.
Windows ML / WebNN — hardware-accelerated ML on Windows.
Vanilla JavaScript / HTML / CSS — no frontend framework.
Model: cardiffnlp/twitter-roberta-base-sentiment-latest, optimized to ONNX.

Project layout

Sentiment-Analysis-WinML-WebNN/
├── index.html              # App shell + EP badge
├── server.mjs              # Tiny static file server
├── package.json
├── css/styles.css
├── js/
│   ├── app.js              # UI, stream control, stats
│   ├── sentimentAnalyzer.js   # WebNN session + analyze() pipeline
│   ├── tokenizer.js        # BPE tokenizer
│   ├── messageGenerator.js # Simulated café reviews
│   └── sentimentChart.js   # Live trend chart
└── model/                  # ONNX model + tokenizer files (gitignored)

Learn more

aka.ms/winml — Windows ML overview, samples, and docs.
WebNN API — W3C draft spec.
ONNX Runtime Web — JS bindings and EP options.

License

Released under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
css		css
docs		docs
js		js
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
server.mjs		server.mjs
server.py		server.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Unmetered Tokens Café — Experimental Brews

What it shows

How it works

Switching the device

Running locally

Requirements

Enabling WebNN flags

Tech stack

Project layout

Learn more

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Unmetered Tokens Café — Experimental Brews

What it shows

How it works

Switching the device

Running locally

Requirements

Enabling WebNN flags

Tech stack

Project layout

Learn more

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages