List view
Exploratory work on CPU-only ONNX inference for supplemental features: local embeddings for memory search, prompt injection defense, and content safety. NOT about replacing main LLM inference — these are auxiliary capabilities that run alongside the primary model.
No due date•0/3 issues closed