apple-sili

Here are 2 public repositories matching this topic...

SharpAI / SwiftLM

⚡ Native MLX Swift LLM inference server for Apple Silicon. OpenAI-compatible API, SSD streaming for 100B+ MoE models, TurboQuant KV cache compression, macOS + iOS iPhone app.

swift ios metal inference moe mlx on-device-ai openai-api llm apple-sili

Updated May 19, 2026
Swift

IbadKhalid7 / turboquant-model

Optimize LLM inference with near-optimal 4-bit weight quantization and on-the-fly dequantization for lower memory use and faster matmul.

Updated May 26, 2026
Python
