Skip to content
View robertlangdonn's full-sized avatar

Organizations

@kagaji

Block or report robertlangdonn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
robertlangdonn/README.md

Prasad Khake

I make LLMs run well on real, constrained hardware — on-device, edge, Apple Silicon — and build the products around them.

The recurring question in my work: what actually limits LLM inference on a machine you own, and how do those limits change as you scale? Background: hardware (e-paper boards shipped to 20+ countries), Rust systems tooling, and AI-native full-stack development.

On-device / inference (Apple's MLX)

  • Contributor to mlx-lm. Merged: #1349 — enables text-mode loading of Gemma 4 (gemma4_unified) checkpoints on MLX.
  • #1329 (approved) — root-caused why Mistral/Devstral (tekken-v13) models emit Ġ instead of spaces on Apple Silicon, and fixed the detokenizer routing. The writeup.
  • First merged contribution to vLLM on Apple Silicon (#382).

On Device — measuring the bottlenecks

Other work

  • paperd.ink — open-source ESP32 e-paper dev board, in makers' hands across 20+ countries.
  • vcfkit — genomics CLI in Rust; 4× faster than bcftools, single static binary.
  • Hacker Newspaper — comments-first mobile Hacker News reader.

Writing about on-device LLMs at prasadkhake.com · On Device.

📫 prasadkhake@gmail.com · LinkedIn

Pinned Loading

  1. paperdink/paperd.ink paperdink/paperd.ink Public

    ESP32 based E-paper display.

    248 10

  2. vcfkit vcfkit Public

    Fast VCF toolkit — normalize, liftover, filter — as a single static binary. 4× faster than bcftools on hot paths. No htslib, no Python, no C dependencies.

    Rust 7

  3. hacker-newspaper hacker-newspaper Public

    Mobile-optimized Hacker News frontend (React, Vite, Tailwind)

    JavaScript

  4. pioracle pioracle Public

    Pioracle — Your destiny in π. Pi Day oracle app.

    HTML