Add VS Code local MLX model example by ahnafyy · Pull Request #1425 · ml-explore/mlx-examples

ahnafyy · 2026-06-09T21:06:56Z

Summary

Add a new vscode_lm example that serves a local MLX coding model through an OpenAI-compatible chat endpoint for use with VS Code language model integration.

What changed

add a new vscode_lm/server.py example built on mlx-lm
expose GET /health, GET /v1/models, and POST /v1/chat/completions
add basic prompt-mediated tool-calling support for agent-style requests
document the VS Code setup flow in vscode_lm/README.md
add a sample chatLanguageModels.example.json config
link the new example from the top-level README.md
ignore the local virtualenv created for this example in .gitignore

Notes

the example defaults to no-auth local usage for simpler setup
the default model is mlx-community/Qwen2.5-Coder-1.5B-Instruct-4bit to fit local disk and memory constraints better than the 7B model
the example was validated end-to-end locally with /health, /v1/models, and a real /v1/chat/completions request

Validation

start server from the local Python 3.12 virtualenv
verify GET /health
verify GET /v1/models
verify POST /v1/chat/completions returns a completion without auth

Add VS Code local MLX model example

68f4967

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add VS Code local MLX model example#1425

Add VS Code local MLX model example#1425
ahnafyy wants to merge 1 commit into
ml-explore:mainfrom
ahnafyy:add-vscode-mlx-local-model-example

ahnafyy commented Jun 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ahnafyy commented Jun 9, 2026

Summary

What changed

Notes

Validation

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant