LabBench

LabBench is an open-source, production-oriented terminal AI assistant for Python, notebooks, and data workflows. It is intentionally compact, with practical safeguards and operational features for real projects.

Overview

Terminal-first workflow with minimal moving parts
Works in any repository directory (./.labbench/ for project-local state)
Supports hosted and local model providers through one interface
Includes permission controls, tool safety gates, memory, skills, diagnostics, and tests

Interface Preview

Runtime conversation example

Built-in help and command surface

Quick Start

python3 -m pip install --upgrade pip
python3 -m pip install labbench-cli
labbench

Set one of these API keys before first run (depending on provider):

ANTHROPIC_API_KEY
OPENAI_API_KEY
provider-specific keys for other backends in providers.py

Source install (development)

git clone https://github.com/slfagrouche/labbench.git labbench
cd labbench
python3 -m venv .venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate
python3 -m pip install --upgrade pip
python3 -m pip install -e .
labbench

Production Usage

Interactive (default REPL)

labbench

Interactive with explicit provider/model

labbench -m claude-opus-4-6
labbench -m gpt-4o
labbench -m gemini-2.0-flash
labbench -m ollama/qwen2.5-coder
labbench -m custom/my-model

One-shot mode (automation, scripts, CI)

labbench -p "Summarize this pull request"
labbench -m gpt-4o -p "Draft release notes from git history"

Operational flags

--help show full CLI and REPL command help
--version print version and exit
--verbose include token/thinking output
--thinking enable extended reasoning where supported
--accept-all disable permission prompts (high trust mode; use cautiously)

Supported Providers

LabBench supports both hosted and local providers through a unified interface.

Hosted providers

anthropic (Claude)
openai (GPT/o-series)
gemini (Google Gemini via OpenAI-compatible endpoint)
kimi (Moonshot)
qwen (DashScope)
zhipu (GLM)
deepseek (DeepSeek)
custom (any OpenAI-compatible API via custom/<model>)

Local/self-hosted providers

ollama (local server, no cloud key required)
lmstudio (local server, no cloud key required)

Use either:

Auto-detected model names (example: claude-opus-4-6, gpt-4o)
Explicit prefix form (example: ollama/qwen2.5-coder, custom/my-model)

For exact env var mapping and defaults, see providers.py.

Core Capabilities

File and shell tooling (Read, Write, Edit, Bash, Glob, Grep)
Notebook editing (NotebookEdit)
Web tools (WebFetch, WebSearch)
Diagnostics (GetDiagnostics)
Human-in-the-loop clarification (AskUserQuestion)
Persistent memory (memory/)
Markdown skills (skill/)
Context compaction for long sessions (compaction.py)

Visual Architecture

1) End-to-end flow

Interpretation:

Non-technical: user asks, system runs safely, user gets result.
Technical: agent orchestrates provider calls, tool execution, memory/skills, and permission checks.

2) Permission model (production safety)

3) Runtime components (technical map)

Project Layout

labbench.py CLI, REPL, slash commands, terminal UX
agent.py core loop and permission flow
providers.py model provider adapters
tools.py built-in tools and registration
tool_registry.py tool schema/dispatch system
memory/ memory persistence and memory tools
skill/ skill parsing, built-ins, and skill tools
context.py system prompt assembly
config.py config/session persistence
pyproject.toml package metadata and CLI entrypoint

Configuration and State

Global config: ~/.labbench/config.json
Global sessions: ~/.labbench/sessions/
Project-local state: ./.labbench/ (memory and project skills)

The clone folder name does not matter; runtime paths are based on home and current working directory.

Safety Model

LabBench can run shell commands and edit files. Safety is enforced via:

permission modes (auto, manual, accept-all)
safe-command allowlist for auto-approved Bash
explicit prompts for sensitive operations

Always review before enabling --accept-all in sensitive environments.

Troubleshooting

`labbench: command not found`

Install the package first (or reinstall into your environment):

python3 -m pip install --upgrade pip
python3 -m pip install labbench-cli

If you are running from source without installing, use:

python3 labbench.py

`python: command not found`

Use python3 explicitly:

python3 labbench.py

`bad interpreter` from `.venv/bin/pip`

Your virtual environment likely points to an old path. Rebuild it:

deactivate 2>/dev/null || true
rm -rf .venv
python3 -m venv .venv
source .venv/bin/activate
python3 -m pip install --upgrade pip
python3 -m pip install -r requirements.txt

`PermissionError` on history file

Recent versions of LabBench handle restricted history-file permissions gracefully. If you still see this, pull latest main and rerun.

Development

Run tests:

python3 -m pip install -r requirements-dev.txt
python3 -m pytest tests/ -v
python3 -m pip install build
python3 -m build

PyPI publishing is configured via GitHub Actions in .github/workflows/publish-pypi.yml.

Docs:

Architecture: docs/architecture.md
Contributor guide: docs/contributor_guide.md

License

Licensed under Apache License 2.0. See LICENSE and NOTICE.

Name		Name	Last commit message	Last commit date
Latest commit History 69 Commits
.github/workflows		.github/workflows
docs		docs
memory		memory
skill		skill
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
NOTICE		NOTICE
PYPI_README.md		PYPI_README.md
README.md		README.md
agent.py		agent.py
compaction.py		compaction.py
config.py		config.py
context.py		context.py
labbench.py		labbench.py
memory.py		memory.py
providers.py		providers.py
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
subagent.py		subagent.py
tool_registry.py		tool_registry.py
tools.py		tools.py

Folders and files

Latest commit

History

Repository files navigation

LabBench

Overview

Interface Preview

Runtime conversation example

Built-in help and command surface

Quick Start

Source install (development)

Production Usage

Interactive (default REPL)

Interactive with explicit provider/model

One-shot mode (automation, scripts, CI)

Operational flags

Supported Providers

Hosted providers

Local/self-hosted providers

Core Capabilities

Visual Architecture

1) End-to-end flow

2) Permission model (production safety)

3) Runtime components (technical map)

Project Layout

Configuration and State

Safety Model

Troubleshooting

labbench: command not found

python: command not found

bad interpreter from .venv/bin/pip

PermissionError on history file

Development

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`labbench: command not found`

`python: command not found`

`bad interpreter` from `.venv/bin/pip`

`PermissionError` on history file

Packages