A complete AI agent in ~225 lines of Python. No frameworks, no abstractions.
runtime.py   agent loop, tools, memory, system prompt
model.py     local Hugging Face transformers inference + sampling
cli.py       terminal interface
# with ollama
ollama pull mistral-small3.2
pip install requests
python runtime.py
# with local Hugging Face models (needs GPU)
pip install torch transformers accelerate
# defaults to Qwen/Qwen2.5-7B-Instruct
python runtime.py --local-model
# enable verbose mode to inspect prompts sent to the LLM
python runtime.py --verbose

# add a tool
def calculator(expression):
    # eval executes arbitrary Python, so only pass it trusted input
    return str(eval(expression))

TOOLS["calculator"] = {"fn": calculator, "desc": "calculator(expression)"}

License: MIT
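
The calculator tool above hands the model's expression straight to `eval`, which will run any Python it is given. If that is a concern, one possible alternative is to parse the expression with the standard `ast` module and evaluate only plain arithmetic. The sketch below is an illustration, not code from this repo: `safe_calculator`, `_evaluate`, and the operator tables are hypothetical names, and the `TOOLS = {}` line only stands in for the registry that `runtime.py` already defines.

```python
import ast
import operator

# Whitelisted arithmetic operators; anything else is rejected.
_BINARY_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
    ast.FloorDiv: operator.floordiv,
    ast.Mod: operator.mod,
}
_UNARY_OPS = {ast.UAdd: operator.pos, ast.USub: operator.neg}


def _evaluate(node):
    """Recursively evaluate an AST node made only of numbers and whitelisted operators."""
    if isinstance(node, ast.Constant) and type(node.value) in (int, float):
        return node.value
    if isinstance(node, ast.BinOp) and type(node.op) in _BINARY_OPS:
        return _BINARY_OPS[type(node.op)](_evaluate(node.left), _evaluate(node.right))
    if isinstance(node, ast.UnaryOp) and type(node.op) in _UNARY_OPS:
        return _UNARY_OPS[type(node.op)](_evaluate(node.operand))
    raise ValueError("unsupported expression")


def safe_calculator(expression):
    """Hypothetical drop-in for calculator(): returns the result or an error string."""
    try:
        tree = ast.parse(expression, mode="eval")
        return str(_evaluate(tree.body))
    except (SyntaxError, ValueError, ZeroDivisionError) as exc:
        return f"error: {exc}"


# Stand-in for the TOOLS registry in runtime.py (assumed to be a plain dict).
TOOLS = {}
TOOLS["calculator"] = {"fn": safe_calculator, "desc": "calculator(expression)"}
```

Returning an error string instead of raising keeps the tool's output a plain string, which matches the `str(...)` return of the original `calculator`. Function calls, names, and attribute access all fall through to the "unsupported expression" branch.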
