ALEX 1.0: Minecraft RL-micro and LLM-macro management AI agent

ALEX is a Minecraft AI agent that combines reinforcement learning (RL) for micro-level tasks and large language models (LLMs) for macro-level decision-making. ALEX utilizes the strengths of both approaches to navigate and interact with the complex Minecraft environment effectively.

Architecture & Pipeline

ALEX implements a hierarchical agent architecture that combines vision-language models with reinforcement learning:

Planning & Decision Layer

The overall architecture consists of several experts (2 LLM ones, and a heuristic-based one):

Meta-planner coordinates as a heuristic-based completition evaluator
Planner uses LLMs (via HF Inference API) to generate high-level action plans
Reflex manager handles immediate threats and opportunities (low health, hostile mobs, valuable items)

Vision Processing Layer

Pipeline has a vision perception layer which utilizes MineCLIP encoders for visual scene understanding and spatial attention

Multi-modal vision queries for object detection, inventory analysis, and environment perception
Scene analyzer that extracts structured game state from raw observations
Spatial attention concept which splits the image into patches, and inferences MineCLIP on them to understand the image better

Knowledge base

The knowledge system is integrated within the pipeline to not hallucinate on minecraft prompts and commands

RAG system retrieves relevant Minecraft wiki information for decision-making
Vector-based retrieval using wiki dataset for crafting recipes, mob behaviors, and game mechanics
Prompt engineering with few-shot examples for consistent LLM outputs

Execution Layer

The RL-based executor which utilizes STEVE-1 model

STEVE-1 policy executor translates natural language commands into low-level actions
VPT-based vision-to-action model for fine-grained control
Action sequence generation with temporal consistency

The system operates on two timescales: fast reactive reflexes (every step) and complex planning (every 50-100 steps), enabling both tactical responsiveness and strategic goals planning.

Prerequisites and installation

Clone the repository:

git clone --recursive https://github.com/n1n1n1q/alex.git
cd alex

Download MineCLIP weights (avg.pth or attn.pth) and place them in models/ directory.

Download the MineDojo Wiki dataset:

mkdir -p data
wget -c --show-progress "https://zenodo.org/record/6640448/files/wiki_samples.zip" -O data/wiki_samples.zip
unzip data/wiki_samples.zip -d data && rm data/wiki_samples.zip

Put WANDB_API_KEY into .env file.

Environment setup

Docker setup

Build docker with

chmod +x docker/build.sh
./docker/build.sh

Run the environment with

chmod +x docker/run.sh
./docker/run.sh

If no CUDA GPU available, use the no GPU version:

chmod +x docker/run_no_gpu.sh
./docker/run_no_gpu.sh

Conda setup

conda create -n minestudio python=3.10 -y
source $(conda info --base)/etc/profile.d/conda.sh
conda activate minestudio
conda install --channel=conda-forge openjdk=8 -y

Setup for MacOS with Conda

chmod +x install_macos.sh

After setting up your appropriate environment, install submodules:

cd alex
pip install -e submodules/MineStudio 
pip install -e submodules/MineCLIP

Install dependencies:

pip install -r requirements.txt

Name		Name	Last commit message	Last commit date
Latest commit History 136 Commits
alex		alex
assets		assets
benchmarks		benchmarks
conf		conf
docker		docker
examples		examples
models		models
submodules		submodules
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
install_macos.sh		install_macos.sh
readme.md		readme.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ALEX 1.0: Minecraft RL-micro and LLM-macro management AI agent

Architecture & Pipeline

Planning & Decision Layer

Vision Processing Layer

Knowledge base

Execution Layer

Prerequisites and installation

Environment setup

Docker setup

Conda setup

Setup for MacOS with Conda

References

Contributors

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ALEX 1.0: Minecraft RL-micro and LLM-macro management AI agent

Architecture & Pipeline

Planning & Decision Layer

Vision Processing Layer

Knowledge base

Execution Layer

Prerequisites and installation

Environment setup

Docker setup

Conda setup

Setup for MacOS with Conda

References

Contributors

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages