LlamaModel

A Python web application to manage large language models for llama.cpp servers: search and browse GGUF models on Hugging Face, download selected quantizations, and maintain a models.ini file compatible with the llama.cpp server.

Note: This application is currently in its alpha phase and has only been tested on Linux.

Screenshots

Discover

My Models

Settings

Features

Discover – Search Hugging Face for GGUF-compatible models (LMStudio-like interface). Filter by tags or text.
Model Detail – View the Hugging Face model card, introduction, and a list of available quantizations (GGUF files), along with download sizes.
Download Management – Download a chosen quantization directly via the Hugging Face API into your configured models directory. Active downloads include real-time progress bars, speed metrics, ETA estimates, and a cancel button.
models.ini Integration – After each download, the app intelligently parses the model card for recommended parameters, dynamically structuring an entry in models.ini, precisely mapping to the downloaded file.
My Models / Parameters Editor – Configure and edit default loading constraints for models (e.g., context size, GPU layers, seed) using an intuitive interface. Fully compatible with llama.cpp's new configuration file requirements.
Configurable Settings – Web application port and default Hugging Face models directory can be configured through a unified settings view or config.yaml.

Requirements

Python 3.10+
Linux (Only tested on Linux)
Dependencies in requirements.txt

Installation

git clone https://github.com/your-username/llamamodel.git
cd /path/to/llamamodel
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Configuration

Parameter	Default	Description
port	`8081`	HTTP port for the web app.
models_dir	`~/.cache/huggingface/models`	Base directory for Hugging Face downloads. Downloads are saved as `author/model/file`. `models.ini` is stored as `models_dir/models.ini`.

Configure via:

Web UI: Navigate to the "Settings" tab in the application.

config.yaml in the project root:

port: 8081
models_dir: ~/.cache/huggingface/models

Environment variables (override config file):
- LLAMAMODEL_PORT – port number
- LLAMAMODEL_MODELS_DIR – path to models directory

Running

python run.py

Or with uvicorn directly:

uvicorn app.main:app --host 0.0.0.0 --port 8081

Then open http://localhost:8081 (or the port you configured) in your browser.

How to Use

Launch the App: Run the application using the commands above and open the UI in your browser.
Find Models: Go to the Discover page to search for GGUF models. Filter your search by utilizing available tags.
Select and Download: Click on a model to open its details. In the right panel, you'll see a list of available quantizations alongside file sizes. Click on the quantization to initiate the download.
Monitor Progress: Keep track of the real-time download bar tracking speed, size, and ETA directly on the Discover page. You can cancel downloads midway if necessary.
Manage Parameters: Once downloaded, navigate to the My Models page. Here, you can configure the parameters llama.cpp will use upon initializing the model, such as n-gpu-layers and ctx-size. Changes are actively saved to models.ini.
Start llama.cpp Server: Serve your chosen model utilizing the newly formatted models.ini file:
```
llama-server --model-config ~/.cache/huggingface/models/models.ini
```

License

This project is licensed under the GPL-2.0 License. See the LICENSE.md file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.cursor/plans		.cursor/plans
app		app
doc		doc
static		static
templates		templates
.gitignore		.gitignore
Discover.png		Discover.png
LICENSE.md		LICENSE.md
MyModels.png		MyModels.png
README.md		README.md
Settings.png		Settings.png
config.yaml		config.yaml
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LlamaModel

Screenshots

Discover

My Models

Settings

Features

Requirements

Installation

Configuration

Running

How to Use

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LlamaModel

Screenshots

Discover

My Models

Settings

Features

Requirements

Installation

Configuration

Running

How to Use

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages