LLM Comparison Tool

A Streamlit web app for comparing AI language models side by side on pricing, performance, and benchmark scores.

What it does

Select any two models from 400+ LLMs and compare them on:

Response Latency — how fast the model starts responding
Output Speed — tokens generated per second
Intelligence, Code, and Math Index — benchmark scores from Artificial Analysis
Prompt Pricing — cost per input token
Context Length — maximum number of tokens (prompt plus response) the model can process in a single request
Max Completion Tokens — maximum length of the model's response
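
As a rough illustration of what a side-by-side comparison involves, the sketch below picks a per-metric winner between two models and estimates the cost of a prompt from a per-token price. It is not taken from the app's code: the `ModelStats` fields, the `compare` and `prompt_cost` helpers, and all sample values are hypothetical placeholders, and only one of the three benchmark indices is shown.

```python
from dataclasses import dataclass


@dataclass
class ModelStats:
    """Hypothetical container for the metrics listed above."""
    name: str
    latency_s: float               # seconds until the first token arrives
    output_tps: float              # output tokens per second
    intelligence_index: float      # benchmark score
    prompt_price_per_token: float  # USD per input token
    context_length: int            # tokens
    max_completion_tokens: int     # tokens


# True means a higher value is better for that metric.
HIGHER_IS_BETTER = {
    "latency_s": False,
    "output_tps": True,
    "intelligence_index": True,
    "prompt_price_per_token": False,
    "context_length": True,
    "max_completion_tokens": True,
}


def compare(a: ModelStats, b: ModelStats) -> dict:
    """Return {metric: name of the better model, or 'tie'}."""
    winners = {}
    for metric, higher in HIGHER_IS_BETTER.items():
        va, vb = getattr(a, metric), getattr(b, metric)
        if va == vb:
            winners[metric] = "tie"
        elif (va > vb) == higher:
            winners[metric] = a.name
        else:
            winners[metric] = b.name
    return winners


def prompt_cost(model: ModelStats, prompt_tokens: int) -> float:
    """Estimated USD cost of sending prompt_tokens input tokens."""
    return model.prompt_price_per_token * prompt_tokens


# Placeholder numbers, not real measurements.
model_a = ModelStats("model-a", 0.4, 120.0, 55.0, 3e-6, 200_000, 8_192)
model_b = ModelStats("model-b", 0.9, 80.0, 62.0, 1e-6, 128_000, 16_384)

winners = compare(model_a, model_b)
for metric, winner in winners.items():
    print(f"{metric:>24}: {winner}")
print(f"10k-token prompt on model-a: ${prompt_cost(model_a, 10_000):.4f}")
```

Latency and price are treated as lower-is-better, everything else as higher-is-better. A real comparison might weight the metrics rather than count wins.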

Data Sources

OpenRouter — pricing, context length, token limits
Artificial Analysis — benchmark scores, latency, output speed
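
Because the two sources may not spell model names identically, combining them presumably needs some form of approximate name matching. The sketch below shows one way to do that. It is an assumption about the approach, not the app's actual code: the row fields, the `normalize`, `best_match`, and `merge_sources` helpers, and the threshold of 85 are all hypothetical. It uses the standard library's `difflib` so that it runs without extra installs; `rapidfuzz.fuzz.ratio` from the stack could be swapped in for `similarity`, though its scores are computed differently and the threshold would need retuning.

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase and keep only letters and digits."""
    return "".join(ch for ch in name.lower() if ch.isalnum())


def similarity(a: str, b: str) -> float:
    """Similarity of two names on a 0-100 scale."""
    return 100.0 * SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def best_match(name, candidates, threshold=85.0):
    """Return the closest candidate, or None if nothing reaches threshold."""
    best, best_score = None, 0.0
    for candidate in candidates:
        score = similarity(name, candidate)
        if score > best_score:
            best, best_score = candidate, score
    return best if best_score >= threshold else None


def merge_sources(pricing_rows, benchmark_rows, threshold=85.0):
    """Join pricing rows with benchmark rows on fuzzy-matched names."""
    benchmarks_by_name = {row["name"]: row for row in benchmark_rows}
    merged = []
    for row in pricing_rows:
        match = best_match(row["name"], benchmarks_by_name, threshold)
        if match is None:
            continue  # no confident match: leave the model out
        # Pricing fields (including its spelling of the name) take priority.
        merged.append({**benchmarks_by_name[match], **row})
    return merged


# Hypothetical rows; field names are placeholders, not real API fields.
pricing_rows = [
    {"name": "Example Model 1", "prompt_price": 0.000002, "context_length": 128000},
    {"name": "Other-Model Pro", "prompt_price": 0.000005, "context_length": 32000},
]
benchmark_rows = [
    {"name": "example-model-1", "intelligence_index": 50.0, "latency_s": 0.5},
    {"name": "Unrelated Thing", "intelligence_index": 30.0, "latency_s": 1.2},
]

merged = merge_sources(pricing_rows, benchmark_rows)
print(merged)
```

Dropping unmatched models keeps a wrong pairing from silently attaching one model's benchmarks to another's pricing; a lower threshold trades that safety for coverage.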

Stack

Python, Streamlit, Plotly, pandas, rapidfuzz

Setup

Clone the repo
Install dependencies: python -m pip install -r requirements.txt
Start the app: streamlit run tool.py

Author

Lucas Lu — ll207@rice.edu — LinkedIn

