A Streamlit web app for comparing AI language models side by side on pricing, performance, and benchmark scores.
Select any two models from 400+ LLMs and compare them on:
- Response Latency — how fast the model starts responding
- Output Speed — tokens generated per second
- Intelligence, Code, and Math Index — benchmark scores from Artificial Analysis
- Prompt Pricing — cost per input token
- Context Length — how much information the model can hold in memory
- Max Completion Tokens — maximum length of the model's response
- OpenRouter — pricing, context length, token limits
- Artificial Analysis — benchmark scores, latency, output speed
- Python, Streamlit, Plotly, pandas, rapidfuzz
- Clone the repo
- Install dependencies:
python -m pip install -r requirements.txt - Run
streamlit run tool.pyto start the app
Lucas Lu — ll207@rice.edu — LinkedIn