The chatengine provide a way to init and start the chat web system with several type of LLMs. which can run only of the major public cloud.
The init ideal is to support the following models:
- LLama 2 chat
- Qwen
- GPT3, GPT4
The major cloud provider include:
- Amazon public cloud
- Google cloud
- Aliyun
As far as we know there are more and more LLMs published recent months, some of them provided the public api, others publish the code and weights.however, each one has its own visit link and style, it is hard to combine them into one UI, for the following cases:
- compare the two LLMs with the same context and prompt.
- to get more options.
- chat each others to make a sence design.
- prompt optimization.
install the requirements
pip install -r requirements.txt
start the server
python server.py --model_path "meta-llama/Llama-2-7b-chat-hf" --server_name "0.0.0.0"
note: remove the 'server_name' parameter if run on localhost.
This repository is under development. it may occur error when trying to run it, please feel free to open issue if any question.