本项目是一个基于多模型支持的text2sql智能问答系统,结合了向量检索、重排序及大语言模型生成,构建端到端的 text2sql 解决方案。支持的核心模型包括:
- 生成模型:DeepSeek-V3
- 嵌入模型:bge-large-zh-v1.5
- 重排序模型:bge-reranker-large
请根据实际路径修改配置文件:project/server/config.yaml。
vector_store:
type: chroma
name: rag
kwargs:
search_params:
metric_type: "IP"
params: { ef: 128 }
index_params:
metric_type: "IP"
index_type: "HNSW"
params: { M: 16, efConstruction: 200 }knowledge_base:
base_storage_path: path/to/your/knowledge_base
name: talent_engine_deepseek_01llm:
api_key: sk-xxx
base_url: https://xxx/v1
model_name: deepseek-chat
temperature: 1.5
max_tokens: 10000
model_engine: openaiembeddings:
model_name_or_path: text-embedding-ada-002
model_engine: openai
dimensions: 1024
device: cuda:0
base_url: https://api.gptapi.us/v1
api_key: sk-xxxreranker:
model_name_or_path: path/to/your/reranker_model
type: rank
device: cuda:0server:
api_server_host: 0.0.0.0
api_server_port: 8888database:
db_type: mysql
host: 10.10.80.67
port: 3318
user: root
password: hjr
database: talent_engine
echo: truecd project/client
npm install
npm run servecd project/server
uv sync
uv source .venv/bin/activate
python main.py- 本项目已集成多模型协同推理机制;
- 请确保服务器有足够显存支持大模型和 embedding 同时加载;
- 更多模型配置请参考 HuggingFace 和 DeepSeek 官方文档。



