GitHub - privateai-com/research-ai: AI assistant that solves your research tasks by searching multiple sources

What It Does • Architecture • Quick Setup • Usage • Docs

Overview

research-ai — agentic research assistant that transforms your research goals into actionable insights through a sophisticated multi-stage pipeline that intelligently manages tasks, searches multiple sources, and delivers personalized results.

Key Features

🎯 Smart Task Management

Queue-based task processing with priority management
Cycle-based execution with configurable limits per user plan
Intelligent notifications when cycle limits are reached
Automatic task status tracking and progress monitoring

🔍 Multi-Source Intelligence

Automatic source selection per query (arXiv, Google Scholar, PubMed, GitHub)
BM25 ranking over title + abstract for relevance scoring
LLM-powered analysis with configurable relevance thresholds
Structured result presentation with actionable insights

⚡ User-Centric Experience

Free (5 cycles) and Premium (100 cycles) user plans
Real-time progress tracking and status updates
Group chat support for team collaboration
Personalized notification preferences

Example Tasks

"Find practical studies on vision transformers for medical imaging"
"Summarize top benchmarks for small-context RAG"
"Discover recent diffusion methods for texture generation"

sequenceDiagram
  participant U as User
  participant B as Telegram Bot
  participant Q as Task Queue
  participant AG as Agent Manager
  participant P as Pipeline
  participant S as Sources
  participant LLM as LLM Provider

  U->>B: /task "research description"
  B->>Q: create task with priority
  AG->>Q: fetch next queued task
  AG->>P: run research pipeline
  P->>S: multi-source search
  S-->>P: candidates
  P->>LLM: analyze relevance
  LLM-->>P: structured analysis
  P->>AG: pipeline results
  AG->>Q: update task status
  AG->>B: cycle limit notification
  B->>U: personalized report

Architecture

Modular Design

research-ai/
├── agent/                # AI Agent System
│   ├── manager.py        # Task processing orchestrator
│   ├── pipeline/         # Research pipeline stages
│   └── browsing/         # Source-specific tools
├── bot/                  # Telegram Bot Interface
│   ├── handlers/         # Command handlers
│   └── notifications/    # Notification system
├── api/                  # REST API Interface
├── shared/               # Shared Components
│   ├── database/         # Modular Database Layer
│   │   ├── operations/   # Database operations
│   │   ├── models.py     # SQLAlchemy ORM models
│   │   └── enums.py      # Database enums
│   ├── llm.py            # LLM configuration
│   └── logging.py        # Centralized logging
└── tests/                # Test Suite

Database Architecture

Core Models:

User - User management with plan-based limits
UserTask - Enhanced task model with queue support
TaskQueue - Priority-based task processing queue
RateLimitRecord - Anti-spam protection
TaskStatistics - Processing metrics and analytics

Agent Models:

ResearchTopic - Research topic management
ArxivPaper - Paper storage and metadata
PaperAnalysis - Relevance analysis results
UserSettings - User preferences and thresholds

Task Processing Flow

Task Creation → User creates task via bot/API
Queue Management → Task added to priority queue
Agent Processing → Agent picks next task from queue
Pipeline Execution → Multi-stage research pipeline
Cycle Management → Track cycles, notify on limits
Result Delivery → Personalized notifications to user

Quick Setup

Prerequisites

Python 3.10+
Telegram Bot Token (create one)
OpenAI API Key or local LLM setup

1️⃣ Install Dependencies

# Install uv package manager (see docs: https://docs.astral.sh/uv/)
curl -LsSf https://astral.sh/uv/install.sh | sh

# Install project dependencies
uv sync

2️⃣ Configure Environment

Create .env file in the project root:

# Bot Configuration
TELEGRAM_BOT_TOKEN=your-telegram-bot-token

# LLM Configuration
# if you have an API key for OpenAI:
# OPENAI_API_KEY=your-openai-key-here
# if you use OpenRouter provider:
# OPENAI_API_KEY=
OPENAI_API_KEY=your-openai-api-key
OPENROUTER_API_KEY=your-openrouter-key

# Database Configuration
DATABASE_URL=sqlite:///./database.db

# Agent Configuration
AGENT_POLL_SECONDS=30
AGENT_DRY_RUN=false
AGENT_ID=main_agent

# Pipeline Configuration
PIPELINE_USE_AGENTS_STRATEGY=1
PIPELINE_USE_AGENTS_ANALYZE=0

3️⃣ Configure LLM

Choose your model in shared/llm.py by changing the AGENT_MODEL variable. You will need an API key for the model you choose (except for Ollama local models). You can also add your own models like this:

_my_model_provider = AsyncOpenAI(
    base_url="https://api.my-model-provider.com/v1",
    api_key=os.getenv("MY_MODEL_API_KEY")
)

MyModel = OpenAIChatCompletionsModel(
    model="my-model",
    openai_client=_my_model_provider
)

See shared/llm.py for more examples.

4️⃣ Launch System

# 🚀 Start everything
uv run python main.py

# Or run components separately:
uv run python start_bot.py    # 🤖 Bot only
uv run python start_agent.py  # 🧠 Agent only
uv run python start_api.py    # 🌐 REST API on http://localhost:8000

Usage

User Plans

Free Plan:

5 tasks per day
1 concurrent task
5 search cycles per task
Basic notifications

Premium Plan:

100 tasks per day
5 concurrent tasks
100 search cycles per task
Advanced notifications and analytics

Bot Commands

Category	Command	Description	Example
Tasks	`/task`	Create new research task	`/task "AI for medical imaging"`
	`/status`	View task status and progress	-
	`/history`	Browse task results	-
	`/pause_task`	Pause specific task	`/pause_task 123`
	`/resume_task`	Resume paused task	`/resume_task 123`
	`/cancel_task`	Cancel running task	`/cancel_task 123`
Settings	`/set_notification`	Configure notification thresholds	`/set_notification instant 80`
	`/set_group`	Enable group notifications	-
	`/unset_group`	Disable group notifications	-
	`/upgrade`	Upgrade to Premium plan	-
	`/help`	Show help and commands	-

Task Lifecycle

Creation → Task added to queue with priority
Processing → Agent executes research pipeline
Cycles → Multiple search cycles (5 for Free, 100 for Premium)
Completion → Results delivered with cycle limit notification
History → Results available for review

Cycle Limit Notifications

When a task reaches its cycle limit, users receive personalized notifications:

With Results:

🎉 Task #1 completed!
✅ Found results for your query
🔄 Cycles completed: 5/5 (Plan: Free)
🤖 Hope the results were helpful!
💡 Want to continue research?
• Create new task with refined query
• Or upgrade to Premium for unlimited cycles

Without Results:

🔄 Task #5 completed
📝 Your query description...
🔄 Cycles completed: 5/5 (Plan: Free)
❌ No results found for this query
💡 Recommendations:
• Try reformulating your query
• Use different keywords
• Or upgrade to Premium for more cycles

REST API

Start the API:

uv run python start_api.py

Health check:

curl http://localhost:8000/healthz

Run the pipeline:

curl -X POST http://localhost:8000/v1/run \
  -H "Content-Type: application/json" \
  -d '{
    "query": "AI for medical imaging",
    "categories": ["cs.CV"],
    "max_queries": 5,
    "bm25_top_k": 20,
    "max_analyze": 10,
    "min_relevance": 50.0
  }'

📄 License

This project is licensed under the MIT License. See the LICENSE file for details.

Give this project a ⭐ if you found it useful!

⬆️ Back to top

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
.github/workflows		.github/workflows
agent		agent
api		api
assets		assets
bot		bot
docs		docs
shared		shared
tests		tests
.env.example		.env.example
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
quality-check.py		quality-check.py
start_agent.py		start_agent.py
start_api.py		start_api.py
start_bot.py		start_bot.py
trace_imports.py		trace_imports.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Overview

Key Features

Example Tasks

Architecture

Modular Design

Database Architecture

Task Processing Flow

Quick Setup

1️⃣ Install Dependencies

2️⃣ Configure Environment

3️⃣ Configure LLM

4️⃣ Launch System

Usage

User Plans

Bot Commands

Task Lifecycle

Cycle Limit Notifications

REST API

📄 License

About

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Overview

Key Features

Example Tasks

Architecture

Modular Design

Database Architecture

Task Processing Flow

Quick Setup

1️⃣ Install Dependencies

2️⃣ Configure Environment

3️⃣ Configure LLM

4️⃣ Launch System

Usage

User Plans

Bot Commands

Task Lifecycle

Cycle Limit Notifications

REST API

📄 License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages