A multi-source news and discussion aggregator with AI-powered summarization. Scrapes and summarizes news from Reddit and Twitter (X.com) using MCP and Gemini, with a Streamlit frontend and FastAPI backend.
- Scrape and summarize news from Reddit and Twitter (X.com)
- Uses MCP for robust web scraping
- Summarization powered by Google Gemini (via LangChain)
- Async, rate-limited scraping for reliability
- Modular codebase: separate scrapers for each source
- Streamlit frontend and FastAPI backend
- Clone the repository and install dependencies:
pip install -r requirements.txt
- Set up your
.envfile with the following variables:GEMINI_API_KEYAPI_TOKENWEB_UNLOCKER_ZONEMCP_API_KEY(for x_scrapper)
- Run the backend:
python backend.py
- Run the frontend:
streamlit run frontend.py
- Test scrapers directly:
python reddit__scraper.py python x_scrapper.py
reddit__scraper.py— Reddit scraping and summarizationx_scrapper.py— Twitter/X.com scraping and summarizationnews_scraper.py— (Other news sources)models.py,utils.py— Shared models and utilitiesbackend.py,frontend.py— API and UI
MIT