Interactive Streamlit app for mining scientific literature using pretrained Named-Entity-Recognition (NER) models.
The Scientific Named Entity Explorer is an interactive tool for automated extraction of scientific entities from research abstracts.
It supports upload from Excel sheets and provides:
- 🧠 Pretrained scientific NER models
- 🎛️ Rich filters & configurable thresholds
- 🔍 Entity-level visualization as colored chips
- 📊 Summaries and downloadable reports
- ⚡ Fast local execution via Streamlit
- Prerequisites
- Quick Start
- Step-by-Step Installation
- Using the App
- Troubleshooting
- Screenshots
- Contributing
- Python 3.10+
git(optional)- Internet connection (initial model download)
⚠️ Note: Pretrained models and PyTorch wheels are large. Ensure sufficient disk space.
git clone https://github.com/Rkl2023/Named_entity_algorithm_project.git
cd Named_entity_algorithm_project
python3 install.py
source .venv/bin/activate # Windows: .venv\Scripts\activate
streamlit run app.pygit clone https://github.com/k25063738/Named_entity_algorithm_project
cd Named_entity_algorithm_projectpython3 -m venv .venv
source .venv/bin/activate # macOS/Linux
# .venv\Scripts\activate # Windowspip install --upgrade pip
pip install -r requirements.txtstreamlit run app.pyPrefer automation?
python3 install.pyhandles everything for you.
- Activate your virtual environment
- Run:
streamlit run app.py
- Open Streamlit tab (default: http://localhost:8501)
- Select NER model + confidence threshold
- Upload Excel file (Title + Abstract required)
- Click Run NER Extraction to:
- Process abstracts with model
- Display entities as colored chips
- Filter by keyword / entity type
- Download results as CSV / JSON / Excel
pip install --upgrade pipApple Silicon users may need wheels from:
👉 https://pytorch.org/
First run downloads large models.
Subsequent runs use:
~/.cache/huggingface/
Activate virtual environment:
source .venv/bin/activateEnsure required columns:
TitleAbstract
- Split very large Excel files
- Increase confidence threshold
Feel free to open issues or submit PRs.
⭐ If you find this project useful, consider giving it a star!

