Skip to content

taezeem14/DeepRecon

Repository files navigation

🕵️‍♂️ DeepRecon

Python License Docker AI DB Tor

DeepRecon is an advanced, AI-powered framework for conducting automated dark web OSINT investigations on the Tor (.onion) network.

It leverages Multi-Engine Meta-Searches, Local LLMs (Ollama) to refine and summarize findings, auto-scaling SOCKS5 crawlers, and automated tech-stack fingerprinting to map hidden services effortlessly.

FeaturesInstallationUsageArchitectureAcknowledgements


Note

DeepRecon requires a native Tor service to route traffic through the anonymity network safely. Ensure your Tor daemon is active before kicking off crawls.


✨ Features

  • 🧠 AI Investigation Analysis – Deeply integrated with local LLMs (via Ollama) to automatically digest, summarize, and contextualize dark web domains and scraped payloads.
  • 🧅 Global Dark-Web Metasearch Engine – Directly query a dozen curated Dark Web Search Engines asynchronously (Ahmia, OnionLand, Torch, etc.) to discover hidden services before dispatching crawlers.
  • 🏎️ Ludicrous Speed (Async SOCKS5) – Built entirely around aiohttp and asyncio, utilizing non-blocking connection scaling to blitz through nodes exponentially faster than threaded alternatives.
  • 🐳 Docker-Ready Deployments – Spin up the environment entirely isolated through Docker Compose.
  • ⚙️ Modular Architecture – Clean separation between the engine, extraction rules, and storage workflows, allowing instant extensibility.
  • 🌐 Web UI – A local FastAPI/Tailwind-based interactive dashboard for seamless investigations without terminal clutter.
  • 💻 Elite Terminal CLI – Beautiful rich-powered interactive non-ending loops natively from the terminal.
  • 🔍 Full-Text Search (FTS5) – Blazing fast storage utilizing SQLite FTS5 index engines for gigabytes of HTML parsing logic.
  • 🤖 Auto-Extraction Plugins – Dynamically loadable artifact extractors (BTC sniffing, Fingerprinting, NLP).

🏗️ Architecture

DeepRecon removes monolithic messes. The ecosystem is fully decoupled for instant upgrades:

  • core/: The heavy-lifting crawler and HTML parser logic (crawler.py). Features resilient memory caching and Breadth-First Traversal (BFS) duplicate avoidance.
  • plugins/: Extractor engines (BasePlugin). Drop a custom script here, and the framework automatically consumes it. No manual configs needed.
  • storage/: Atomic-replacement SQLite engines handling Full-Text Search and relational nodes seamlessly.
  • web/ & main.py: Fast frontends acting as proxies to the background logic.

⚠️ Disclaimer

Warning

This tool is intended for educational and lawful ethical security research purposes only. Accessing or interacting with certain hidden services on the dark web may be heavily regulated or illegal depending on your jurisdiction. The author and contributors hold no liability for misuse of this tool or the intelligence gathered with it.

We strongly advise enforcing robust OPSEC hygiene, utilizing dedicated sandbox environments alongside proper proxy configurations. Use responsibly and at your absolute own risk.


💾 Installation

DeepRecon packages an auto-installation script that provisions required system binaries, handles Tor network installation, and globally registers the deeprecon terminal binary to your path.

Docker [Recommended]

Avoid installing local binaries by pulling up the pre-configured DeepRecon Docker instance:

docker-compose up --build -d

All databases and reports will persist natively to your local ./storage and ./reports directories.

Linux / Debian / Kali

Run the installation script to configure Python virtual environments, pip, and Tor:

git clone https://github.com/taezeem14/DeepRecon.git
cd DeepRecon
chmod +x install.sh
./install.sh

Windows

Run the batch configuration:

git clone https://github.com/taezeem14/DeepRecon.git
cd DeepRecon
install.bat

Tip

Windows Users: You will need to manually download and let the Tor Expert Bundle run in the background on port 9050 before performing reconnaissance.


🎮 Usage

DeepRecon supports both interactive CLI scraping and a point-and-click browser interface.

🌐 Web Interface Mode

Launch the local web dashboard for intuitive intelligence gathering:

deeprecon --web --port 8000

Navigate to http://localhost:8000 to start your asynchronous intelligence pipelines.

💻 Elite CLI Mode

Stay native in the terminal. Launch the interactive framework:

deeprecon --cli

Follow the interactive rich-loops for executing deep-crawls, exporting PDFs, and searching target domains against Regex statements.


🤝 Contributing

Contributions are fundamentally vital to open-source OSINT development! Please feel free to submit Pull Requests to improve the engine.

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/NewPlugin)
  3. Commit your Changes (git commit -m 'Add some NewPlugin')
  4. Push to the Branch (git push origin feature/NewPlugin)
  5. Open a Pull Request

� Acknowledgements

Special shoutout and credit to Apurv Singh Gautam and his project Robin. DeepRecon's AI integration layer, Dark Web Meta-Search structure, and beautiful README aesthetics were heavily inspired by Robin. Highly recommend checking out his implementations of OSINT tooling!


�📝 License

Distributed under the MIT License. See LICENSE for more information.

About

A Tool For Dark Web Recon

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages