This repository contains the source code and documentation for Project 1 as part of our Semester 4 Team Project. The goal of this project is to build core data mining pipeline components including web scraping, API data collection, data cleaning with Pandas, and data visualization.
- Berkay Paray: Infrastructure, Task 1 (Job Listings Web Scraper & Skills Analysis), Final report
- Güner Özben: Task 2 & 3 (News Headlines Analyzer & Product Price Tracker)
- Aras Taşdemir: Task 4 and 5 (Weather Data Analytics & Public Health Dashboard)
We use a GitHub Projects Kanban Board to track the progress of our tasks and manage workflow transitions.
- Python 3.x
- Pandas
- Beautiful Soup / Scrapy (for Web Scraping)
- Matplotlib / Seaborn (for Visualization)