Final project for CS 5010: Python for Data Analysis Bev Dobrenz (bgd5de), Joseph Wysocki (jw6mw), Amanda West (acw9gs), Nikki Aaron (na5zn)
- Paper: 1. Write-Up
- Powerpoint: 2. Powerpoint Slides
Original dataset: winemag-data-130k-v2.csv Original grape Variety list (Wikipedia): grape_list.csv
- Parser script: grape_list_parser.py
- Output: grape_list_parsed.csv
- Unit test: grape_list_parser_test.py
- Matching script: grape_id.py
- Unit test: grape_id_test.py
- Output: wine.csv
- NLP Practice: Text_Mining_Suite.ipynb
- Definitions: 3. NLP Definitions.pdf
- Test with One Paragraph Box: Desc_Test_1_Box.ipynb
- Original Categorizer Script: Add_Categories.ipynb
- Chunked Processing Categorizer Script: Add_Categories.py
- Output: wine2.csv
- Application Script: filters.py
- Sample Output: recommendations.txt
