OSF2

Data Mining Class Exercise 2 for Olga, Simon and Fabian

Folder Structure

scripts includes all R scripts needed to reproduce this project
output contains the outputs generated by the R scripts, including the knitted HTML report

To Do List

Set up API: Simon
Create corpus of Guardian articles on the company Amazon: Fabian
- NER Classifier not 100% accurate: Includes mentions of the rainforest (see Word Cloud)
Sentiment analysis of corpus and 2-3 sentences on the analysis: Simon
Word cloud and / or word frequencies of corpus and 2-3 sentences on the analysis: Olga
Topic modelling of corpus and 2-3 sentences on the analysis: Fabian
Create final report: Fabian

Andrea's Feedback from CE1

You worked reproducibly using advanced features of GitHub (e.g., the todo list!). The substantive part (the idea, the research question...) is usually not considered in this seminar, but in your case is really well-developed and so it boosted the grade a but. It could lead to a reseach paper. If you want to work at it together I am willing to supervise. Excellent! You missed the 6.0 grade because you did not use issues and had only one PR, ideally each one of you would have made one. Also, you could do a few more commits to practice (it's below average).

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
output		output
scripts		scripts
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OSF2

Folder Structure

To Do List

Andrea's Feedback from CE1

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

OSF2

Folder Structure

To Do List

Andrea's Feedback from CE1

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages