ashuhlee/data-analytics-labs

Data Analysis & Visualization Coursework

A portfolio of labs and assignments completed for a course at Western University. The work covers a progression from core Python programming to data analysis, visualization, network analysis, geospatial mapping, and natural language processing.

📂 Assignments

Spotify Dataset Analysis

Exploratory data analysis of a Spotify tracks dataset using pandas. Covers data cleaning and transformation, descriptive statistics, boolean filtering, and data visualization with matplotlib.

Visualizations:

Betweenness Chart, Relation Chart

eBook Dataset Analysis

String parsing and data manipulation in Python using a Project Gutenberg eBook dataset. Covers string slicing and splitting, custom sorting functions, dictionary aggregation, and list manipulation.

Visualizations:

Yearly Downloads Bar Chart, Yearly Downloads Scatter Plot
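
The string-parsing steps listed for this assignment can be sketched roughly as follows. This is a minimal illustration, not the assignment's actual code: the pipe-delimited record layout, the sample rows, and the helper names `parse_record` and `surname_key` are all assumptions made for the example.

```python
# Hypothetical records: the real Project Gutenberg file layout is not shown
# in this README, so a pipe-delimited "title|author|year|downloads" line is assumed.
records = [
    "Pride and Prejudice|Jane Austen|1998|5200",
    "Moby Dick|Herman Melville|2001|3100",
    "Emma|Jane Austen|1998|1800",
    "Dracula|Bram Stoker|2001|4000",
]


def parse_record(line):
    """Split one delimited line into a dictionary with typed fields."""
    title, author, year, downloads = line.split("|")
    return {
        "title": title,
        "author": author,
        "year": int(year),
        "downloads": int(downloads),
    }


def surname_key(book):
    """Custom sort key: author's surname first, then title."""
    surname = book["author"].split()[-1]
    return (surname, book["title"])


books = [parse_record(line) for line in records]
books_by_surname = sorted(books, key=surname_key)

# Dictionary aggregation: total downloads per year.
downloads_by_year = {}
for book in books:
    year = book["year"]
    downloads_by_year[year] = downloads_by_year.get(year, 0) + book["downloads"]

# String slicing: first three letters of each title, upper-cased.
short_codes = [book["title"][:3].upper() for book in books_by_surname]
```

A per-year dictionary like `downloads_by_year` is the kind of structure that could feed a yearly downloads bar chart or scatter plot.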

🔬 Labs

Network Analysis

Introduction to network analysis using NetworkX and pandas. Covers graph theory concepts, building graphs from CSV data, computing weighted degree and betweenness centrality, and visualizing networks with node sizing scaled to centrality values.

Visualizations:

Betweenness Chart, Relation Chart
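
As a rough sketch of the workflow this lab describes (edge list from CSV, weighted degree, betweenness centrality, centrality-scaled node sizes), something like the following would work. The column names `source`, `target`, and `weight`, the sample edges, and the size formula are assumptions for illustration, not the lab's actual data.

```python
import io

import networkx as nx
import pandas as pd

# Hypothetical edge list standing in for the lab's CSV file.
csv_text = """source,target,weight
Ann,Bob,3
Bob,Cal,1
Cal,Dee,2
Bob,Dee,4
"""
edges = pd.read_csv(io.StringIO(csv_text))

# Build an undirected graph, keeping the weight column as an edge attribute.
G = nx.from_pandas_edgelist(
    edges, source="source", target="target", edge_attr="weight"
)

# Weighted degree: sum of the weights on each node's edges.
weighted_degree = dict(G.degree(weight="weight"))

# Betweenness centrality (unweighted shortest paths, normalized by default).
betweenness = nx.betweenness_centrality(G)

# Scale node sizes by centrality; the base size and multiplier are arbitrary.
node_sizes = [300 + 2000 * betweenness[node] for node in G.nodes()]

# With matplotlib installed, the graph could then be drawn with:
# nx.draw(G, node_size=node_sizes, with_labels=True)
```

In this toy graph every path to `Ann` runs through `Bob`, so `Bob` is the only node with non-zero betweenness and would be drawn largest.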

Movies Dataset Analysis I

Pandas lab using a Hollywood movies dataset (2,000 films). Covers loading and inspecting data, dropping and renaming columns, creating derived columns, sorting, and slicing DataFrames.
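
A minimal sketch of the operations named above, using a tiny stand-in DataFrame. The column names and values here are hypothetical; the real lab loads the Hollywood movies file instead.

```python
import pandas as pd

# Hypothetical rows and column names standing in for the movies dataset.
movies = pd.DataFrame(
    {
        "Film": ["A", "B", "C"],
        "Budget ($m)": [50.0, 20.0, 100.0],
        "Worldwide Gross ($m)": [150.0, 90.0, 120.0],
        "Notes": ["", "", ""],
    }
)

# Drop a column that is not needed, then rename the rest to short names.
movies = movies.drop(columns=["Notes"])
movies = movies.rename(
    columns={"Budget ($m)": "budget", "Worldwide Gross ($m)": "gross"}
)

# Derived column: profit computed from two existing columns.
movies["profit"] = movies["gross"] - movies["budget"]

# Sort by the derived column and slice off the top two rows.
ranked = movies.sort_values("profit", ascending=False)
top_two = ranked.head(2)
```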

Movies Dataset Analysis II

Structured data analysis and visualization using an extended movies dataset. Covers handling missing values, boolean filtering, label-based and positional slicing, value counts, and pie chart visualization with matplotlib.
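
The techniques listed for this lab might look roughly like the sketch below. The `genre` and `rating` columns, the row labels, and the 7.0 threshold are invented for illustration and are not taken from the lab's dataset.

```python
import pandas as pd

# Hypothetical data with a few missing values.
films = pd.DataFrame(
    {
        "genre": ["Drama", "Comedy", None, "Drama", "Action"],
        "rating": [7.1, None, 6.0, 8.2, 5.5],
    },
    index=["f1", "f2", "f3", "f4", "f5"],
)

# Handling missing values: count them per column, then drop incomplete rows.
missing_per_column = films.isna().sum()
complete = films.dropna()

# Boolean filtering: keep rows whose rating meets a threshold.
well_rated = complete[complete["rating"] >= 7.0]

# Label-based slicing with .loc includes the end label.
by_label = complete.loc["f1":"f4", "genre"]

# Positional slicing with .iloc excludes the end position.
by_position = complete.iloc[0:2]

# Value counts: how many films fall in each genre.
genre_counts = complete["genre"].value_counts()

# With matplotlib installed, the counts could be drawn as a pie chart:
# genre_counts.plot.pie(autopct="%1.0f%%")
```

Note the difference between the two slicing styles: `.loc["f1":"f4"]` keeps the row labelled `f4`, while `.iloc[0:2]` stops before position 2.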

Sentiment Analysis

text-analysis.ipynb

Sentiment analysis using spaCy and SpaCyTextBlob on the 2017 Beauty and the Beast screenplay. Covers polarity and subjectivity scoring, interpreting sentiment results, analyzing the full screenplay sentence by sentence, and visualizing sentiment trends across the plot.

Geocoding & Interactive Maps

geocoding.ipynb

Geocoding and interactive map visualization using geopy and folium. Covers coordinate lookup, location attributes, multi-result geocoding, and adding custom markers to an interactive map centered on London, Ontario.

🛠️ Tools & Libraries

Data Science

| Library | Purpose |
| --- | --- |
| pandas | Data manipulation and analysis |
| NumPy | Multi-dimensional arrays and matrix support |

Data Visualization

| Library | Purpose |
| --- | --- |
| matplotlib | Static data visualization |
| folium | Interactive map visualization |

Network Analysis

| Library | Purpose |
| --- | --- |
| NetworkX | Building graphs and analyzing relationships and communication patterns within networks |

Natural Language Processing

| Library | Purpose |
| --- | --- |
| spaCy | Advanced NLP pipeline and text processing |
| SpaCyTextBlob | Sentiment analysis via polarity and subjectivity scoring |
