de-densification-2

Disclaimer: Courses are subject to change and are classified in different ways. The scraper is a student-created tool. The Spectator’s Data Visualization team did their best to filter classes faithfully based on information provided by the University. Because of these potential errors, these results might not be replicable. Find our data, code, and methodology here.

The data:

Per University communications about departments subject to de-densification, we filtered classes to only include classes offered by the Faculty of Arts and Sciences and the School of Engineering and Applied Science. Classes without a start or stop time listed were excluded. Classes with variations and abbreviations on the words “laboratory,” “recitation,” and “discussion” in their names were excluded, in accordance with how the University measured departments meeting the 40 percent de-densification cap. All course data was scraped from the Columbia directory of classes on April 22nd. Generative artificial intelligence tools were utilized in the creation of the web-scraping algorithm.

'desenificationgfx.ipynb' cleans raw course data and finds all the classes in session during 30-minute increments for every day of the week.

'desenificationgfx (1).ipynb' cleans raw course data and sorts classes into time buckets based on class times.

'desenificationgfx (1).ipynb' cleans raw course data and finds the percent of classes in each department starting from 10 am (inclusive)-2 pm (exclusive).

The scraper:

'new_scraper.py' scrapes the CU Directory of Classes and Vergil to retrieve course data. It scrapes the CU Directory of Classes for the department code. Our scraper manually adds the department codes for departments in the Faculty of Arts and Sciences and the School of Engineering and Applied Science. The scraper can be run by editing the year and semester when running 'scrape_courses()'. This follows the key ' (spring - 1, summer - 2, fall - 3)'

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
output		output
time_blocks		time_blocks
.DS_Store		.DS_Store
FILTERED_fall_26.xls		FILTERED_fall_26.xls
README.md		README.md
desenificationgfx (1).ipynb		desenificationgfx (1).ipynb
desenificationgfx (2).ipynb		desenificationgfx (2).ipynb
desenificationgfx (3).ipynb		desenificationgfx (3).ipynb
desenificationgfx.ipynb		desenificationgfx.ipynb
df_filtered.xls		df_filtered.xls
df_no_start (1).xls		df_no_start (1).xls
df_no_start (2).xls		df_no_start (2).xls
df_no_start (3).xls		df_no_start (3).xls
df_no_start.xls		df_no_start.xls
fall 25 new (2).csv		fall 25 new (2).csv
fall 26 new (2).csv		fall 26 new (2).csv
new_scraper.py		new_scraper.py
spring 26 new (2).csv		spring 26 new (2).csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

de-densification-2

The data:

The scraper:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

de-densification-2

The data:

The scraper:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages