semajyllek/ConnectedComponents

Connected Components

This repository contains Python and Scala scripts that run the Connected Components algorithm proposed in Kiveras et al. (2015). All of them are designed to run on a cluster: the Python script needs access to Hadoop (it is written for Hadoop Streaming), and the Scala scripts need access to Spark.
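For readers new to the problem, the sketch below shows what "connected components" means on a small graph. It is a single-machine union-find baseline, not the distributed algorithm from the paper and not code from this repository. The function name `connected_components` and the edge-list input format are assumptions made for illustration. A baseline like this can be handy for sanity-checking cluster output on small inputs.

```python
def connected_components(edges):
    """Label every node with the smallest node id in its component.

    `edges` is an iterable of (u, v) pairs describing an undirected graph.
    This is an illustrative in-memory baseline (hypothetical helper),
    not the MapReduce/Spark algorithm the repository implements.
    """
    parent = {}

    def find(x):
        # Register unseen nodes as their own root.
        parent.setdefault(x, x)
        # Walk up to the root of x's tree.
        root = x
        while parent[root] != root:
            root = parent[root]
        # Path compression: point every node on the path at the root.
        while parent[x] != root:
            parent[x], x = root, parent[x]
        return root

    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru != rv:
            # Keep the smaller id as the root so labels are deterministic.
            if ru < rv:
                parent[rv] = ru
            else:
                parent[ru] = rv

    return {node: find(node) for node in parent}


if __name__ == "__main__":
    labels = connected_components([(1, 2), (2, 3), (4, 5)])
    print(labels)
```

Nodes that share a label belong to the same component. Using the minimum node id as the label is a design choice for this sketch; it makes results easy to compare across runs.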

kelly_graph_proj.pdf describes the results of my experiments.

About

A project experimenting with different implementations of algorithms for finding connected components of graphs at scale, including Scala Spark, Hadoop, and pure Python implementations (see the PDF for the paper).
