15418-final-project

Parallel Union-Find Implementations in C++/OpenMP

This project implements and compares several sequential and parallel versions of the Union-Find (Disjoint Set Union) data structure using C++20 and OpenMP. The goal is to analyze the performance and scalability trade-offs of different parallelization strategies, including coarse-grained locking, fine-grained locking, and lock-free techniques.

Features

Sequential Baseline: An optimized serial Union-Find implementation (UnionFind).
Coarse-Grained Locking: Parallel execution protected by a single global mutex (UnionFindParallelCoarse).
Fine-Grained Locking: Parallel execution using per-element locks (primarily for roots) during union operations, with best-effort path compression (UnionFindParallelFine).
Lock-Free (Baseline): Lock-free implementation using std::atomic<int> encoding parent/rank and Compare-and-Swap (CAS) based path compression (UnionFindParallelLockFree).
Lock-Free Optimizations:
- Path compaction using plain atomic writes (UnionFindParallelLockFreePlainWrite).
- Immediate Parent Check (IPC) heuristic (UnionFindParallelLockFreeIPC).
Dataset Generator: Python script to generate workloads with varying parameters (size, operation mix, contention).
Correctness Test: Verifies parallel implementations against the serial baseline based on final connectivity.
Benchmark Suite: Measures performance (wall-clock time) of different implementations under various workloads and thread counts.

Requirements

Compiler: A C++ compiler supporting C++20 and OpenMP (e.g., g++ version 10 or later).
Build System: make.
Dataset Generation: Python 3.
Operating System: Developed and tested on Linux. Lock-free implementations rely on efficient native atomic support.

Building the Code

The project uses a Makefile for building the library and executables.

Enabling Parallel Implementations:

Before building, you can control which parallel implementations are compiled by setting environment variables or modifying the Makefile. The key variables are:

COARSE: Set to 1 to enable the Coarse-Grained implementation.
FINE: Set to 1 to enable the Fine-Grained implementation.
LOCKFREE: Set to 1 to enable the baseline Lock-Free implementation.
LOCKFREE_PLAIN: Set to 1 to enable the Lock-Free (Plain Write) implementation.
LOCKFREE_IPC: Set to 1 to enable the Lock-Free (IPC) implementation.

Example: To enable and build all implementations:

export COARSE=1 FINE=1 LOCKFREE=1 LOCKFREE_PLAIN=1 LOCKFREE_IPC=1
make

Build Commands:

Simply use make to build the project and make clean to delete all compiled files

Generating Datasets

Use the generate_operations.py script to create input files for testing and benchmarking.

python generate_operations.py <n_elements> <n_operations> <output_file> [options]

Key arguments:

n_elements: Number of disjoint elements (e.g., 1000000).
n_operations: Total number of operations (e.g., 10000000).
output_file: Path to save the generated file (e.g., tests/resources/ops_1M_10M_c0.5.txt).

Options:

--find-ratio : Target ratio of FIND operations (default: 0.5).
--sameset-ratio : Target ratio of SAMESET among non-FIND ops (default: 0.1).
--contention-level : Focus level for hot element (0.0=uniform, 1.0=high focus, default: 0.0).
--hot-element : Index of the element for focused contention (default: 0).
--extreme-contention: Flag to force all operations onto elements 0 and 1.
--seed : Optional random seed for reproducibility.

Running Correctness Tests:

Verify parallel implementations against the serial baseline:

./test_parallel_correctness <operations_file>

Example:

./test_parallel_correctness tests/resources/uniform.txt

The test will output PASS or FAIL for each enabled parallel implementation.

Running Benchmarks:**

Measure execution time for different implementations:

./benchmark <implementation_type> <operations_file> <num_runs> [num_threads]

<implementation_type>: serial, coarse, fine, lockfree, lockfree_plain, or lockfree_ipc.
<operations_file>: Path to the dataset file.
<num_runs>: Number of benchmark repetitions.
[num_threads]: (Optional) Number of OpenMP threads. Defaults to maximum available.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
benchmarks		benchmarks
include		include
scripts		scripts
src		src
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
benchmark		benchmark
libunionfind.a		libunionfind.a
perf.data		perf.data
test_parallel_correctness		test_parallel_correctness
test_serial_correctness		test_serial_correctness

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

15418-final-project

Parallel Union-Find Implementations in C++/OpenMP

Features

Requirements

Building the Code

Generating Datasets

Running Correctness Tests:

Running Benchmarks:**

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

15418-final-project

Parallel Union-Find Implementations in C++/OpenMP

Features

Requirements

Building the Code

Generating Datasets

Running Correctness Tests:

Running Benchmarks:**

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages