Initialization

CloudLab

Hardware Information: https://docs.cloudlab.us/hardware.html - CloudLab Clemson, nvidiagh Experiment Profile Link: https://www.cloudlab.us/p/gpu-coherence/grace-hopper

Instantiate a new experiment, using the profile listed above on one of the nvidiagh nodes available through CloudLab's Clemson Datacenter.

Start from Scratch

Follow these steps to initialize your CUDA Environment.

sudo DEBIAN_FRONTEND=noninteractive apt purge linux-image-$(uname -r) linux-headers-$(uname -r) linux-modules-$(uname -r) -y
sudo apt update
sudo apt install linux-nvidia-64k-hwe-22.04 -y
sudo reboot now

sudo apt install linux-headers-$(uname -r)
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/sbsa/cuda-keyring_1.1-1_all.deb
sudo dpkg -i cuda-keyring*.deb
sudo apt update
sudo apt install cuda-toolkit-12-4 -y
sudo apt install nvidia-kernel-open-550 cuda-drivers-550 -y
sudo mkdir /lib/systemd/system/nvidia-persistenced.service.d
sudo dd status=none of=/lib/systemd/system/nvidia-persistenced.service.d/override.conf << EOF
[Service]
ExecStart=/usr/bin/nvidia-persistenced --persistence-mode --verbose
[Install]
WantedBy=multi-user.target 
EOF
sudo su
echo "kernel.numa_balancing = 0" >> /etc/sysctl.conf
sudo reboot now
sudo apt install numactl nvidia-cuda-toolkit build-essential -y

Clone this Repository

git clone https://github.com/icsa-caps/ismm26-artifacts.git
git submodule update --init --recursive

Running Grace Litmus Tests

The grace-litmus directory hosts the herdtools7 suite, out of which we require diyone7 and litmus7. Follow their included build instructions (README.md, INSTALL.md), and then do the following to run the specific Message-Passing Litmus Test

Generating Litmus Tests

diyone7 -arch AArch64 -name MP-rlx-rlx PodWW Rfe PodRR Fre
diyone7 -arch AArch64 -name MP-rlx-rel PodWW L Rfe PodRR Fre
diyone7 -arch AArch64 -name MP-rlx-rlx-acqrcsc PodWW Rfe A PodRR Fre
diyone7 -arch AArch64 -name MP-rlx-rlx-acqrcpc PodWW Rfe Q PodRR Fre
diyone7 -arch AArch64 -name MP-rlx-rel-acqrcsc PodWW L Rfe A PodRR Fre
diyone7 -arch AArch64 -name MP-rlx-rel-acqrcpc PodWW L Rfe Q PodRR Fre

mkdir -p mp-gen
cat <<'EOF' | diyone7 -arch AArch64 -o mp-gen
MP-rlx-fence-rlx: DMB.SYdWW Rfe PodRR Fre
MP-rlx-fence-rlx-fence: DMB.SYdWW Rfe DMB.SYdRR Fre
MP-rlx-rlx-rlx-fence: PodWW Rfe DMB.SYdRR Fre
EOF

For each generated .litmus file, make 3 copies and set:

Uncached:
Prefetch=0:x=F,0:y=F,1:x=F,1:y=F
Cached:
Prefetch=0:x=T,0:y=T,1:x=T,1:y=T
Modified:
Prefetch=0:x=W,0:y=W,1:x=W,1:y=W

Running Litmus Tests

litmus7 -set-libdir "$PWD/litmus/libdir" -o out/MP-rlx-rel-acqrcpc-uncached mp-tests/MP-rlx-rel-acqrcpc-uncached.litmus
make -C out/MP-rlx-rel-acqrcpc-uncached -j"$(nproc)"
sh out/MP-rlx-rel-acqrcpc-uncached/run.sh

Running Hopper Litmus Tests

cd hopper-litmus
./tune-specific.sh tune-mp.txt > output.log 
python3 parse_stats.py output.log
# produces output_parsed.csv

tune-specific.sh runs all the MP variants one after the other in an infinite loop. The litmus testing must be terminated manually using Ctrl+C. Even without terminating the litmus testing, you can run the parse_stats.py to parse whatever data has been collected thus far.

Observing weak behaviours is probabilistic. If you don't observe weak behaviours in the test variants where it is expected, run the testing suite for longer. For some variants, it took several days for the weak behaviours to reveal themselves.

Running Grace-Hopper Litmus Tests

cd gh-litmus
make -C stress_malloc
make -C stress_malloc/reverse
python3 run-all.py

Similar to the Hopper Litmus Tests, the Grace-Hopper litmus tests also run in an infinite loop and need to be terminated manually using Ctrl+C.

Running Read-Modify-Write Tests

cd rmw-tests
make
python3 pingpong_loop.py

Running Value Propagation Tests

cd vp-tests
make all
./bin/cache_invalidation_testing_DATA_SIZE_32.out -P hopper_vp_invalidation_check -F configs/paper_hopper_write_through.yaml -m cuda_malloc
./bin/cache_invalidation_testing_DATA_SIZE_32.out -P hopper_vp_self_invalidation -F configs/paper_hopper_self_invalidation.yaml -m cuda_malloc
./bin/cache_invalidation_testing_DATA_SIZE_32.out -P grace_hopper_cpu_to_gpu -F configs/paper_gh_cpu_to_gpu.yaml -m malloc
./bin/cache_invalidation_testing_DATA_SIZE_32.out -P grace_hopper_gpu_to_cpu -F configs/paper_gh_gpu_to_cpu.yaml -m malloc

The heterogenous tests will only work on NVIDIA Grace-Hopper. On other NVIDIA architectures, you can vary the -m <um|dram> for heterogeneous tests.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
gh-litmus @ ec9d2b7		gh-litmus @ ec9d2b7
grace-litmus @ 867f289		grace-litmus @ 867f289
hopper-litmus @ 2ba808d		hopper-litmus @ 2ba808d
rmw-tests @ ebb00ac		rmw-tests @ ebb00ac
vp-tests @ 82cd5c6		vp-tests @ 82cd5c6
.gitmodules		.gitmodules
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Initialization

CloudLab

Start from Scratch

Clone this Repository

Running Grace Litmus Tests

Generating Litmus Tests

Running Litmus Tests

Running Hopper Litmus Tests

Running Grace-Hopper Litmus Tests

Running Read-Modify-Write Tests

Running Value Propagation Tests

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Initialization

CloudLab

Start from Scratch

Clone this Repository

Running Grace Litmus Tests

Generating Litmus Tests

Running Litmus Tests

Running Hopper Litmus Tests

Running Grace-Hopper Litmus Tests

Running Read-Modify-Write Tests

Running Value Propagation Tests

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages