LArMAE

Experiments in using Masked Autoencoders for pre-traininged a Vision Transformer on EXTBNB data.

We will use this as a baseline for several experiments:

Run3 G1 EXTBNB sample has 34K files with about 15 events each. If we aim for a crop size of 512x512, we will have about 2*4 images from each event.

This leads us to roughly an effective image sample size of 4 million images for each plane, a bit more for the Y-plane.

We use larbys/larcv Version 1 for handling microboone data.

Steps

The repository lucidarains/vit-pytorch has ViT and a MAE wrapper of some sort. Easy peasy!

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
dataprep		dataprep
vit-pytorch @ b932e01		vit-pytorch @ b932e01
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
calc_accuracies.py		calc_accuracies.py
config_train.yaml		config_train.yaml
dump_parameters.py		dump_parameters.py
extract_weights_from_checkpoints.py		extract_weights_from_checkpoints.py
larmae_dataset.py		larmae_dataset.py
larmae_mp_dataloader.py		larmae_mp_dataloader.py
lr_cosine_annealing.py		lr_cosine_annealing.py
mem_utils.py		mem_utils.py
model.py		model.py
plot_cosine_learningrate_schedule.ipynb		plot_cosine_learningrate_schedule.ipynb
plot_loss.py		plot_loss.py
relarmae_dataset.py		relarmae_dataset.py
relarmae_mp_dataloader.py		relarmae_mp_dataloader.py
run_larmae_training_cannon.sh		run_larmae_training_cannon.sh
setenv.sh		setenv.sh
submit_cannon.sh		submit_cannon.sh
sum_mem.py		sum_mem.py
test_loader.yaml		test_loader.yaml
train.py		train.py
utils.py		utils.py
view_larmae_dataset.ipynb		view_larmae_dataset.ipynb
vis_larmae_completion.ipynb		vis_larmae_completion.ipynb