This repository provides code for 3D point cloud processing using OpenPCDet.
It supports flexible backbones like PillarBackBone8x and PointNet2Backbone, with support for frozen pretrained encoders for multimodal fusion.
- Image Encoder: Frozen DINOv2 ViT-B
- Point Cloud Encoder: Lightweight pretrained model (Download here)
# Create and activate a Conda environment
conda create -n 3ddet python=3.8 -y
conda activate 3ddet
# Install PyTorch 1.11.0 with CUDA 11.5
pip install torch==1.11.0 torchvision==0.12.0 torchaudio==0.11.0 --extra-index-url https://download.pytorch.org/whl/cu115
# Install spconv (for sparse convolution support)
pip install spconv-cu113
# Install additional dependencies
pip install -r requirements.txt
# Install OpenPCDet
Follow the official instructions here:
# https://github.com/open-mmlab/OpenPCDet/blob/master/docs/INSTALL.mdmkdir -p checkpoint pip install gdown gdown https://drive.google.com/uc?id=1wMxWTpU1qUoY3DsCH31WJmvJxcjFXKlm -O checkpoint/pointpillar_gdrive.pth
- Run the following notebook to create training and validation splits:
../data_preprocessing/create_train_val_split.ipynb
./launcher/dist_launch.sh train.py /home/as2114/code/3DBB/config/pointpillar.yaml 1
./launcher/dist_launch.sh train.py /home/as2114/code/3DBB/config/pointcoloring.yaml 1
python inference.py --cfg /home/as2114/code/3DBB/config/pointpillar.yaml --resume checkpoint.pth