Prox-E: Fine-Grained 3D Shape Editing via Primitive-Based Abstractions

Etai Sella*¹, Hao Phung*², Nitay Amiel³, Or Litany³, Or Patashnik¹, Hadar Averbuch-Elor²

¹ Tel Aviv University ² Cornell University ³ Technion - Israel Institute of Technology

This is the official PyTorch implementation of Prox-E.

📄 Abstract

Text-based 2D image editing models have recently reached an impressive level of maturity, motivating a growing body of work that heavily depends on these models to drive 3D edits. While effective for appearance-based modifications, such 2D-centric 3D editing pipelines often struggle with fine-grained 3D editing, where localized structural changes must be applied while strictly preserving an object's overall identity. To address this limitation, we propose Prox-E, a training-free framework that enables fine-grained 3D control through an explicit, primitive-based geometric abstraction. Our framework first abstracts an input 3D shape into a compact set of geometric primitives. A pretrained vision-language model (VLM) then edits this abstraction to specify primitive-level changes. These structural edits are subsequently used to guide a 3D generative model, enabling fine-grained, localized modifications while preserving unchanged regions of the original shape. Through extensive experiments, we demonstrate that our method consistently balances identity preservation, shape quality, and instruction fidelity more effectively than various existing approaches, including 2D-based 3D editors and training-based methods.

🚀 Getting Started

Cloning the repository

Clone the repo, then initialize the submodules (needed only if your checkout stores prox_e/submodules as git submodules):

git clone https://github.com/etaisella/Prox-E.git
cd Prox-E
git submodule update --init --recursive

Environment Setup

Create the environment:

bash scripts/setup_environment.sh
conda activate prox-e

The setup script creates a Python 3.11 conda environment, installs PyTorch and the remaining Python dependencies, installs the two source-built rasterization packages, and downloads SuperDec checkpoints if they are missing.

TRELLIS

Download the TRELLIS-image-large model, and add the following entries to TRELLIS-image-large/pipeline.json:

{
    "sparse_structure_encoder": "ckpts/ss_enc_conv3d_16l8_fp16",
    "slat_encoder": "ckpts/slat_enc_swin8_B_64l8_fp16"
}

Download the TRELLIS-text-large model, and add the following entries to TRELLIS-text-large/pipeline.json:

{
    "sparse_structure_encoder": "path/to/TRELLIS-image-large/ckpts/ss_enc_conv3d_16l8_fp16",
    "sparse_structure_decoder": "path/to/TRELLIS-image-large/ckpts/ss_dec_conv3d_16l8_fp16",
    "slat_encoder": "path/to/TRELLIS-image-large/ckpts/slat_enc_swin8_B_64l8_fp16",
    "slat_decoder_gs": "path/to/TRELLIS-image-large/ckpts/slat_dec_gs_swin8_B_64l8gs32_fp16",
    "slat_decoder_rf": "path/to/TRELLIS-image-large/ckpts/slat_dec_rf_swin8_B_64l8r16_fp16",
    "slat_decoder_mesh": "path/to/TRELLIS-image-large/ckpts/slat_dec_mesh_swin8_B_64l8m256c_fp16"
}
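The pipeline.json edits above can also be scripted. The following is a minimal sketch, not part of the repository: the helper name add_pipeline_entries is hypothetical, and it assumes that model paths live in a "models" mapping under "args" in pipeline.json, falling back to the top level of the file if that mapping is absent. Check your downloaded pipeline.json before relying on it.

```python
import json
from pathlib import Path


def add_pipeline_entries(pipeline_json: Path, entries: dict) -> dict:
    """Merge checkpoint entries into a pipeline.json file and return the updated config."""
    config = json.loads(pipeline_json.read_text())
    # Assumption: model paths live under config["args"]["models"]; if that
    # mapping is absent, the entries are written at the top level instead.
    models = config.get("args", {}).get("models")
    target = models if isinstance(models, dict) else config
    target.update(entries)
    pipeline_json.write_text(json.dumps(config, indent=4) + "\n")
    return config


# Entries for TRELLIS-image-large, copied from the instructions above.
IMAGE_LARGE_ENTRIES = {
    "sparse_structure_encoder": "ckpts/ss_enc_conv3d_16l8_fp16",
    "slat_encoder": "ckpts/slat_enc_swin8_B_64l8_fp16",
}
```

For example, add_pipeline_entries(Path("TRELLIS-image-large/pipeline.json"), IMAGE_LARGE_ENTRIES) would add the two encoder entries while leaving existing keys untouched.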

Blender

The code expects Blender for utility renders. It uses BLENDER_PATH if set, otherwise blender on PATH. On a fresh machine, install Blender and make sure blender is on PATH; otherwise set:

export BLENDER_PATH=/path/to/blender
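The lookup order described above (BLENDER_PATH first, then blender on PATH) can be sketched as follows. This is an illustration of the documented behavior, not the repository's actual code; the function name resolve_blender is hypothetical.

```python
import os
import shutil


def resolve_blender(env=None):
    """Return the Blender executable path, preferring BLENDER_PATH over PATH."""
    env = os.environ if env is None else env
    explicit = env.get("BLENDER_PATH")
    if explicit:
        return explicit
    # Fall back to searching the directories listed in PATH.
    found = shutil.which("blender", path=env.get("PATH"))
    if found is None:
        raise FileNotFoundError(
            "Blender not found: set BLENDER_PATH or add blender to PATH"
        )
    return found
```

Running resolve_blender() before a long job is a cheap way to confirm that Blender is discoverable on a fresh machine.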

VLM Setup

By default, Prox-E uses Gemini as the VLM backbone in the proxy editing and prompt parsing stages. Set your Google API key before running the default pipeline:

export GOOGLE_API_KEY=<your-key>

Prox-E also supports GPT as the VLM backend. To use it, set your OpenAI API key and pass --vlm gpt:

export OPENAI_API_KEY=<your-key>
python inference.py ... --vlm gpt

Qwen is also supported for local prompt parsing and local VLM proxy editing. The VLM stage loads a local Qwen/Qwen3-VL-<size>-Instruct checkpoint, so no Qwen API key is required for --vlm qwen; choose the checkpoint size with --qwen_model_size if needed:

python inference.py ... --vlm qwen --qwen_model_size 4B

NOTE: In our testing, the local Qwen models significantly underperformed the high-end GPT and Gemini models in proxy editing.
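The key requirements per backend can be summarized in a small pre-flight check. This is a sketch under assumptions: the helper missing_api_key is hypothetical, and the backend name "gemini" is assumed for the default (only gpt and qwen appear as --vlm values above).

```python
import os

# Environment variable each backend needs; None means no API key (local model).
# Assumption: "gemini" is the name of the default backend.
REQUIRED_KEY = {
    "gemini": "GOOGLE_API_KEY",
    "gpt": "OPENAI_API_KEY",
    "qwen": None,
}


def missing_api_key(vlm, env=None):
    """Return the name of the unset key variable for `vlm`, or None if nothing is missing."""
    env = os.environ if env is None else env
    if vlm not in REQUIRED_KEY:
        raise ValueError(f"unknown VLM backend: {vlm!r}")
    key = REQUIRED_KEY[vlm]
    if key is not None and not env.get(key):
        return key
    return None
```

For instance, missing_api_key("gpt") returns "OPENAI_API_KEY" when that variable is unset or empty, and None once it is exported.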

Additional requirements

The SuperDec checkpoints are expected under prox_e/submodules/superdec/checkpoints/normalized/. If they are missing, run:

cd prox_e/submodules/superdec
bash scripts/download_checkpoints.sh
cd ../../..
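To check whether the download step above is needed, a short script can look for the checkpoint directory. This is a rough sketch: the function name is hypothetical, and it assumes that any file in the directory counts as "present", since the expected checkpoint filenames are not listed here.

```python
from pathlib import Path

# Directory named in the instructions above, relative to the repository root.
SUPERDEC_CKPT_DIR = Path("prox_e/submodules/superdec/checkpoints/normalized")


def superdec_checkpoints_present(repo_root="."):
    """Return True if the SuperDec checkpoint directory exists and is non-empty."""
    ckpt_dir = Path(repo_root) / SUPERDEC_CKPT_DIR
    # Assumption: a non-empty directory means the checkpoints were downloaded.
    return ckpt_dir.is_dir() and any(ckpt_dir.iterdir())
```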

🎮 Running the Demos

We include a demo edit example for each dataset used in our work:

ShapeNet:

python inference.py \
  --input_mesh demo/shapenet/chair/model_normalized.obj \
  --category chair \
  --edit_instruction "make the chair 1.5 times wider"

Edit3D-Bench:

python inference.py \
  --input_mesh demo/edit3dbench/elephant/model.glb \
  --category elephant \
  --edit_instruction "make the elephant wear a red hat" \
  --orientation_index 15

Toys4K:

python inference.py \
  --input_mesh demo/toys4k/sheep/model.glb \
  --category sheep \
  --edit_instruction "turn the sheep's head 30 degrees to the right"

Final results are saved in the outputs/ folder.

🛋️ Running Prox-E on custom shapes

For a custom mesh, set --input_mesh to the mesh file, --category to the object class, and --edit_instruction to the requested edit:

python inference.py \
  --input_mesh /path/to/model.glb \
  --category lamp \
  --edit_instruction "make the lamp shade wider"

If the mesh orientation is wrong, render all supported input orientations:

python scripts/orientation_sweep.py --input_mesh /path/to/model.glb

Open the generated orientation_sweep_overview.png, pick the best index, then rerun inference with it:

python inference.py \
  --input_mesh /path/to/model.glb \
  --category lamp \
  --edit_instruction "make the lamp shade wider" \
  --orientation_index 12

If you change the orientation for a mesh you already processed, use a fresh --output_folder so cached abstractions are not reused.
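When scripting several edits, the command line can be assembled programmatically. The sketch below uses only the flags shown in this README; the helper name build_inference_command and the output folder name are hypothetical.

```python
import shlex
import sys


def build_inference_command(input_mesh, category, edit_instruction,
                            orientation_index=None, output_folder=None):
    """Assemble the inference.py argument list from the flags shown above."""
    cmd = [sys.executable, "inference.py",
           "--input_mesh", str(input_mesh),
           "--category", category,
           "--edit_instruction", edit_instruction]
    if orientation_index is not None:
        cmd += ["--orientation_index", str(orientation_index)]
    if output_folder is not None:
        # A fresh folder avoids reusing abstractions cached for another orientation.
        cmd += ["--output_folder", str(output_folder)]
    return cmd


cmd = build_inference_command(
    "/path/to/model.glb", "lamp", "make the lamp shade wider",
    orientation_index=12,
    output_folder="outputs/lamp_orientation_12",  # hypothetical folder name
)
print(shlex.join(cmd))  # pass `cmd` to subprocess.run(cmd, check=True) to execute
```

Building the command as a list keeps the edit instruction as one argument regardless of quoting, and naming the output folder after the orientation index makes it harder to reuse a stale cache by accident.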

✏️ BibTeX

If you find our work useful in your research, please consider citing:

@misc{sella2026proxefinegrained3dshape,
 title={Prox-E: Fine-Grained 3D Shape Editing via Primitive-Based Abstractions},
 author={Etai Sella and Hao Phung and Nitay Amiel and Or Litany and Or Patashnik and Hadar Averbuch-Elor},
 year={2026},
 eprint={2604.23774},
 archivePrefix={arXiv},
 primaryClass={cs.GR},
 url={https://arxiv.org/abs/2604.23774},
}

🙏 Acknowledgements

This code builds upon the VoxHammer, SuperDec, and TRELLIS repositories. We thank their creators for their great work.
