Hi,
I’m having trouble achieving the same high-resolution results as shown in the README examples. I followed the tutorial steps provided in the README closely and recorded my video using an iPhone 13 Mini. Below are the detailed steps I took:
Steps Followed:
- Uploaded an 8-second video of my hand (light conditions were good).
- Split the video into frames.
- Created
frame_list_test.txt and frame_list_train.txt based on the frames.
- Ran RobustVideoMatting to generate the mask data.
- Ran Mediapipe to extract the keypoint data.
- Ran Depth-Anything-V2 to generate the depth maps.
- Ran InterWild to generate the
mano_params data.
- Created the
cam_param.json file using the following script:
import math
import json
# Frame dimensions
img_width = 720
img_height = 1080
# Principal point (center of the image)
princpt_x = img_width / 2
princpt_y = img_height / 2
# Horizontal Field of View (HFOV) in degrees
HFOV_deg = 73
HFOV_rad = math.radians(HFOV_deg)
# Compute focal length in pixels
focal_length_px = (img_width / 2) / math.tan(HFOV_rad / 2)
# Create camera parameters dictionary
cam_param = {
"focal": [focal_length_px, focal_length_px], # Assuming fx = fy
"princpt": [princpt_x, princpt_y]
}
# Save to cam_param.json
cam_param_path = '/content/UHM/data/Custom/data/YourSubjectName/0/cam_param.json'
with open(cam_param_path, 'w') as f:
json.dump(cam_param, f, indent=4)
- Placed and arranged all the models as per the README instructions.
- Ran the training script. After training completed, I received 10 snapshot files (
snapshot_10.pth through snapshot_99.pth).
- Tested with snapshot 99 using the following command:
%cd /content/UHM/Avatar/main/
!python test.py --subject_id TomerSubject1 --test_epoch 99
- Ran the test script again with the texture flag:
%cd /content/UHM/Avatar/main/
!python test.py --subject_id TomerSubject1 --test_epoch 99 --use_tex
- Finally, ran the test script with the
fit_pose_to_test flag:
%cd /content/UHM/Avatar/main/
!python test.py --subject_id TomerSubject1 --test_epoch 99 --use_tex --fit_pose_to_test
Issue:
Despite following all these steps, the resulting videos/meshes are not as accurate or detailed as I had hoped. I’ve attached all the data I used for reference. Could you please review it and help me figure out what I might be doing wrong? Any tips to achieve better results would be greatly appreciated!
my results + data:
https://drive.google.com/drive/folders/1CFGv7VChYxkTQOsNI_FevL7HMAB9cc6V?usp=sharing
Thanks in advance,
Brio
Hi,
I’m having trouble achieving the same high-resolution results as shown in the README examples. I followed the tutorial steps provided in the README closely and recorded my video using an iPhone 13 Mini. Below are the detailed steps I took:
Steps Followed:
frame_list_test.txtandframe_list_train.txtbased on the frames.mano_paramsdata.cam_param.jsonfile using the following script:snapshot_10.pththroughsnapshot_99.pth).%cd /content/UHM/Avatar/main/ !python test.py --subject_id TomerSubject1 --test_epoch 99%cd /content/UHM/Avatar/main/ !python test.py --subject_id TomerSubject1 --test_epoch 99 --use_texfit_pose_to_testflag:%cd /content/UHM/Avatar/main/ !python test.py --subject_id TomerSubject1 --test_epoch 99 --use_tex --fit_pose_to_testIssue:
Despite following all these steps, the resulting videos/meshes are not as accurate or detailed as I had hoped. I’ve attached all the data I used for reference. Could you please review it and help me figure out what I might be doing wrong? Any tips to achieve better results would be greatly appreciated!
my results + data:
https://drive.google.com/drive/folders/1CFGv7VChYxkTQOsNI_FevL7HMAB9cc6V?usp=sharing
Thanks in advance,
Brio