Finetuning2 by yaolug · Pull Request #3 · yaolug/openpi-pytorch

yaolug · 2025-08-22T19:29:18Z

No description provided.

kvablack · 2025-08-26T18:05:32Z

+	# Reuse existing dataset + transforms pipeline
+	data_conf = config.data.create(config.assets_dirs, config.model)
+	dataset = _data.create_torch_dataset(data_conf, config.model.action_horizon, config.model)
+	print(f"data_conf: {data_conf}")


nit: don't use print
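
A minimal sketch of what the nit could look like in practice, assuming the standard library `logging` module is the intended replacement (the comment does not say which logger to use). The logger name `finetune_example` and the helper `log_data_config` are hypothetical, not names from this PR.

```python
import logging

# Hypothetical logger name; the training script may already define its own.
logger = logging.getLogger("finetune_example")


def log_data_config(data_conf) -> None:
    """Log the data config at INFO level instead of printing it."""
    # Lazy %-style formatting: the string is only built if INFO is enabled.
    logger.info("data_conf: %s", data_conf)


if __name__ == "__main__":
    logging.basicConfig(level=logging.INFO)
    log_data_config({"repo_id": "demo"})  # placeholder config for illustration
```

Unlike `print`, this output can be filtered by level and silenced on non-main processes in a distributed run.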

kvablack · 2025-08-26T18:06:53Z

+
+	# Parse additional command line arguments for memory optimization
+	parser = argparse.ArgumentParser(add_help=False)
+	parser.add_argument("--resume", action="store_true", default=False,


the config already has resume/overwrite flags

kvablack · 2025-08-26T18:08:28Z

+	return result
+
+
+def _tree_map_multi(func, batch_list):


I honestly think it's easier to just use JAX here as well lol

jax.tree.map(lambda *xs: np.stack([np.asarray(x) for x in xs], axis=0), *batch_list)
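
For reference, a small self-contained sketch of how that one-liner behaves on nested samples. The wrapper name `stack_batch` and the sample structure are made up for illustration; only the `jax.tree.map` call comes from the comment above.

```python
import jax
import numpy as np


def stack_batch(batch_list):
    """Stack a list of identically structured samples leaf by leaf."""
    # jax.tree.map walks all trees in parallel and passes matching leaves
    # to the lambda; the leaves stay NumPy arrays, nothing is moved to device.
    return jax.tree.map(
        lambda *xs: np.stack([np.asarray(x) for x in xs], axis=0),
        *batch_list,
    )


# Two hypothetical samples with a nested dict layout.
samples = [
    {"state": np.zeros(3), "images": {"cam": np.ones((2, 2))}},
    {"state": np.ones(3), "images": {"cam": np.zeros((2, 2))}},
]
batch = stack_batch(samples)
print(batch["state"].shape, batch["images"]["cam"].shape)
```

Each leaf gains a leading batch dimension, so no hand-written recursion over dicts and lists is needed.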

kvablack · 2025-08-26T18:11:52Z

+	# Use full batch size since we removed gradient accumulation
+	effective_batch_size = config.batch_size // (torch.distributed.get_world_size() if use_ddp else 1)
+
+	loader = torch.utils.data.DataLoader(dataset, batch_size=effective_batch_size, shuffle=(sampler is None), sampler=sampler, num_workers=config.num_workers, pin_memory=True, drop_last=True, collate_fn=collate_to_numpy)


maybe dumb question, but why not use the existing openpi dataloader? we can make the JAX-specific things optional (namely, the jax.make_array_from_process_local_data), and add the necessary PyTorch specific things (e.g., custom sampler). other than that, the implementations look fairly similar, and I think it would make things easier to maintain going forward if they were shared.
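
On the batch-size line in the diff above: a rough sketch of the per-process split, assuming `config.batch_size` is the global batch size across all DDP ranks. The helper `per_process_batch_size` is hypothetical, and the divisibility check is an addition; the diff itself uses plain floor division.

```python
def per_process_batch_size(global_batch_size: int, world_size: int = 1) -> int:
    """Split a global batch size evenly across data-parallel processes."""
    if world_size < 1:
        raise ValueError("world_size must be at least 1")
    if global_batch_size % world_size != 0:
        # Floor division would silently shrink the effective global batch.
        raise ValueError(
            f"batch size {global_batch_size} is not divisible by "
            f"world size {world_size}"
        )
    return global_batch_size // world_size


# Example with a hypothetical 16-process run.
print(per_process_batch_size(1024, 16))  # 64
```

A shared dataloader would need the same split, whichever framework consumes the batches.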

kvablack · 2025-08-26T18:16:38Z

    )


+def preprocess_observation_pytorch(


why not put this in the models_pytorch directory somewhere?

kvablack · 2025-08-26T18:22:54Z

+        return False, None
+
+
+def compare_losses(pytorch_loss, jax_loss):


I realize this is all AI-generated but this is a crazy amount of unnecessary code... this whole function could be replaced with np.testing.assert_allclose. I'm fine with having this file but maybe not in the top-level scripts/ directory, would prefer if it was in examples/compare_jax_pytorch.py or something like that.

Will not release this file
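
A sketch of the replacement suggested above, assuming both losses can be converted to NumPy arrays. The tolerances are placeholder values, not numbers from this PR, and would need tuning for float32 or bfloat16 runs.

```python
import numpy as np


def compare_losses(pytorch_loss, jax_loss, rtol=1e-5, atol=1e-6):
    """Raise AssertionError if the two losses differ beyond the tolerances."""
    # assert_allclose checks |actual - desired| <= atol + rtol * |desired|
    # and reports the mismatch in its error message.
    np.testing.assert_allclose(
        np.asarray(pytorch_loss),
        np.asarray(jax_loss),
        rtol=rtol,
        atol=atol,
    )


if __name__ == "__main__":
    compare_losses(0.123456, 0.1234561)  # passes: difference is about 1e-7
```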

yaolug added 8 commits August 22, 2025 00:57

support finetuning

44bc4b2

add preprocess

468e5f2

fixes

eb6ebb1

fix resume

e0d76fb

add gradient checkpointing

642bb4f

fix gradient checkpointing

3bd6bde

batch size 16 working

5789f0d

bs 1024 (2 nodes) working

bcce0ed

yaolug force-pushed the pi05-pytorch branch 2 times, most recently from f83909e to e5d936c Compare August 26, 2025 07:03

yaolug added 10 commits August 26, 2025 00:06

support finetuning

f554bc8

add preprocess

b9b7f6a

fixes

d2aba1c

fix resume

6fa3f8d

add gradient checkpointing

3c9084a

fix gradient checkpointing

e09ee98

batch size 16 working

615149e

bs 1024 (2 nodes) working

4fc7766

fix merge error

25d8af2

clean up pytorch finetuning

44dba74

kvablack reviewed Aug 26, 2025

yaolug added 9 commits August 27, 2025 08:25

try float32

b93f363

reuse jax dataloader

6775bdd

further simplify dataloader

5f3aaba

fix batch size

bde41be

fix batch size again

974c45c

further clean up

a644c95

add missing file

db8b5f7

add documentation, check transformers_replace

c51b061

change pi0 back

ed03aee

yaolug added 6 commits August 28, 2025 03:46

minor changes

b42d3ac

further cleanup

a30c8d6

further cleanup

babea0e

fix a bug

f4a5847

add check.py

b8597b6

fix float32

cde7856

yaolug wants to merge 33 commits into pi05-pytorch from finetuning2
