Add post_dataloading_processing method to Trainer #1985
base: gh/fegin/24/base
Conversation
We are adding more actions to convert the raw inputs and labels:
1. The new CP can do the input/label/BlockMask sharding in this method.
2. The experimental full-DTensor model can simply override this method without changing too much Trainer code.

This method is extracted from #1857. Making this a standalone PR allows us to continue the two projects above without one blocking the other.

ghstack-source-id: d1882a7
Pull-Request: #1985
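As a rough illustration, here is a minimal sketch of the kind of hook being added, assuming a Trainer whose train step unpacks each batch into inputs, label, extra_inputs, and extra_kwargs. The signature and return values match the diff below; the method body and the call site are assumptions, not the PR's exact implementation:

```python
import torch

class Trainer:
    def post_dataloading_processing(
        self,
        inputs: torch.Tensor,
        label: torch.Tensor,
        extra_inputs: dict,
        extra_kwargs: dict,
    ):
        # Default behavior: return the batch unchanged. Context-parallel
        # sharding of inputs/label/BlockMask, or a full-DTensor conversion,
        # can be implemented by overriding this method in a subclass.
        return inputs, label, extra_inputs, extra_kwargs

    def train_step(self, inputs, label, extra_inputs, extra_kwargs):
        inputs, label, extra_inputs, extra_kwargs = self.post_dataloading_processing(
            inputs, label, extra_inputs, extra_kwargs
        )
        # ... forward, loss, backward, optimizer step ...
```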
        extra_inputs=extra_inputs,
    )

    return inputs, label, extra_inputs, extra_kwargs
I think we should add a docstring for the returns, especially on the difference between extra_inputs and extra_kwargs.
Also not sure if we should just merge inputs and extra_inputs. Not urgent though.
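A hedged sketch of the kind of Returns docstring being requested; the described semantics of each field are assumptions about the intent, not confirmed by the PR:

```python
def post_dataloading_processing(self, inputs, label, extra_inputs, extra_kwargs):
    """Convert raw dataloader outputs before the forward pass.

    Returns:
        inputs: main input tensor passed to the model's forward (assumed).
        label: target tensor consumed by the loss function (assumed).
        extra_inputs: additional model inputs, e.g. auxiliary tensors that
            accompany ``inputs`` (assumed semantics).
        extra_kwargs: keyword arguments forwarded to the model's forward,
            e.g. attention masks or metadata (assumed semantics).
    """
    return inputs, label, extra_inputs, extra_kwargs
```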
    model_parts = self.model_parts
    parallel_dims = self.parallel_dims

    def post_dataloading_processing(
The name accurately describes where this could be called, but we are not putting it right after data loading; rather, we are putting it right before training. That makes sense, because when another library depends on torchtitan training but not on torchtitan data loading, this is the right place for it.
I just wonder if we could find another name that expresses that it happens right before (but mostly as part of) training, e.g. a bad and verbose version would be pre-actual-training-last-minute-data-preparation.
How about pre_training_data_processing or pre_training_data_preparation? Or, if the "last" is really the important message, final_data_preparation?
I was trying to avoid the term "pre-training", which could cause confusion.
I think we can go with post_dataloading_process; that seems unambiguous.
-    def post_dataloading_processing(
+    def post_dataloading_process(
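For the override use case mentioned in the description, a hedged sketch of how an experimental full-DTensor trainer might implement the renamed hook. The mesh access and the sequence-dimension sharding are assumptions for illustration, not torchtitan's actual code:

```python
from torch.distributed.tensor import Shard, distribute_tensor

class FullDTensorTrainer(Trainer):  # Trainer as in the sketch above
    # Uses the post-rename name discussed in this thread.
    def post_dataloading_process(self, inputs, label, extra_inputs, extra_kwargs):
        # Assumed: ParallelDims exposes a world mesh with a named "cp" dim.
        cp_mesh = self.parallel_dims.world_mesh["cp"]
        # Shard inputs and labels along the sequence dimension (dim 1)
        # across the context-parallel mesh before the forward pass.
        inputs = distribute_tensor(inputs, cp_mesh, [Shard(1)])
        label = distribute_tensor(label, cp_mesh, [Shard(1)])
        return inputs, label, extra_inputs, extra_kwargs
```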