Conversation

@dimapihtar
Contributor

No description provided.

Signed-off-by: dimapihtar <[email protected]>
@copy-pr-bot

copy-pr-bot bot commented Sep 26, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@dimapihtar
Contributor Author

/ok to test 28e63cf

@dimapihtar
Contributor Author

/ok to test 8b8944c

Contributor

@maanug-nv left a comment

Notebook is great, easy to follow. Thanks!

@maanug-nv
Contributor

/ok to test fcb72d8

@copy-pr-bot

copy-pr-bot bot commented Oct 3, 2025

/ok to test fcb72d8

@maanug-nv, there was an error processing your request: E2

See the following link for more information: https://docs.gha-runners.nvidia.com/cpr/e/2/

@maanug-nv
Contributor

/ok to test 59e4c17

@dimapihtar
Contributor Author

/ok to test 59e4c17

@dimapihtar merged commit 45ed38a into main on Oct 6, 2025
43 of 46 checks passed
@dimapihtar deleted the dpykhtar/data_preproc_scripts branch on October 6, 2025, 08:30
paul-gibbons pushed a commit to paul-gibbons/Megatron-Bridge that referenced this pull request Oct 29, 2025
* add data prep scripts

Signed-off-by: dimapihtar <[email protected]>

* fix style

Signed-off-by: dimapihtar <[email protected]>

* add readme

Signed-off-by: dimapihtar <[email protected]>

* fix style

Signed-off-by: dimapihtar <[email protected]>

* fix readme

Signed-off-by: dimapihtar <[email protected]>

* fix style

Signed-off-by: dimapihtar <[email protected]>

* add decompress section

Signed-off-by: dimapihtar <[email protected]>

* add merging stage

Signed-off-by: dimapihtar <[email protected]>

* fix style

Signed-off-by: dimapihtar <[email protected]>

* minor fix

Signed-off-by: dimapihtar <[email protected]>

* code changes

Signed-off-by: dimapihtar <[email protected]>

* add data shuffling

Signed-off-by: dimapihtar <[email protected]>

* fix style

Signed-off-by: dimapihtar <[email protected]>

* add data shuffle section

Signed-off-by: dimapihtar <[email protected]>

* fix dataprep section

Signed-off-by: dimapihtar <[email protected]>

* fix merge section

Signed-off-by: dimapihtar <[email protected]>

* fix merge section

Signed-off-by: dimapihtar <[email protected]>

* fix data shuffle section

Signed-off-by: dimapihtar <[email protected]>

* fix data prep section

Signed-off-by: dimapihtar <[email protected]>

* fix bash script

Signed-off-by: dimapihtar <[email protected]>

* fix data prep section

Signed-off-by: dimapihtar <[email protected]>

* fix style

Signed-off-by: dimapihtar <[email protected]>

* add docstrings

Signed-off-by: dimapihtar <[email protected]>

* fix style

Signed-off-by: dimapihtar <[email protected]>

* fix style

Signed-off-by: dimapihtar <[email protected]>

* add notebook

Signed-off-by: dimapihtar <[email protected]>

* clear cell outputs

Signed-off-by: dimapihtar <[email protected]>

* add stages

Signed-off-by: dimapihtar <[email protected]>

* add dataprep stage

Signed-off-by: dimapihtar <[email protected]>

* add description

Signed-off-by: dimapihtar <[email protected]>

* fix docstring

Signed-off-by: dimapihtar <[email protected]>

---------

Signed-off-by: dimapihtar <[email protected]>
Co-authored-by: Maanu Grover <[email protected]>
Signed-off-by: Paul Gibbons <[email protected]>
nv-mollys pushed a commit that referenced this pull request Oct 31, 2025
Signed-off-by: dimapihtar <[email protected]>
Co-authored-by: Maanu Grover <[email protected]>
Signed-off-by: mollys <[email protected]>
3 participants