[compiler toolkit] Add integration tests for compiler toolkit experiments #1944
wwwjn left a comment:
Thank you!
nit: please add a badge here as well: https://github.com/pytorch/torchtitan/tree/main/torchtitan/experiments#current-experiments
torchtitan/experiments/compiler_toolkit/tests/integration_tests.py
        ],
        "llama3 FSDP+TP+FlexAttn",
        "llama3_fsdp_tp_flexattn",
        ngpu=8,
Sorry, I have one last comment: can we use 4 GPUs with FSDP=2, TP=2? I think it's better to lower the GPU usage for CI, which might help it run faster. Then we could change --ngpu 8 in the yaml file to --ngpu 4.
sure!
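For reference, the suggested 4-GPU layout could look roughly like the sketch below. This is only an illustration under stated assumptions: the `OverrideDefinitions` dataclass here is a hypothetical stand-in (the real test-definition class is not shown in this thread), the two `--parallelism.*_degree` flags are assumed names for the FSDP and TP degrees, and `parallel_world_size` is a helper invented here to check that the degrees multiply to `ngpu`.

```python
from dataclasses import dataclass, field


@dataclass
class OverrideDefinitions:
    """Hypothetical stand-in for the test-definition class used in the PR."""

    override_args: list[list[str]] = field(default_factory=list)
    test_descr: str = "default"
    test_name: str = "default"
    ngpu: int = 4


def parallel_world_size(args: list[str]) -> int:
    """Multiply every '--..._degree N' value found in a list of CLI args."""
    size = 1
    for arg in args:
        tokens = arg.split()
        # Pair each token with the one that follows it: (flag, value).
        for flag, value in zip(tokens, tokens[1:]):
            if flag.startswith("--") and flag.endswith("_degree"):
                size *= int(value)
    return size


# Assumed flag names: FSDP=2 and TP=2, so 2 * 2 = 4 GPUs in total.
test = OverrideDefinitions(
    [
        [
            "--parallelism.data_parallel_shard_degree 2",
            "--parallelism.tensor_parallel_degree 2",
        ],
    ],
    "llama3 FSDP+TP+FlexAttn",
    "llama3_fsdp_tp_flexattn",
    ngpu=4,
)

# Sanity check: the parallelism degrees must account for every GPU.
for args in test.override_args:
    assert parallel_world_size(args) == test.ngpu
```

With this layout the CI command would pass --ngpu 4 instead of --ngpu 8, as suggested above.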
Add integration tests for compiler toolkit experiments on llama3 and deepseek_v3 models to guard against regressions.
Taking over from #1953.
The newly added Compiler Toolkit 8 GPU Integration Tests passed.