Releases: aws/sagemaker-training-toolkit
v4.4.7
Bug Fixes and Other Changes
- Revert SMDDP collectives feature from smdataparallel runner
v4.4.6
prepare release v4.4.6
v4.4.5
prepare release v4.4.5
v4.4.4
Bug Fixes and Other Changes
- Update libraries for SMDDP collectives validation
v4.4.3
Bug Fixes and Other Changes
- Upgrade protobuf to prevent conflicts with smdebugger
v4.4.2
prepare release v4.4.2
v4.4.1
Bug Fixes and Other Changes
- Add support for p4de instances; set the FI_EFA_USE_DEVICE_RDMA flag only on p4d and p4de instances
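
The instance-type gating described in v4.4.1 can be pictured with a minimal sketch. The helper name `efa_rdma_env`, the flag value `"1"`, and the `ml.<family>.<size>` parsing are assumptions made for illustration; this is not the toolkit's actual code.

```python
def efa_rdma_env(instance_type: str) -> dict:
    """Hypothetical helper: extra environment variables for an instance type."""
    # Assumption: SageMaker instance types look like "ml.p4d.24xlarge".
    name = instance_type[3:] if instance_type.startswith("ml.") else instance_type
    family = name.split(".")[0]

    # Set the flag only for the p4d and p4de families, as the release note describes.
    if family in ("p4d", "p4de"):
        # Assumption: "1" is the value used to enable the flag.
        return {"FI_EFA_USE_DEVICE_RDMA": "1"}
    return {}
```

A launcher could merge the returned dictionary into the process environment before starting workers, leaving other instance families untouched.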
v4.4.0
Features
- Integrate SMDDP collectives into smdataparallel runner
v4.3.2
Bug Fixes and Other Changes
- Add general exception to filter
v4.3.1
Bug Fixes and Other Changes
- Integrate upcoming dataparallel change to modelparallel
- Add unit tests for torchrun launcher and collections package DeprecationWarning