v0.6.5
版本发布时间: 2022-05-26 04:35:57
microsoft/DeepSpeed最新发布版本:v0.15.1(2024-09-05 09:30:51)
What's Changed
- GatheredParameters - accept a tuple of params by @stas00 in https://github.com/microsoft/DeepSpeed/pull/1941
- Update partition_parameters.py by @manuelciosici in https://github.com/microsoft/DeepSpeed/pull/1943
- fix step in adam by @szhengac in https://github.com/microsoft/DeepSpeed/pull/1823
- [pipe] prevent deadlock with multiple evals sequence by @stas00 in https://github.com/microsoft/DeepSpeed/pull/1944
- Fairseq support by @jeffra in https://github.com/microsoft/DeepSpeed/pull/1915
- DeepSpeed needs to start cleaning up by @tjruwase in https://github.com/microsoft/DeepSpeed/pull/1947
- trivial fix by @kisseternity in https://github.com/microsoft/DeepSpeed/pull/1954
- Enabling CUDA-graph for the bert-type models by @RezaYazdaniAminabadi in https://github.com/microsoft/DeepSpeed/pull/1952
- Add loss scale guard to avoid inf loop by @Quentin-Anthony in https://github.com/microsoft/DeepSpeed/pull/1958
- [launcher] add option to bypass ssh check by @liamcli in https://github.com/microsoft/DeepSpeed/pull/1957
- Bump nokogiri from 1.13.4 to 1.13.6 in /docs by @dependabot in https://github.com/microsoft/DeepSpeed/pull/1965
- Fix typo in timer.py by @Quentin-Anthony in https://github.com/microsoft/DeepSpeed/pull/1964
- [docs] fix dependabot version issue by @jeffra in https://github.com/microsoft/DeepSpeed/pull/1966
- Don't add curand on rocm by @jeffra in https://github.com/microsoft/DeepSpeed/pull/1968
- Add Unidirectional Sparse Attention Type to BigBird and BSLongformer by @Quentin-Anthony in https://github.com/microsoft/DeepSpeed/pull/1959
- Fix: Sparse tensors not updating by @Dipet in https://github.com/microsoft/DeepSpeed/pull/1914
- Fixing several bugs in the inference-api and the kernels by @RezaYazdaniAminabadi in https://github.com/microsoft/DeepSpeed/pull/1951
New Contributors
- @Quentin-Anthony made their first contribution in https://github.com/microsoft/DeepSpeed/pull/1958
Full Changelog: https://github.com/microsoft/DeepSpeed/compare/v0.6.4...v0.6.5