adding optimizer overlap for FSDP #203
Conversation
src/llama_recipes/finetuning.py
Outdated
print(f"setting up optimizer overlap") | ||
optim_kwargs = {"lr": train_config.lr} | ||
_apply_optimizer_in_backward( | ||
optimizer_class=optim.AdamW, |
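For readers outside the diff context, here is a minimal, self-contained sketch of what this experimental PyTorch hook does. The toy `nn.Linear` model and the hard-coded learning rate are stand-ins for illustration, not code from this PR:

```python
import torch.nn as nn
import torch.optim as optim
from torch.distributed.optim import _apply_optimizer_in_backward

model = nn.Linear(16, 16)  # stand-in for the FSDP-wrapped model in finetuning.py

# Register a per-parameter AdamW step that runs during the backward pass, so each
# gradient can be applied (and released) as soon as it has been accumulated.
_apply_optimizer_in_backward(
    optimizer_class=optim.AdamW,
    params=model.parameters(),
    optimizer_kwargs={"lr": 1e-4},  # mirrors optim_kwargs = {"lr": train_config.lr}
)
```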
If the optimizer_in_backward_available flag is set and the user has selected AnyPrecisionAdamW, it would be good to add that case as well, unless there is a restriction on which optimizers support this feature.
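Not part of the PR as reviewed here, just a hedged sketch of how such a dispatch could look; the config field names and the AnyPrecisionAdamW import path are assumptions, and `model`, `train_config`, and `fsdp_config` come from the surrounding finetuning script:

```python
import torch.optim as optim
from torch.distributed.optim import _apply_optimizer_in_backward
from llama_recipes.policies import AnyPrecisionAdamW  # import path assumed

# Assumed flag names: pick the optimizer class the user configured and reuse
# the same optimizer-in-backward registration for either choice.
optimizer_class = (
    AnyPrecisionAdamW if fsdp_config.optimizer == "anyprecision" else optim.AdamW
)
_apply_optimizer_in_backward(
    optimizer_class=optimizer_class,
    params=model.parameters(),
    optimizer_kwargs={"lr": train_config.lr},
)
```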
Added for AnyPrecision as well; per my test it works for AnyPrecision too.
@HamidShojanazeri Thanks for this PR. Left a few comments. It would be great to capture a memory profile as well.
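For anyone reproducing the comparison, one generic way to capture such a profile is with the built-in torch.cuda counters; this is a sketch, not the instrumentation used for the numbers and gists below:

```python
import torch

torch.cuda.reset_peak_memory_stats()

# ... run a few training steps with/without optimizer overlap ...

stats = torch.cuda.memory_stats()
gib = 2**30
print(f"max reserved : {torch.cuda.max_memory_reserved() / gib:.2f} GiB")
print(f"max allocated: {torch.cuda.max_memory_allocated() / gib:.2f} GiB")
print(f"peak active  : {stats['active_bytes.all.peak'] / gib:.2f} GiB")
```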
What does this PR do?
This PR adds optimizer overlap, which brings additional memory savings by fusing the gradient calculation and parameter update steps (a short before/after sketch follows the numbers below).
For the 7B model:
- max reserved memory saving: 7%
- allocated memory saving: 4%
- active memory saving: 4%
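To make the "fused gradient calculation and parameter update" concrete, here is a rough before/after of the training step; the `model`, `batch`, and `optimizer` names are illustrative, not the repo's exact loop:

```python
# Before: every parameter's gradient stays allocated until the optimizer
# step runs after the whole backward pass has finished.
loss = model(batch).mean()
loss.backward()
optimizer.step()
optimizer.zero_grad()

# After _apply_optimizer_in_backward has been registered: each parameter is
# updated inside backward and its gradient can be freed right away, which is
# where the reserved/allocated/active savings above come from.
loss = model(batch).mean()
loss.backward()  # no separate optimizer.step() / zero_grad() call
```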
Feature/Issue validation/testing
Please describe the tests that you ran to verify your changes and summarize the relevant results. Provide instructions so the results can be reproduced.
Please also list any relevant details of your test configuration.
Test A without optimizer_overlap
https://gist.github.com/HamidShojanazeri/08ee3d23bdb0fa60466071dee1efda1f
Test B with optimizer_overlap
https://gist.github.com/HamidShojanazeri/3d1147012e9db130dd7cebf75d3caa64
Logs with/without AnyPrecision
Logs with AdamW
Thanks for contributing 🎉!