Commit
feat: retain only last checkpoint directory
Introduced a new command-line argument, `--keep_last_checkpoint_only`. When this flag is enabled, only the last checkpoint directory is kept: each epoch is written to a directory named `last_epoch`, overwriting the previous epoch's checkpoint.

This reduces the disk space required during model training, which matters most with large models and datasets, where each epoch's checkpoint can consume a substantial amount of disk space. Only the most recent model state is saved, which is often sufficient for training and evaluation, and the file system stays cleaner and more manageable.

Given that we always pick epoch 7 during phase 1 training and do not perform evaluation on each epoch, one might decide it is not worth saving all epochs.

Signed-off-by: Sébastien Han <[email protected]>
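The behavior described in the commit message can be illustrated with a minimal Python sketch. It is not the code from this commit: only the flag name `--keep_last_checkpoint_only` and the directory name `last_epoch` come from the message. The helper names (`build_parser`, `checkpoint_dir`, `save_checkpoint`), the `epoch_{n}` naming used when the flag is off, and the `model.bin` file are assumptions made for illustration.

```python
import argparse
import shutil
from pathlib import Path


def build_parser() -> argparse.ArgumentParser:
    """Parser exposing the flag named in the commit message."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--keep_last_checkpoint_only",
        action="store_true",
        help="Keep only the last checkpoint directory, overwriting earlier epochs.",
    )
    return parser


def checkpoint_dir(output_dir: Path, epoch: int, keep_last_only: bool) -> Path:
    """Pick the directory for this epoch's checkpoint.

    `last_epoch` is the name given in the commit message; the
    `epoch_{n}` fallback is a hypothetical naming scheme.
    """
    name = "last_epoch" if keep_last_only else f"epoch_{epoch}"
    return output_dir / name


def save_checkpoint(
    output_dir: Path, epoch: int, keep_last_only: bool, payload: bytes
) -> Path:
    """Write a checkpoint, replacing the previous one when keep_last_only is set."""
    target = checkpoint_dir(output_dir, epoch, keep_last_only)
    if keep_last_only and target.exists():
        # Remove the previous epoch's files so stale artifacts do not linger.
        shutil.rmtree(target)
    target.mkdir(parents=True, exist_ok=True)
    # `model.bin` stands in for whatever files a real checkpoint contains.
    (target / "model.bin").write_bytes(payload)
    return target
```

With the flag set, calling `save_checkpoint` once per epoch leaves a single `last_epoch` directory holding the most recent state; without it, one directory per epoch accumulates. Deleting the old directory before writing keeps the sketch simple, at the cost of a brief window with no checkpoint on disk if the process is interrupted between the delete and the write.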
Showing 4 changed files with 27 additions and 2 deletions.