Skip to content

Commit

Permalink
Fix[critical] AdamW epsilon default value
Browse files Browse the repository at this point in the history
  • Loading branch information
okoge-kaz authored Jul 16, 2024
1 parent ade3a65 commit 93f317e
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/llama_recipes/arguments.py
Original file line number Diff line number Diff line change
Expand Up @@ -306,7 +306,7 @@ def _add_regularization_args(parser: argparse.ArgumentParser) -> argparse.Argume
help='Second coefficient for computing running averages of gradient and its square'
)
group.add_argument(
'--adam-eps', type=float, default=1e-06,
'--adam-eps', type=float, default=1e-08,
help='Term added to the denominator to improve numerical stability'
)

Expand Down

0 comments on commit 93f317e

Please sign in to comment.