add precompute scale in README

pytorch-labs · Jul 10, 2024 · ba085e5
1 parent fa2f08a
Showing 1 changed file with 12 additions and 1 deletion.
diff --git a/README.md b/README.md
@@ -51,7 +51,18 @@ model = FSDP(model, use_orig_params=True)
 # optional: enable torch.compile for improved performance
 m = torch.compile(m)
 
-# train/finetune (not shown)
+# toy training loop
+for _ in range(N_ITER):
+    optimizer.zero_grad()
+    y = m(x)
+    y.sum().backward()
+    optimizer.step()
+
+    # specific to fsdp2 + float8 with dynamic scaling
+    # this method is optional but is highly recommended for performance
+    # it calculates scales for all parameters in a single all-reduce
+    precompute_float8_scale_for_fsdp(model)
+
 ```
 
 ## float8 linear with delayed scaling
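
The comment added in this commit says the scales for all parameters are calculated in a single all-reduce. The sketch below illustrates that batching idea in plain Python and is not the library's implementation. Everything in it is an assumption made for illustration: the helper names, the simulated `all_reduce_max`, the toy shard layout, and the scale formula `E4M3_MAX / amax` (448.0 is the largest finite value of `torch.float8_e4m3fn`; the `EPS` floor is a guess).

```python
# Illustrative only: simulates ranks with nested lists, no torch or process group.
E4M3_MAX = 448.0  # largest finite value representable in float8 e4m3fn
EPS = 1e-12       # assumed floor to avoid dividing by zero


def all_reduce_max(per_rank_vectors, counter):
    """Simulated MAX all-reduce: element-wise max across ranks, counts calls."""
    counter["calls"] += 1
    return [max(column) for column in zip(*per_rank_vectors)]


def local_amaxes(shards):
    """Absolute maximum of each local parameter shard on one rank."""
    return [max(abs(v) for v in shard) for shard in shards]


def scales_one_collective_per_param(rank_shards, counter):
    """Baseline: one all-reduce per parameter."""
    scales = []
    for p in range(len(rank_shards[0])):
        per_rank = [[local_amaxes(shards)[p]] for shards in rank_shards]
        (amax,) = all_reduce_max(per_rank, counter)
        scales.append(E4M3_MAX / max(amax, EPS))
    return scales


def scales_single_collective(rank_shards, counter):
    """Batched: pack every local amax into one vector, reduce it once."""
    per_rank = [local_amaxes(shards) for shards in rank_shards]
    amaxes = all_reduce_max(per_rank, counter)
    return [E4M3_MAX / max(a, EPS) for a in amaxes]


# Two simulated ranks, each holding a shard of three parameters.
rank_shards = [
    [[0.5, -2.0], [0.1, 0.25], [7.0, -1.0]],   # rank 0
    [[1.0, 3.5], [-0.5, 0.3], [-14.0, 2.0]],   # rank 1
]

naive_counter = {"calls": 0}
batched_counter = {"calls": 0}
naive = scales_one_collective_per_param(rank_shards, naive_counter)
batched = scales_single_collective(rank_shards, batched_counter)

print(naive_counter["calls"], batched_counter["calls"])  # 3 1
print(batched)  # [128.0, 896.0, 32.0]
```

Both paths give the same scales. The batched version issues one collective regardless of how many parameters there are, which is presumably why the README recommends the call for performance.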