Skip to content

Latest commit

 

History

History
31 lines (21 loc) · 1.08 KB

NEWS.md

File metadata and controls

31 lines (21 loc) · 1.08 KB

Tsunami Release Notes

v0.3.0

Breaking changes:

  • fit! returns nothing instead of a FitState object. The FitState object can be accessed via trainer.fit_state.
  • on_before_pullback has been removed. Use on_train_batch_start instead.
  • on_*_batch_start now receives the batch on device.
  • Some of the hooks now take more inputs.

Highlights:

  • Now Tsunami uses MLDataDevices.DeviceIterator to wrap dataloaders for more efficient device memory management.

  • training_step, validation_step, and test_step can now return a named tuple for flebility. One of the fields of the named tuple should be loss which is used to compute the loss value.

v0.2.0 - 2025-01-03

Breaking changes:

  • Devices indexing now starts from 1, as in MLDataDevices.gpu_device.

Highlights:

  • Updated to Flux v0.16.
  • Models (i.e. subtypes of FluxModule) are now not required to be mutable.
  • Tsunami.fit is deprecated in favor of Tsunami.fit!.
  • Added Trainer option to use Enzyme for automatic differentiation.
  • Improved test infrastructure.
  • Improved documentation.