Implement luz_callback_validation_check #56
base: main
Conversation
```r
if (is.null(ctx$valid_data)) return()
if (self$batches <= 0) return()

ctx$model$eval()
```
We might want to extract this out into a function:
Lines 298 to 300 in 6e0bb77

```r
ctx$model$eval()
ctx$training <- FALSE
ctx$loss <- list()
```
And reuse it here, so we make sure the same state is always set?
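A hedged sketch of what that extracted helper could look like. The function name and its placement are assumptions, not part of the current luz code:

```r
# Hypothetical helper; the name `ctx_set_valid_state` is an assumption.
# It centralizes the state changes that must happen before validation,
# so the main loop and the check callback can't drift apart.
ctx_set_valid_state <- function(ctx) {
  ctx$model$eval()        # put the model in evaluation mode
  ctx$training <- FALSE   # flag consulted by callbacks and metrics
  ctx$loss <- list()      # reset the per-batch loss accumulator
}
```

Both the main validation loop and this callback could then call the one helper.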
```r
input <- list(batch[[1]])
target <- batch[[2]]
pred <- do.call(ctx$model, input)
self$loss <- ctx$model$loss(pred, target)
```
I think in general we would want to do the full validation step, because the errors could be in any of the callbacks, etc., but we would need to take care of the side effects that this might cause. We would need to call `valid_one_step()` and then make sure we can reset the state afterwards. I'm not sure yet what the best way to do that would be.
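One hedged way to contain those side effects would be to snapshot the mutable `ctx` fields before calling `valid_one_step()` and restore them afterwards. Which fields actually need saving is an open question; the pair of fields below is only an illustration:

```r
# Sketch only: the set of ctx fields that valid_one_step() mutates
# would need to be verified against luz; `loss` and `training` are
# assumptions used for illustration.
snapshot_ctx <- function(ctx) {
  list(loss = ctx$loss, training = ctx$training)
}

restore_ctx <- function(ctx, saved) {
  ctx$loss <- saved$loss
  ctx$training <- saved$training
}
```

The check could then wrap `valid_one_step()` between `snapshot_ctx()` and `restore_ctx()`, possibly via `on.exit()` so the state is restored even on error.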
I was thinking about this problem. It feels to me that the safest way would be, in `on_fit_begin()`, to call `fit` again and add a callback that breaks the training loop after `batches` steps, for both training and validation. This way, no side effects would interfere with the actual training loop, but we would still run the full loop, which would detect the other possible bugs.
I think this is possible if the first thing we do in the `ctx` object is to save a list of all the arguments that were passed to `fit`, before we do any kind of manipulation (like we do for callbacks).
To avoid infinite recursion, we could check `ctx$callbacks` to see whether the loop-breaking callback is already present.
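A hedged sketch of what that loop-breaking callback could look like. `luz_callback()` and the `ctx$iter` field exist in luz, but the actual mechanism for interrupting the epoch loop is an open design question; the `rlang::interrupt()` call below is a placeholder assumption, not the real luz break mechanism:

```r
library(luz)

# Sketch of a callback that stops after a fixed number of batches.
# How to cleanly break the loop (e.g. signalling a condition the way
# early stopping does internally) still needs to be decided; the
# interrupt() calls below are placeholders.
luz_callback_break_loop <- luz_callback(
  "break_loop_callback",
  initialize = function(batches = 2) {
    self$batches <- batches
  },
  on_train_batch_end = function() {
    if (ctx$iter >= self$batches)
      rlang::interrupt()  # placeholder for the real break mechanism
  },
  on_valid_batch_end = function() {
    if (ctx$iter >= self$batches)
      rlang::interrupt()  # placeholder for the real break mechanism
  }
)
```

The recursion guard would then check `ctx$callbacks` for an object inheriting from `"break_loop_callback"` before re-entering `fit`.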
Yeah, I've kind of gone in circles here. We want to call the validation callbacks so it's a complete check of the validation loop, but I was worried about any changes in state this might cause. I did consider using `valid_one_batch()`, but at the time decided against it for the reasons above.
A somewhat related question: when `ctx$call_callbacks("on_..._...")` is called and there are multiple callbacks with methods available for that breakpoint, in what order are they called? Default callbacks first, user-supplied callbacks second?
Yes, they are called in that order: default callbacks, then user callbacks.
I think that if we call `fit` again, there would be no interference; the only difference is that it would also test the training loop. But we could also skip that anyway...
I did actually think about calling `fit()` again inside `on_fit_begin()`, but decided against it. You're right, though: it would be a good way to check both the training and validation loops before committing to a full fit.
Thinking again, there could still be some side effects, e.g. the callbacks passed by the user could have side effects outside of the R session (maybe writing to a file or something like that). So maybe we want to call `fit` again with only the default callbacks plus the one that breaks the training loop.
This is not completely ideal, because there would still be callbacks that could fail in the "real" pass. But it sounds like enough, I guess.
I agree with only calling the default callbacks. My original reason for avoiding the callbacks was loggers and other things written to disk, but if we only run the default callbacks we avoid this issue. The function docs can just point out that user-supplied callbacks aren't validated.
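A hedged sketch of what the check's `on_fit_begin()` could then do under this plan. The `ctx$arguments` field (the saved `fit` arguments discussed above) and `luz_callback_break_loop` are assumptions, not existing luz API:

```r
# Sketch: re-run fit() with only the default callbacks plus a
# hypothetical loop-breaking callback. `ctx$arguments` assumes the
# fit() arguments were saved on ctx before any manipulation, as
# proposed above; none of these names are confirmed luz API.
on_fit_begin <- function() {
  # Guard against infinite recursion: skip if the loop-breaking
  # callback is already among ctx$callbacks.
  already <- any(vapply(ctx$callbacks, inherits,
                        logical(1), "break_loop_callback"))
  if (already) return()

  args <- ctx$arguments
  # Drop user callbacks; keep only the batch-limiting one. luz would
  # add its default callbacks again on the inner fit().
  args$callbacks <- list(luz_callback_break_loop(self$batches))
  do.call(fit, c(list(ctx$model), args))
}
```

This keeps user callbacks (and their disk or network side effects) out of the dry run while still exercising the full training and validation loops.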
Related to #5 (comment)
Hi @dfalbel,
This is a first attempt at implementing the validation check callback. It may still need some work so I am submitting this as a draft PR for now. By design, this check only runs a few batches and computes the loss. It does not strictly follow the standard validation loop because it does not call the validation-related callbacks.
We may also want to compute the validation metrics in this check. I will await your thoughts before more changes are made.