Padded batching #78

casper2002casper · 2022-04-19T15:58:58Z

When working with ANNs that output different-sized vectors depending on the input (for example when using GraphNeuralNetworks.jl), it would be useful to be able to batch those outputs into a single array that can be converted to a CuArray for loss computations.
Current:

julia> MLUtils.batch([[1,2],[3,4]])
2×2 Matrix{Int64}:
 1  3
 2  4

julia> MLUtils.batch([[1,2],[3]])
ERROR: DimensionMismatch("mismatch in dimension 1 (expected 2 got 1)")

Feature:

julia> MLUtils.batch([[1,2],[3]], pad = 0)
2×2 Matrix{Int64}:
 1  3
 2  0

darsnack · 2022-04-20T21:41:10Z

As a temporary workaround, you can do

batches = [[1, 2], [3]]
MLUtils.batch(rpad.(batches, 2, 0))

A variation of this that pads each element as we iterate over the batch would probably make a good PR for this feature.
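For illustration, the "pad as we iterate" idea could look roughly like the sketch below. It uses only Base Julia; `padded_batch` is a hypothetical helper name, not an MLUtils function, and it assumes every element is a vector with the same element type as the padding value.

```julia
# Hypothetical helper (not part of MLUtils): copy each vector into a
# preallocated matrix, one column per sample, leaving the padding value
# in the rows that a shorter vector does not fill.
function padded_batch(xs::AbstractVector{<:AbstractVector{T}}, pad::T) where {T}
    maxlen = maximum(length, xs; init = 0)   # longest sample sets the row count
    out = fill(pad, maxlen, length(xs))      # start with an all-padding matrix
    for (j, x) in enumerate(xs)
        out[1:length(x), j] .= x             # overwrite the leading rows of column j
    end
    return out
end

padded_batch([[1, 2], [3]], 0)
```

For the input from the issue this yields a 2×2 matrix whose second column is `3, 0`. Filling a preallocated matrix avoids building a padded copy of every vector before concatenating.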

CarloLucibello mentioned this issue Feb 4, 2025

add batch_sequence #197

Merged

CarloLucibello closed this as completed in #197 Feb 4, 2025
