
Re-allocate Buffer if the incoming eltypes don't match preallocated one #65


Closed
DhairyaLGandhi wants to merge 1 commit

Conversation

@DhairyaLGandhi (Member) commented Apr 8, 2022

Currently, the flat vector is stored inside the Restructure struct, so it assumes that incoming parameters match the eltype of the model. This fails in mixed-mode AD, with structs-of-arrays, etc. To combat that, try to reallocate a buffer that can properly hold the actual new parameters. Note that with mixed precision we also pay for conversion (and therefore allocation) on every operation. cc @ChrisRackauckas. MWE:

using DiffEqFlux, OrdinaryDiffEq, Flux, Test

u0 = Float32[2.0; 0.0]
datasize = 30
tspan = (0.0f0, 1.5f0)

function trueODEfunc(du, u, p, t)
    true_A = [-0.1 2.0; -2.0 -0.1]
    du .= ((u .^ 3)'true_A)'
end
t = range(tspan[1], tspan[2], length=datasize)
prob = ODEProblem(trueODEfunc, u0, tspan)
ode_data = Array(solve(prob, Tsit5(), saveat=t))

model = Chain(x -> x .^ 3,
              Dense(2, 50, tanh),
              Dense(50, 2))
neuralde = NeuralODE(model, tspan, Rodas5(), saveat=t, reltol=1e-7, abstol=1e-9)

function predict_n_ode()
  neuralde(u0)
end
loss_n_ode() = sum(abs2, ode_data .- predict_n_ode())

data = Iterators.repeated((), 10)
opt = ADAM(0.1)
cb = function () #callback function to observe training
   display(loss_n_ode())
end

# Display the ODE with the initial parameter values.
cb()

neuralde = NeuralODE(model, tspan, Rodas5(), saveat=t, reltol=1e-7, abstol=1e-9)
ps = Flux.params(neuralde)
loss1 = loss_n_ode()
Flux.train!(loss_n_ode, ps, data, opt, cb=cb)
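
For reference, the failure reduces to the rebuild receiving a flat vector whose eltype differs from the buffer captured at destructure time. A minimal standalone sketch of that mismatch (hypothetical illustration using Flux's destructure, not part of the MWE above):

using Flux

m = Dense(2, 2)                   # Float32 parameters by default
flat, re = Flux.destructure(m)    # `re` captures a flat Vector{Float32}

p = Float64.(flat) .+ 0.1         # e.g. parameters promoted to Float64 by the solver or AD
m2 = re(p)                        # eltype(p) no longer matches the stored buffer; this is
                                  # exactly the case the re-allocation is meant to handle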

It might be good to update the actual struct as well, since otherwise it would lie about the actual contents of the parameters. This would mean making Restructure mutable.

This still needs tests before merging, and some test failures are expected since we still have to accumulate the gradients properly.
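
A rough sketch of the re-allocation idea (not the actual diff in this PR; the field names and the `_rebuild` helper are illustrative stand-ins):

# Illustrative sketch only; `model`, `flat` and `_rebuild` are hypothetical names.
mutable struct Restructure{M}
    model::M
    flat::AbstractVector   # flat parameter vector, loosely typed so it can be swapped out
end

function (re::Restructure)(ps::AbstractVector)
    # Re-use the stored buffer when the eltypes match, otherwise allocate a fresh one
    buf = eltype(ps) === eltype(re.flat) ? re.flat : similar(ps)
    copyto!(buf, ps)
    re.flat = buf                    # keep the struct honest about its contents (needs mutability)
    return _rebuild(re.model, buf)   # `_rebuild` stands in for the actual reconstruction logic
end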

@DhairyaLGandhi (Member, Author)

Of course doing it out of place is less efficient, but it seems like at least some of the tests show that there were indeed cases of implicit conversion going on. It might also be the more correct implementation, since we can't assume that the types of the primal and the pullback will always match.
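
Concretely, in mixed-mode AD the rebuild can be handed ForwardDiff dual numbers while the stored buffer is Float32, so the two element types genuinely can't be assumed to match. A hypothetical sketch (assuming Flux and ForwardDiff; it only works if the rebuild allocates for the incoming eltype, which is what this PR does):

using Flux, ForwardDiff

m = Dense(2, 2)                  # Float32 parameters
flat, re = Flux.destructure(m)
x = rand(Float32, 2)

# ForwardDiff perturbs `flat` with Dual numbers, so `re` is called with a vector whose
# eltype is a ForwardDiff.Dual rather than Float32; that can't be written into the
# preallocated Float32 buffer in place.
g = ForwardDiff.gradient(p -> sum(abs2, re(p)(x)), flat)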

@ChrisRackauckas (Member)

Fixes SciML/DiffEqFlux.jl#699

@CarloLucibello (Member)

Solved in #66
