Cache TLM and Hessian solvers for NLVS blocks #4554

angus-g · 2025-09-10T13:41:55Z

Description

Computing the adjoint on a NonlinearVariationalSolve block has a few optimisations: in effect the LVP and LVS are only initialised once when the block is created, then only the RHS is updated during adjoint evaluation (with some extra fiddling for the non-constant Jacobian case).

The TLM and Hessian should look similar to this, but they currently go through the slower code path on GenericSolveBlock that does assembly and solver creation on every evaluation, making them significantly slower. For very ballpark figures, a simple Stokes box model takes ~20s for a forward or adjoint evaluation, but ~180 for the first Hessian evaluation (involving a couple of compilation phases, I guess) and ~100 for subsequent Hessians. I think the theory dictates that this should be closer to only 2x the adjoint.

This PR is some pretty horrific code-mangling to attempt to bring parity across the different evaluation types. Currently, it sets up the LVP and LVS for TLM and Hessian on a NLVS block initialisation. I think it's fair to assume that in most cases the adjoint LVS is useful, but the TLM and Hessian are less common. Perhaps they can be gated by a flag if you know you'll need them, or initialised lazily (maybe a little more fiddly).

I also have completely hacked my way around the form replacement and constant Jacobian business, and they're almost certainly completely wrong. This probably deserves a more careful eye once the overall approach is a bit more settled. Indeed, maybe the form replacement mechanism could be re-engineered somewhat.

With the changes here, the same box model takes 100s for the first Hessian, then ~65 thereafter. There's still a bunch of time being spent on assembly and form manipulation for the second-order adjoint, and I assume that only needs to happen once. My model also has a couple of ProjectBlock that go through the slow path, but they are a tiny percentage of the overall runtime. It does mean it's a bit complicated to follow the logic through, depending on whether the GenericSolveBlock or NonlinearVariationalSolveBlock implementations of adj/tlm/hess are being used.

angus-g added 2 commits September 10, 2025 22:24

Start to move TLM evaluation into NonlinearVariationalSolveBlock

bbb6817

Start to move Hessian evaluation into NonlinearVariationalSolveBlock

538bd82

angus-g force-pushed the angus-g/cache-nlvs-hessian branch from 0be0568 to 538bd82 Compare September 10, 2025 13:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Cache TLM and Hessian solvers for NLVS blocks #4554

Cache TLM and Hessian solvers for NLVS blocks #4554

Uh oh!

angus-g commented Sep 10, 2025

Uh oh!

Uh oh!

Cache TLM and Hessian solvers for NLVS blocks #4554

Are you sure you want to change the base?

Cache TLM and Hessian solvers for NLVS blocks #4554

Uh oh!

Conversation

angus-g commented Sep 10, 2025

Description

Uh oh!

Uh oh!