[WIP] made whole adaptive workflow jax/jit #400
Conversation
Merging in recent changes from upstream.
Merging jit changes into pymbar4.
Made adaptive for pymbar jittable.
import jax
from jax.scipy.special import logsumexp
from jax.ops import index_update, index
from jax.config import config; config.update("jax_enable_x64", True)
This will slow things down dramatically, especially on the GPU. Do you really need 64-bit precision?
Probably yes - the algorithm doesn't seem to converge well with 32-bit floats. But I can poke around and see if I can isolate the particular problems in convergence.
Still need to work on this . . .
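For reference, a quick standalone check of which precision the arrays actually carry when testing convergence; nothing here is pymbar code, and x64 is left at its default (disabled) for the comparison.

import numpy as np
import jax.numpy as jnp

u_kn = np.random.rand(5, 100)       # float64 on the NumPy side
print(jnp.asarray(u_kn).dtype)      # float32: JAX silently downcasts when jax_enable_x64 is off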
jit_core_adaptive = jax.jit(core_adaptive, static_argnums=(0, 1, 2, 3, 4, 5))
When you call jax.jit, the first invocation will be exceedingly slow because it's compiling the JIT kernels. So if you're benchmarking, you should call this more than once to amortize out the jit time.
Furthermore, if the arguments have shapes that contain (None,) in one of the dimensions, a recompilation is required for each specialization of (None,) into a known shape, due to an XLA requirement.
Ah, this explains why the partial jitification works better: it ends up calling the different compiled functions multiple times, though only 7-8 times in many cases.
I will test running mbar initialization several times with different data sets in the same script to see if that helps.
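For reference, a minimal sketch of how the compile cost can be amortized when benchmarking; the step function below is a placeholder, not the pymbar kernel.

import time
import jax
import jax.numpy as jnp

@jax.jit
def step(f_k, u_kn):
    # Placeholder body standing in for one adaptive update.
    return f_k - 0.01 * jnp.sum(u_kn, axis=1)

f_k, u_kn = jnp.zeros(10), jnp.ones((10, 1000))

step(f_k, u_kn).block_until_ready()          # first call pays the XLA compile cost

start = time.perf_counter()
for _ in range(100):
    f_k = step(f_k, u_kn)
f_k.block_until_ready()                      # flush async dispatch before reading the clock
print("per-call time:", (time.perf_counter() - start) / 100)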
I also noticed you're declaring every arg to be static, which means that unless you're passing in the exact same args, you'll be triggering a recompilation. Why did you have to declare them to be static to start with?
U_kn probably shouldn't be static if you're trying to pass in results from different runs?
I had to declare it static because otherwise, gnorm_sci and gnorm_nr end up as traced arrays instead of concrete arrays (since they are functions of traced arrays), and jit refuses to compile the conditional comparing the two, which decides the branch to take, unless they are declared static.
@proteneer had a good suggestion for handling the conditional:
import jax
import jax.numpy as jnp

# can't be JIT'd
def foo(a, b, cond_a, cond_b):
    if cond_a < cond_b:
        return a+b
    else:
        return a-b

# foo_jit = jax.jit(foo) # fails
print(foo(0., 1., 2., 3.))

# can be JIT'd
def bar(a, b, cond_a, cond_b):
    return jnp.where(cond_a < cond_b, a+b, a-b)

bar_jit = jax.jit(bar)
print(bar_jit(0., 1., 2., 3.))
So, if we accelerate just the inner loop, then we can pull the conditional out of the accelerated code with little loss of speed: all of the conditionals just compare floats and assign values, so there is little to gain from accelerating them.
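A minimal sketch of that split, with hypothetical placeholders standing in for the real self-consistent and Newton-Raphson updates and for the gradient norm (none of this is the actual pymbar code):

import jax
import jax.numpy as jnp

@jax.jit
def sci_step(f_k, u_kn, N_k):
    return f_k - 0.1 * jnp.mean(u_kn, axis=1)    # placeholder self-consistent update

@jax.jit
def nr_step(f_k, u_kn, N_k):
    return f_k - 0.01 * jnp.sum(u_kn, axis=1)    # placeholder Newton-Raphson update

def gradient_norm(f_k, u_kn, N_k):
    return float(jnp.linalg.norm(f_k))           # placeholder gnorm; float() makes it concrete

def adaptive_choice(f_k, u_kn, N_k):
    f_sci = sci_step(f_k, u_kn, N_k)
    f_nr = nr_step(f_k, u_kn, N_k)
    gnorm_sci = gradient_norm(f_sci, u_kn, N_k)
    gnorm_nr = gradient_norm(f_nr, u_kn, N_k)
    # Ordinary Python branch on concrete floats; no static_argnums required.
    return f_sci if gnorm_sci < gnorm_nr else f_nr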
# Perform Newton-Raphson iterations (with sci computed on the way)
for iteration in range(0, maximum_iterations):
Can you write this as jax.lax.while_loop? Just jit-compiling this for loop will be a nightmare for XLA.
can you write this as jax.lax.while_loop?
I can take a look, though since the adaptive loop is generally only called once in pymbar, it won't be as useful to do this.
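For reference, a minimal sketch of the jax.lax.while_loop pattern being suggested; the update inside body and the carry layout are placeholders, not the pymbar algorithm.

import jax
import jax.numpy as jnp
from jax import lax

def solve(f_k0, tol=1e-12, maximum_iterations=250):
    # carry = (iteration count, current f_k, last change norm); the whole loop lives in XLA.
    def cond(carry):
        i, _, gnorm = carry
        return jnp.logical_and(i < maximum_iterations, gnorm > tol)

    def body(carry):
        i, f_k, _ = carry
        f_new = 0.5 * f_k                                # placeholder for the adaptive update
        return i + 1, f_new, jnp.linalg.norm(f_new - f_k)

    init = (jnp.array(0), f_k0, jnp.array(jnp.inf, dtype=f_k0.dtype))
    _, f_k, _ = lax.while_loop(cond, body, init)
    return f_k

solve_jit = jax.jit(solve)
print(solve_jit(jnp.ones(10)))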
Maybe it makes sense, then, to just jit the body of this function? (i.e. move everything inside the for loop to a separate function and jit just that)
Ah, yes, that makes a lot of sense given the constraints. I'll try that.
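A minimal sketch of that structure, with a placeholder update standing in for everything currently inside the for loop:

import jax
import jax.numpy as jnp

@jax.jit
def adaptive_step(f_k, u_kn, N_k):
    # Placeholder for the work currently inside the for loop.
    f_new = f_k - 0.1 * jnp.mean(u_kn, axis=1)
    return f_new, jnp.max(jnp.abs(f_new - f_k))

def adaptive(f_k, u_kn, N_k, tol=1e-12, maximum_iterations=250):
    for _ in range(maximum_iterations):
        f_k, delta = adaptive_step(f_k, u_kn, N_k)
        if float(delta) < tol:       # plain Python convergence check, outside the jit
            break
    return f_k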
obj = math.fsum(log_denominator_n) - N_k.dot(f_k)

return obj, grad

def jax_mbar_hessian(u_kn, N_k, f_k):

    jNk = 1.0*N_k
what's the point of 1.0*?
N_k is an int array; jit (or maybe JAX, I can't recall) complains if it's not converted to float first. I wasn't able to find a direct float-conversion function that made jit happy, so I just multiplied by 1.0 to do the conversion automatically.
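For reference, a more explicit alternative to the 1.0* trick (generic NumPy/JAX casts, assuming x64 stays enabled as in the imports above):

import numpy as np
import jax.numpy as jnp

N_k = np.array([100, 100, 50])                 # integer sample counts

jNk = N_k.astype(np.float64)                   # NumPy-side cast
jNk = jnp.asarray(N_k, dtype=jnp.float64)      # cast while moving onto the device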
Closing in favor of one that just accelerates the inner loop.
Converted the entire loop in adaptive to jax/jit.
See issue #340 for discussion of timings.
Intended for inspection, not for merging right now.
Failing because jax/jit isn't loaded via conda (it's not clear it can be right now; the conda-forge install on my machine didn't work, so I had to pip install).