add `Random.fork(rng::Xoshiro)` to split `rng` into a new instance #58193

rfourquet · 2025-04-22T12:30:03Z

When new tasks are spawned, the task-local RNG of the current task is "split", or "forked", into a new RNG for the new task. This is achieved in a fast and sound way, involving a secondary RNG "hidden" in the 5th field of the TaskLocalRNG struct.
Xoshiro was made to mirror TaskLocalRNG and also has a 5th field, mostly unused. It was useful for example when you want to save the state of TaskLocalRNG() via copy(TaskLocalRNG()), which returns a Xoshiro.

This PR re-implements in julia this forking mechanism from jl_rng_split() in "src/task.c", to allow forking Xoshiro RNGs independently of task spawning. This is in particular useful for parallel (reproducible) computations, where tasks can be spawned with a forked (explicit) RNG from an initial Xoshiro object.

There are alternatives, like "jumping", or for example seeding new RNGs objects from a master seed and a "worker id", like in Xoshiro([master_seed, worker_id]), but these techniques require some coordination. For example, it can be unsafe to locally create a new RNG via jumping, because you might have collision with another RNG created from the parent task also via jumping with the same number of steps.
In contrast, "forking" allows to make local decisions about the shape of the computation without risks of collisions.
And it's simpler: you just write forked_rng = Random.fork(src_rng).

Alternative names could be

spawn, which is used in Numpy, but that I find less descriptive than fork; or
split, perhaps what is mostly found in the literature, and also appears in the name jl_rng_split; one problem being that Base.split means something else.

"Fork" seems appropriate as it's used in the name Random.forkRand for array-filling with SIMD, and in the long comment above jl_rng_split.

When new tasks are spawned, the task-local RNG of the current task is "split", or "forked", into a new RNG for the new task. This is achieved in a fast and sound way, involving a secondary RNG "hidden" in the 5th field of the `TaskLocalRNG` struct. `Xoshiro` was made to mirror `TaskLocalRNG` and also has a 5th field, mostly unused. It was useful for example when you want to save the state of `TaskLocalRNG()` via `copy(TaskLocalRNG())`, which returns a `Xoshiro`. This PR re-implements in julia this forking mechanism from `jl_rng_split()` in "src/task.c", to allow forking `Xoshiro` RNGs independently of task spawning. This is in particular useful for parallel (reproducible) computations, where tasks can be spawned with a forked (explicit) RNG from an initial `Xoshiro` object. There are alternatives, like "jumping", or for example seeding new RNGs objects from a master seed and a "worker id", like in `Xoshiro((master_seed, id))`, but these techniques require some coordination. For example, it can be unsafe to locally create a new RNG via jumping, because you might have collision with another RNG created from the parent task also via jumping with the same number of steps. In contrast, "forking" allows to make local decisions about the shape of the computation without risks of collisions. Alternative names could be * `spawn`, which is used in Numpy, but that I find less descriptive than `fork`; or * `split`, perhaps what is mostly found in the literature, and also appears in the name `jl_rng_split`; one problem being that `Base.split` means something else. "Fork" seems appropriate as it's used in the name `Random.forkRand` for array-filling with SIMD, and in the long comment above `jl_rng_split`.

rfourquet added randomness Random number generation and the Random stdlib feature Indicates new feature / enhancement requests labels Apr 22, 2025

small updates

889c18f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add `Random.fork(rng::Xoshiro)` to split `rng` into a new instance #58193

add `Random.fork(rng::Xoshiro)` to split `rng` into a new instance #58193

rfourquet commented Apr 22, 2025 •

edited

Loading

add Random.fork(rng::Xoshiro) to split rng into a new instance #58193

Are you sure you want to change the base?

add Random.fork(rng::Xoshiro) to split rng into a new instance #58193

Conversation

rfourquet commented Apr 22, 2025 • edited Loading

add `Random.fork(rng::Xoshiro)` to split `rng` into a new instance #58193

add `Random.fork(rng::Xoshiro)` to split `rng` into a new instance #58193

rfourquet commented Apr 22, 2025 •

edited

Loading