Add new `create_inits()` methods to other stanfit classes #791

amas0 · 2025-05-08T02:05:54Z

Submission Checklist

[ x ] Run unit tests
[ x ] Declare copyright holder and open-source license: see below

Summary

This PR addresses #745. The implementation largely mirrors that of CmdStanPathfinder. In particular, it adds the following methods: CmdStanVB.create_inits(), CmdStanLaplace.create_inits(), CmdStanMLE.create_inits(), CmdStanMCMC.create_inits(). With the exception of MLE, inits are selected by sampling without replacement from draws. The MLE implementation initializes all requested chains at the optimized parameter values.
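For readers who want the selection rule spelled out, here is a minimal sketch of "sampling without replacement from draws". It is not the code in this PR: `sample_inits`, the `draws` layout (variable name mapped to one value per draw), and the choice to return a bare dictionary when `chains == 1` are all assumptions made for illustration, and it uses only the standard library rather than NumPy.

```python
import random
from typing import Dict, List, Optional, Sequence, Union

# Hypothetical layout: variable name -> one value per draw.
Draws = Dict[str, Sequence[float]]
Inits = Dict[str, float]


def sample_inits(
    draws: Draws, chains: int = 4, seed: Optional[int] = None
) -> Union[Inits, List[Inits]]:
    """Pick `chains` distinct draws and return one inits dict per chain."""
    n_draws = len(next(iter(draws.values())))
    if chains > n_draws:
        raise ValueError("cannot pick more distinct draws than are available")
    rng = random.Random(seed)
    # random.sample never repeats an index, i.e. sampling without replacement.
    idxs = rng.sample(range(n_draws), chains)
    inits = [{name: values[i] for name, values in draws.items()} for i in idxs]
    # Assumption: a single chain gets a bare dict instead of a one-element list.
    return inits[0] if chains == 1 else inits


draws = {"mu": [0.1, 0.2, 0.3, 0.4, 0.5], "sigma": [1.0, 2.0, 3.0, 4.0, 5.0]}
per_chain_inits = sample_inits(draws, chains=4, seed=123)
```

Each chain starts from a different draw, and every variable in a given dictionary comes from the same draw, so the joint structure of the draw is preserved.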

Unit tests for all these methods are added as well.

I also updated the User's Guide example on creating inits from VI outputs to be more general and cover these new methods.

I assume this is already well known, but it seems the library structure would be cleaner if each of these stanfit objects implemented the same parent (or abstract) class, which would unify these implementations (and those of other common methods). I haven't worked through the details, but it seemed possible given how similar the internal structures tend to be.

Copyright and Licensing

Please list the copyright holder for the work you are submitting (this will be you or your assignee, such as a university or company): myself

By submitting this pull request, the copyright holder is agreeing to license the submitted work under the following licenses:

Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)

codecov-commenter · 2025-05-08T02:14:36Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 80.60%. Comparing base (650d2bb) to head (b18d450).
Report is 24 commits behind head on develop.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #791      +/-   ##
===========================================
+ Coverage    80.24%   80.60%   +0.35%     
===========================================
  Files           25       25              
  Lines         3878     3949      +71     
===========================================
+ Hits          3112     3183      +71     
  Misses         766      766

WardBrian

Thanks! Overall this looks great, I just have a few minor comments.

Re:

I assume this is already well known, but it seems the library structure would be cleaner if each of these stanfit objects implemented the same parent (or abstract) class, which would unify these implementations (and those of other common methods).

Yes, we definitely could. So far most of what we've done in this style is using helper methods we throw in the utils subpackage for code re-use, but inheritance would also get us there, and maybe a bit more cleanly. In another project that deals with Stan outputs, I don't even differentiate between the different algorithms, I just have one 'StanOutput' class that can hold any of them.

cmdstanpy/stanfit/laplace.py

WardBrian · 2025-05-08T12:17:54Z

cmdstanpy/stanfit/mle.py

+        if chains == 1:
+            return mle_inits
+        else:
+            return [mle_inits for _ in range(chains)]


Because passing a single dictionary to sample already applies it to every chain by default, I don't know if this is super necessary.

I do think this function should probably accept both a seed and a chains argument, even if they're both unused, to let people write more generic code.

(I also think this function might be exactly equivalent to self.stan_variables() after those changes?)

Updated the signature for compatibility. I also use stan_variables() in the function now, but because stan_variables() will currently return a float if the parameter is scalar, I just map everything to an np.array for type compatibility. It seems this is intended to be the future behavior of stan_variables() in 2.0, at which point it will be equivalent.

docsrc/users-guide/examples/VI as Sampler Inits.ipynb

This change simplifies create_inits() for the MLE params to always return a dictionary of inits, rather than a list for multiple chains. By default, the sample method initializes all chains at the same values when only one set of inits is given. The chains parameter is kept and the seed parameter is added to the signature, despite both being no-ops, to maintain uniformity with the create_inits() methods on the other stanfit objects.
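As a rough illustration of why the no-op parameters are useful, the sketch below uses a hypothetical stand-in class, `PointEstimateFit`, and a hypothetical generic caller, `inits_for`; neither exists in CmdStanPy. The actual method also maps values to NumPy arrays, which this standard-library-only sketch leaves out.

```python
from typing import Any, Dict, Optional

InitsDict = Dict[str, Any]


class PointEstimateFit:
    """Hypothetical stand-in for an optimization result (not CmdStanMLE)."""

    def __init__(self, estimates: InitsDict) -> None:
        self._estimates = dict(estimates)

    def stan_variables(self) -> InitsDict:
        # Return a copy so callers cannot mutate the stored estimates.
        return dict(self._estimates)

    def create_inits(
        self, seed: Optional[int] = None, chains: int = 4
    ) -> InitsDict:
        # seed and chains are accepted only for signature uniformity;
        # a point estimate has nothing to sample, so both are ignored.
        return self.stan_variables()


def inits_for(fit: Any, chains: int, seed: Optional[int] = None) -> Any:
    """Generic caller: works for any fit exposing create_inits(seed, chains)."""
    return fit.create_inits(seed=seed, chains=chains)


fit = PointEstimateFit({"mu": 1.5, "sigma": 0.7})
inits = inits_for(fit, chains=4, seed=42)
```

Because the result is always a single dictionary, the caller can hand it to the sampler unchanged and rely on the sampler's default of reusing one set of inits for every chain.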

amas0 · 2025-05-08T21:51:31Z

Thanks for the review -- responded to your comments and made changes where appropriate.

Not sure what's going on with that failed pytest check. Some of the cancelled jobs seem to have gotten through the testing stage without issue.

Going through the library, there are a number of references to a 2.0 version -- that might be a good opportunity to clean up some of these stanfit objects and such. Not sure if there are explicit plans for what that will contain (or timeline), just a thought. Happy to help out if there's a direction there.

WardBrian · 2025-05-09T15:13:34Z

Because ADVI can be unstable it might be worth setting a known-good seed in the new test that runs it

amas0 · 2025-05-09T22:30:03Z

Okay, pinned some seeds that seemed fine and checks look good. Serendipitous that I went back and did that, because I found a bug in one of the tests I wrote, which I have now fixed.

I think this should be set?

bob-carpenter · 2025-05-12T16:17:44Z

Thanks. @mitzimorris may be able to review and approve sooner than @WardBrian for this one.

mitzimorris · 2025-05-14T20:50:23Z

the code looks great.

the documentation section on creating inits should also mention creating inits from previous MCMC runs.
I would be happy merging this now and doing a follow-on PR to improve the docs.

mitzimorris

This could be done as a follow-on PR - the code looks good.

docsrc/users-guide/examples/VI as Sampler Inits.ipynb

amas0 · 2025-05-14T22:02:02Z

I'm happy to quickly update the docs as part of this PR. I do mention that previous MCMC estimates can be used as inits, but it's more of an aside at the end. I'll make it clearer that it's included.

amas0 · 2025-05-14T22:23:40Z

Updated the docs to clarify that one can use the output of one MCMC run as inits into another.

One relatively minor point. With this update, the file name of the example notebook VI as Sampler Inits.ipynb is somewhat outdated. I think it would make sense to update this, but doing so would break any existing links to that example as the URL is generated from the file name. I imagine a redirect could be set up with Sphinx somehow.

mitzimorris · 2025-05-14T22:43:08Z

you're right - we're stuck with the filename.

mitzimorris

Thanks - this all looks great!

amas0 added 8 commits May 6, 2025 21:23

Add CmdStanMCMC.create_inits()

bee9baf

Fix inconsistent string quoting

e0b35fb

Add CmdStanMCMC.create_inits() test

617b5cb

Add CmdStanLaplace.create_inits()

409b13f

Add CmdStanMLE.create_inits()

62f46d2

Add CmdStanVB.create_inits()

40c96e3

Add sampling tests from *.create_inits() for MLE, VB, Laplace, and MCMC

ab73751

Update sampler init user's guide example

162ed0b

WardBrian reviewed May 8, 2025

amas0 added 3 commits May 8, 2025 16:36

Fix errant period in docstrings

9b9a3df

Strip output from sampler inits notebook

8ce6ebe

Fix test_variational test on creating inits

b18d450

mitzimorris approved these changes May 14, 2025

Clarify docs on using sampler output as inits

d6d5bd3

mitzimorris approved these changes May 14, 2025

mitzimorris merged commit d1aeceb into stan-dev:develop May 14, 2025
16 checks passed

mitzimorris mentioned this pull request May 15, 2025

add "create_inits" to fits from sampler, optimization, advi #745

Closed
