[Question]: Sharing remote cache between MacOS and Linux #687

gregjacobs · 2022-12-07T17:06:28Z

What is the current behavior?

Hey guys, I'm hoping you can help me with this. I'm using a remote cache that I was hoping to share outputs between Mac and Linux (CI).

However, currently building everything on my Mac still causes my CI machine (on Linux) to rebuild everything. I'm guessing this has something to do with the platform somehow? (even though JS outputs are platform-independent?)

Do you know of a way to make this work?

alexeagle · 2022-12-07T18:05:43Z

This isn't JS-specific, Bazel generally treats action inputs as opaque, checksummed files, and so if one action has a linux nodejs interpreter as an input, it will have a different cache key than the same action with a mac nodejs interpreter.

My understanding is that bazelbuild/bazel#15542 gets us closer, fixing the case where an output .js file is the same between linux and mac, so that subsequent actions that use that .js file could be cache hits between different platforms. However the nodejs interpreter would still make the cache keys different.

@fmeum is there an issue on Bazel that expresses the general "cross-platform cache hits" feature request?

fmeum · 2022-12-07T19:00:03Z

#6526 is the most fitting one. @tjgq is also working on solving the "multiple interpreters/SDKs" issue.

gregjacobs · 2022-12-08T15:31:09Z

Hey @alexeagle, @fmeum, thanks for the replies. I'll be following bazelbuild/bazel#6526 and hoping for this soon! This would be a huge win for our developers developing our monorepo on Mac but building/testing on Linux CIs. Being able to share the cache artifacts and test outputs back and forth should save a lot of time.

@alexeagle Feel free to close this issue if you like, unless you want to track this here.

Thanks again,
Greg

alexeagle · 2023-08-28T17:25:19Z

I discussed this with @tjgq and there's an approach which is easy for us to try, I think of it as a "multiplex toolchain". Interpreter for all platforms are inputs, which is a bit wasteful, then when execution begins you pick the one for the exec platform. That way the cache inputs appear the same on all platforms.

gregjacobs · 2023-09-10T01:09:16Z

Well that's definitely an interesting idea!

I'm just realizing though: scripts could in theory do something different based on os.platform(), so maybe the per-platform requirement makes sense 😶 Although on the other hand, for web outputs, that wouldn't matter. This is a tough one.

gregjacobs · 2024-04-19T21:35:19Z

After working with the rules for a while now, I'm having difficulty imagining a case where the outputs of a JS program would be different based on exec platform. Are you guys able to think of any?

If not, I think the above solution might be a worthy tradeoff. I'd rather have the shared cache and have a little longer download time for the Node binaries (which happens only once every so often). And in the event that someone ever comes up with a reason to not follow this anymore (i.e. they've found a case where Node outputs are different based on exec platform), could revert.

What do you guys think?

alexeagle · 2024-10-18T19:07:46Z

This came up at BazelCon this year in a talk on performance: https://static.sched.com/hosted_files/bazelcon2024/d0/TB-137%20Sharmila%20BazelCon%202024%20-%20Performant%20Bazel%20Builds%20for%20Web%20Monorepos%20at%20Scale.pdf

I think it's time to implement this. Since it's breaking in theory when a program senses the os.platform, we should just have a flagged rollout. In rules_js 2.x it would be off by default with a TODO to flip default to true in rules_js 3.0

gregjacobs added the enhancement New feature or request label Dec 7, 2022

alexeagle added the question This issue is a question. Close the loop with documentation? label Dec 7, 2022

gregmagolan added this to Open Source Feb 4, 2023

gregmagolan added blocked Blocked by another issue and removed enhancement New feature or request labels Feb 4, 2023

gregmagolan moved this to 📋 Backlog in Open Source Feb 4, 2023

gregmagolan moved this from 📋 Backlog to 🛑 Blocked in Open Source Feb 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question]: Sharing remote cache between MacOS and Linux #687

[Question]: Sharing remote cache between MacOS and Linux #687

gregjacobs commented Dec 7, 2022 •

edited

Loading

alexeagle commented Dec 7, 2022

fmeum commented Dec 7, 2022

gregjacobs commented Dec 8, 2022

alexeagle commented Aug 28, 2023

gregjacobs commented Sep 10, 2023

gregjacobs commented Apr 19, 2024

alexeagle commented Oct 18, 2024

[Question]: Sharing remote cache between MacOS and Linux #687

[Question]: Sharing remote cache between MacOS and Linux #687

Comments

gregjacobs commented Dec 7, 2022 • edited Loading

What is the current behavior?

alexeagle commented Dec 7, 2022

fmeum commented Dec 7, 2022

gregjacobs commented Dec 8, 2022

alexeagle commented Aug 28, 2023

gregjacobs commented Sep 10, 2023

gregjacobs commented Apr 19, 2024

alexeagle commented Oct 18, 2024

gregjacobs commented Dec 7, 2022 •

edited

Loading