-
-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Completion of the symlink feature #539
Completion of the symlink feature #539
Conversation
I checked all the other files and all had one
…other directories
:/ Let me check why the CI is failing |
My guess is that we are trying to write a .version file and the directory above isn't created yet? |
… path All other calls were already doing it, this was the only exception. The change is required because not all .version files are in $module_base
See the error:
and notice |
@muffato one thing that just came up in a conversation with someone is this idea of development environments. Arguably a symlink base (as we have it here) allows for generating an entire tree at some single named symlink base, and then loading from it. But is it so crazy to extend the idea to say "you can have as many little unique trees as you like"? The issue (as I see it) is that we don't have a good way to manage that space - e.g., let's say you have a module symlinked in three places, and we decide to uninstall a module, are we 1. happy with uninstalling in all three places, and 2. can we even find all the places? Just something to chew on - I think shpc (for now) is primarily a "run production stuff" sort of tool, but even for that if we have use cases like we were talking about with @georgiastuart and @marcodelapierre of wanting to have particular groups of things kept together (e.g., for different MPI or cuda or whatever) allowing for multiple symlink trees (and maybe calling them environments or module groups) isn't such a crazy idea. Let me know what y'all think! (side note, these discussions are very fun to have :) ) Update: also see marcodelapierre/toy-shpc-nf#2 ! |
…needs to know about the current ones
Co-authored-by: Vanessasaurus <[email protected]>
lol I just noticed that you had already dubbed this feature view in #502 (comment) . All credits go back to you, then ! |
No worries @muffato ! I was also thinking of a database view when I used that term :) If we both thought it was more an appropriate term, it probably would be a good choice. If they're held centrally, say in the container.yaml or nearby, then shpc can correctly install and uninstall everything. I think there are two ideas here:
For 1., I think I still like the first option - the user can ask to create a named view:
And it gets generated either in the default of the user home, OR a custom view path (views_base). $ ~/.singularity-hpc/views The reason we don't want to just throw it into the symlink base is because I expect people would want to do: $ module use $symlink_base and if they were under there you'd just get a mess of conflicts, too many things trying to load, etc. For 2., how to define the view, it looks like you are suggesting we have "named views" provided by container.yaml, e.g.,:
But I don't think that's the right approach because it means that everyone is going to have to write their cluster-specific modules into the container.yaml - what if they have different needs? Different names? it breaks very quickly. On the other hand, if the views are flexible to "whatever" you could just do: $ shpc view create mpi
$ shpc view add mpi-my-custom-1
$ shpc view add mpi-my-custom-2 OR you might have some equivalent requirements file that mimics the above: view:
name: mpi
modules:
- mpi-my-custom-1
- mpi-my-custom-2 And then $ shpc view create view.yaml
>> Creating view mpi from view.yaml... And arguably if you create it manually first, you could just export it. $ shpc view export mpi
>> Exporting view mpi to view.yaml And share that view file somewhere else. This creates the needed separation (I think) between a container module and how you want to group it / use it in the context with others. It's flexible to different names or needs for groups of modules, and still able to save / share. The reason I paused us from merging here is because I think having views would make the symlink_base defunct. The default view would generate a symlink tree. It would be confusing to merge support for a symlink_base and then provide views, which are a slight variant on that? The view is actually the same thing, just with a more elegant design (imho). In terms of the cleaning up bits, as long as we are storing views in a consistent place it shouldn't be any different than it is now - it would just take more checks to ensure that a module isn't symlinked across any views.
This could be an issue, I agree. But arguably if you want separation, couldn't the user just shpc install from their own clone? The use case this meets is what @marcodelapierre and @georgiastuart were talking about with respect to groups of modules to be installed together / alongside. Which makes me realize, we would want to allow the view.yaml to have separate sections for shpc and "other modules" view:
name: mpi
modules:
- mpi-my-custom-1
- mpi-my-custom-2
system_modules:
- mpi
- system_module And then perhaps you'd "install" an shpc module to a view, and "add" a system module: $ shpc view install clingo # shpc symlink
$ shpc view add mpi # system module Actually I'm just realizing - if we are adding a system module, we'd need to change the modulefile to require the module. The symlink won't work here. But is there a way to ask the module software to, given we are loading children of some path, to write some file at a higher level that requires the modules? Then we could still do the symlinks (the module files are unchanged) but "add" would rewrite the file that specifies that in the root of the view. If that doesn't work, then symlinks won't work for this use case, and we don't actually get any added functionality with view, because you could just do
The issue though is permissions - the user would need write permission to write new containers, and thus they could delete old ones. If all users don't set some config command like: $ shpc config set container_delete:false It could get ugly really quickly! There would need to be some kind of support for a set of "admin commands" that create a setting that absolutely doesn't allow delete. Still, someone could do it manually 🤷 Okay so after thinking about this - I think:
Okay so now this other approach:
I think "where we define them" is a separate thing, and I don't think they should be hardcoded alongside the container.yaml, for the reasons I said earlier.
Oh interesting. But this still assumes a shared set of admin managed views... which maybe is what is desired anyway? E.g., providing groups of modules? At least at Stanford the module system had a way to provide those groups, although I don't know how it was implemented.
I agree the implementation won't be too bad - but we do need to figure out how this design will look, and if it relates to the symlink tree (or not) and if not, what makes it functionally different? I'm not an admin of an HPC system so I have the least experience here, but I've done my best to write out some thoughts. I'll think more on it and post again if I get any insights. Oh, and before I forget, one use case I want to bring up (that isn't obvious here) is being able to generate quick "throw away" install environments for running something like Snakemake or Nextflow. A view would be perfect for that. |
Right, I get your main point. Making Overall it sounds like the only possibility to still allow clean uninstalls is that the user who installs/maintains shpc containers+modules would also control the views. Just need to define a config file for storing the views, etc. I would request an option to have a view that automatically contains all software. Not only it's what I want (being selfish), I think it's what @marcodelapierre had in mind as well in #456 (comment) . That's what I thought Jacqui meant in #456 (because that's how the modules are organised on her screenshot) but paying attention to her first comment #456 (comment) she asked for groupname/software/version which could be nicely implemented with views.
I agree that views can be used to group software that have different major dependencies, like open-mpi vs mpich. In #527 (comment) I was thinking about distinct shpc installations, but views could definitely help.
As a ground rule, I would keep views as strict views, i.e. one can only make an already-installed software visible in a view, but not install new software. |
I like that too. |
Okay so here is my proposal then: We first create a new view. $ shpc view create myview My original thinking is that all views would be the shortened version of the symlink first. But the need to "install all modules" gives me pause for that. @muffato would you prefer an option to create a view that is configured for "full paths" (and then be able to install all) or shortened paths (default) but then possibly have some conflicts that we'd either need to automatically manage or let the last one win? Either way, creating a new view actually means generating a new directory in some views/
myview Only software that is installed by shpc can be added to the view. E.g., this would work: $ shpc install ghcr.io/autamus/clingo
$ shpc view install ghcr.io/autamus/clingo And then since it's shortened symlinks by default, you'd have: views/
myview/
clingo/
module.lua of course then you would want to cd there and do use from the myview root to avoid the extra path: cd vieww
module use ./myview and maybe there could be an shpc command to help that: $ shpc view myview
$ module load $(shpc view myview) and that we would need to figure out tricks to only load some specific level, e.g., for that to work you'd need a relative path so it couldn't be run from anywhere. Or I wonder if we could figure out how to do: shpc load myview Do any other tools load modules on behalf of the user and it works? Thoughts? Then you could export, view, etc. $ shpc view export myview view.yaml
>> myview exported to view.yaml $ shpc view import anotherview view.yaml
>> view.yaml imported into anotherview or probably more meaningful messages than that, or even: $ shpc view import anotherview view.yaml
>> my view requires shpc install ghcr.io/autamus/clingo, do that first And then to show all views, in case you forget $ shpc views |
we can just use a filesystem loader and then from_string instead note that I did this work last night but forgot to commit and push. Signed-off-by: vsoch <[email protected]>
Hi @vsoch . To answer your first question, I think the case for shortened paths stands regardless of how we do views, cf #456. Something that's been bothering me about this is who would control the views, where they would be stored, and the risk of having "rogue" views that
|
Hi @vsoch, @muffato, it's some quite overly busy weeks here, I haven't got the chance yet to review your proposed features for environments and views...sorry! Overall, they indeed sound like a potentially very powerful new addition to SHPC.
TLDR: right now, if I think of SHPC envs/views, I pretty much see it as the simple, but super-duper powerful equation |
Are you folks thinking of decoupling the release of the symlink features from that of the new envs/views? Or are you thinking of doing all of it before releasing? |
I think my realization is that the symlink feature is a kind of view, so I wanted to figure views out and refactor to that, and not for example have a few releases of shpc that then change how it works entirely. |
Makes total sense, thanks for sharing this! |
Ok, did some more reading above :) Short story: fantastic discussion and fantastic ideas, I loved reading it! I like all of what you guys said, in general: functionalities, UI commands, ideas on the specifications of a view. I am saying "in general" just because I am at end of day and was unable to read all of the details :-) I have some thoughts I will try and share, focusing for now on the specifications and functionalities of the views. So, for this view YAML:
|
So, to summarise from my YAML lines above, a couple of thoughts:
Why point number 2.? Some cases:
So, my implied idea for point 2. is to try and have one single SHPC installation that tames them all (paraphrasing Vanessa here ;) ). |
How would management of this views with their own settings work? I agree with your comments above, suggesting that to avoid going crazy we should probably make it so that a view can only be managed by its creator, and stop. One possible limitation that might be implied by equating ....and here more thoughts. I like the idea some of you stated of starting with a default view with all modules. So, almost wondering....could we reinterpret the default starting context of SHPC as a kind of "base" view? and a base view which can only have "all modules", and possibly only "long" module tree naming? We could even add the It would be nice if all of this could keep working with the site vs user settings.yml, which I think is the case right now. What I like about the options for the user settings is that it enables one single SHPC installation to be used by multiple users, that can customise EVERYTHING, including, crucially the SIF file storage path. |
One more thought on the location of the view module tree. Scenario 1. Scenario 2. |
And last one for today... About my idea above of two kinds of view, 1. typical symlinks , 2. new module tree with its own settings. In this case, a hierarchy of which settings are used first is needed, eg
With this idea of the view setting, we would also need to decide whether to allow the customisation of the container_base. Maybe best to force it to be the same as the underlying user/site setting, but not sure. |
I hope this can contribute to your great conversation and work, feel free to pick/edit/ignore, and to ping me! :-) |
Remove custom loader in favor of Jinja2 Filesystem -> from_string
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ready to go! I think next I'll start from this branch and put in some early ideas for a PR. I don't want to merge into main (and thus release) because I expect the view work will tweak some of this a bit, and better to put one new feature in at once instead of a new feature that immediately gets deprecated for another one.
As always, great work @muffato !
@marcodelapierre don't worry I've kept a link to the discussion here - will try to write it all up in one place to stare at before working on views, should be next! |
No worries @vsoch , thank you! Thinking back at what I shared yesterday, I think a chunk of it is probably pushing it too much, in terms of adding complexity without big benefit. I am thinking of the view-with-specific-settings idea. Glad to hear what you guys think, but otherwise feel free to skip. The ideas which I most invite you and @muffato to think about are the couple of extra specifications for the views:
One more thing I thought about this morning is on the GPU case. IF we want to handle the case of a AMD+Nvidia container, this may complicate things: the symlinks would not do, because we need to provide the right Singularity flag inside the aliases/wrappers definitions. Or...? And one more morning thought on the UI, it would be good to have a view command that just produces the root path of the view, eg
which could eventually be used by
Finally, I think it might help the conversation to re-state what we're trying to achieve with the views, in terms of functionalities and user cases. It might help to decide what features to include, and some aspects of their mechanics. I hope this can help your very interesting brainstorming on the views feature. Thank you! |
Thanks @marcodelapierre your feedback is really helpful! This is a holiday (3 day) weekend so I anticipate getting in our first PR for views soon - I know how I'm going to restructure shpc to prepare and it's exciting :) |
Discussion here moved / continued in #545. |
* adding testing of a symlink tree install * unset symlink_home * missing module load * adding symlink_tree setting to indicate to always set also adding support for custom config on the fly with -c and for removing symlinks on uninstall * missing command argument * symlink_home -> symlink_tree and --symlink-tree instead of --symlink * forgot to add dest * ensure that we cleanup symlink directory of .version and empty * ensure we dont create symlink version unless tcl and default version is true * do not require symlink_base to be defined we can have a default of $root_dir/symlinks instead * bugfix: without this, create_symlink would only be called based on the CLI argument, not the .yml setting (#509) * verison bump * Minor refactoring of `check_symlink` (#510) * No need to create the symlink base directory here since it will be created by `create_symlink` * Make this part of the code symmetric with self.create_symlink() * Fixed a truncated sentence * running black * Skip `module` in the symlinks (#511) * Implemented "default_version" for TCL * Use write_version_file for the symlink tree too * Skip `module.tcl` in the symlinks This is done by symlinking `<software>/<version>` itself to `<namespace>/<software>/<version>/module.tcl`. For the directory of the wrapper scripts to be correctly found, the symlink has to be resolved, but TCL's `file normalize` won't normalise the filename. So, we need to use `file readlink` instead, but only on real symlinks because it raises an error. * Symlink to module.lua when possible which is when default_version==True (default_version==False can't be made to work with symlinks). * Added a `--force` option to `shpc install` to force overwriting existing symlinks The name `--force` is generic, so that other things could be forced through it, not just overwriting symlinks. Also added an info message if a symlink is overwritten, which can be hidden with the `--quiet` flag. * Made `force` optional * Forgot the variable for substitution * The "delete" command was superseded by "uninstall" in #6 * Added `--no-symlink-tree` to override the config file `--symlink-tree` now also overrides the config file * Make it explicit we are expecting yes or no * add force and symlink arg back in * do not pin black * Completion of the symlink feature (#539) * Removed leftover from an earlier implementation of `default_version` * bugfix: True and False are deprecated (but still valid) * Lmod symlinks can now skip "module" too thanks to @marcodelapierre * Added the missing newline character at the end of the file * bugfix: the return needs to happen after the creation of the symlink * `symlink_base` should allow environment variable expansion, like the other directories * Cannot consider using symlinks without the base defined * The two classes are meant to address exactly this * bugfix: the symlink needs to be cleaned regardless of where the modules are held * rmdir_to_base does the upwards clean-up correctly * No need to complain if symlinks are not enabled in the first place * bugfix: the caller of write_version_file needs to build the directory path All other calls were already doing it, this was the only exception. The change is required because not all .version files are in $module_base * bugfix: .version needs to be updated in the symlink tree too * bugfix: when uninstalling the last version of a tool there is no directory any more * Make the settings object upgrade legacy values so that the code only needs to know about the current ones * Expanded the function to deal with files and symlinks * Adding we can just use a filesystem loader and then from_string instead * Message update cf #502 (comment) * saving start of work on views this adds an shpc view command and refactors the symlink nomenclature to instead b about views! I need to write documentation and make a few additional commands to load a view from file and write tests for views too. * adding changes to docs and tests for views! * spelling mistake and missing tests helpers file * force needs to be specific for uninstall * remove deprecated symlink-tree * adding support to install from a file and list installed modules * legacy values is broken - remove for now * fixing bug with module version files - cannot do any funny business with creating module class on the fly! * module .version in symlink only should exist for lmod * fixing bug with uninstall and adding shpc view list * adding support to generate a view module a user can now do shpc view add <var> <val> to add customizations to a view! In practice this means that installed modules need to make an attempt to load the view and given that it is a symbolic link, it should find the file (at least we hope). This means in practice one view should be loaded at once, otherwise you will have two conflicting .view_module. * ensure we only use try-load for modules >= 4.8. I am also updating the testing to run for both an older and newer version (with and without support) * fixing bug with test I had changed the client variable to be module_sys instead of module to match the rest of the library, and forgot to update it in the tests helper script. * adding support for shpc view remove <name> <var> <value> and I am also adding better testing to look at both the config and file content of view_module in both cases Signed-off-by: vsoch <[email protected]> Co-authored-by: vsoch <[email protected]> Co-authored-by: Matthieu Muffato <[email protected]> Co-authored-by: Matthieu Muffato <[email protected]>
Hi
Follow-up of #502 with these changes:
module
in the name) for Lmod whendefault_version
is False. Looks like in Adding testing of a symlink tree install #502 (comment)Tested with Lmod and TCL