Input data backup from betzy #712
Replies: 34 comments 78 replies
-
|
One way to identify missing files is to run tests with One current issue with this method on Betzy is NorESMhub/ccs_config_noresm#70. |
Beta Was this translation helpful? Give feedback.
-
|
I have been doing 6 semi-sporadically @MichaelSchulzMETNO . Do you want some more regular update schedule? |
Beta Was this translation helpful? Give feedback.
-
|
No, I think it only matters under which project they are stored under (for who they are accounted to), but yes the wrong group ownership can be a problem regarding access. |
Beta Was this translation helpful? Give feedback.
-
|
Potential show stopper: |
Beta Was this translation helpful? Give feedback.
-
|
@TomasTorsvik that's interesting. When I try rsync from betzy to nird it always asks for 2FA which is VERY annoying! What trick do you use to circumvent it? |
Beta Was this translation helpful? Give feedback.
-
|
Just for documentation: If someone feels responsible for those, please either adjust permissions (readable for the group |
Beta Was this translation helpful? Give feedback.
-
|
Some very preliminary info regarding the amount of input data files on Is this high number expected? SOURCE_PATH_EXCLUDE_LIST = [".svn/*", "*/.svn/*", "*.lock"]Please suggest more file patterns for removal... |
Beta Was this translation helpful? Give feedback.
-
|
I think it is copied to the research archive though. |
Beta Was this translation helpful? Give feedback.
-
|
Hopefully the last post before copying... The following list is the files below the path Please confirm that these are the files that are supposed to be copied to
This would leave around |
Beta Was this translation helpful? Give feedback.
-
|
Next question: |
Beta Was this translation helpful? Give feedback.
-
|
How about |
Beta Was this translation helpful? Give feedback.
-
|
Just for info: |
Beta Was this translation helpful? Give feedback.
-
|
Next question: |
Beta Was this translation helpful? Give feedback.
-
|
Just for info: |
Beta Was this translation helpful? Give feedback.
-
|
There are two folders in /datalake/NS16001B, where the second is referred to from the www folder: cdl-ns16001b-noresminputdata |
Beta Was this translation helpful? Give feedback.
-
|
Just for info: Looking a bit further, the main thing is the directory Please advise what's really needed in the backup. The script can exclude directory structures. |
Beta Was this translation helpful? Give feedback.
-
|
@kjetilaas @maritsandstad can you clarify? |
Beta Was this translation helpful? Give feedback.
-
|
Documentation of input data saving/syncing : @tylov @jgriesfeller could you add a section on the input data maybe on the wiki: and check / edit this one : we have now: /datalake/NS12077K/CESM-input-data betzy olivia How do they hang together? |
Beta Was this translation helpful? Give feedback.
-
|
To my knowledge, some of this is also a bit in flux still @MichaelSchulzMETNO, especially for the Olivia / 16001B / CDL part... I sent an email to sigma2 last week about it, and they say it's a priority for them to get that coupling working, however, right now they are very busy with sorting out the allocations for 2026.1... |
Beta Was this translation helpful? Give feedback.
-
|
@tylov @jgriesfeller @matsbn @maritsandstad @MichaelSchulzMETNO Inputdata on nird (/datalake/NS16001B/) is not complete copy of Betzy; I was planning to check functionality of CDL by putting inputdata in cache as presented by Dhanya and Lorand during meeting. I was able to put most data in cache folder but still, some data are missing on nird. For testing purpose, it would be good if we have a complete copy of Betzy on nird in /datalake/NS16001B/. Please can be prioritise this task. Thanks a lot |
Beta Was this translation helpful? Give feedback.
-
|
@monsieuralok, is it possible to generate a list of the missing files? What test shows missing files which are needed by current simulations? |
Beta Was this translation helpful? Give feedback.
-
|
For NF2000, atleast this file is missing /datalake/NS16001B/cdl-ns16001b-NorESMInputdata/lnd/clm2/snicardata/snicar_optics_5bnd_c013122.nc I will start to update the list once I start to build other cases |
Beta Was this translation helpful? Give feedback.
-
|
This file (and some others) did not have the noresm group set. They have been fixed and should get copied on the next script run. |
Beta Was this translation helpful? Give feedback.
-
|
I changed the group for my missing file /cluster/shared/noresm/inputdata/share_alok/domains/domain.ocn.fv0.9x1.25_tnx1v4.170609_djlo.nc I did however notice that the identical file already being owned by the noresm group exists in /cluster/shared/noresm/inputdata/atm/cam/ocnfrac/ @monsieuralok |
Beta Was this translation helpful? Give feedback.
-
|
@matsbn @MichaelSchulzMETNO @lisesg @jgriesfeller @gold2718 It would be good if could decide that which files from inputdata should be copied from betzy to nird. Also, we should send message to users to change group to noresm on Betzy. I am checking for functionality presented by Lorand for Olivia; it wokrs but, still some files are missing and I also have few questions. We should also think about previous versions of NorESM for inputdata. From my part, I have changed all files to group noresm those I owned. |
Beta Was this translation helpful? Give feedback.
-
|
Sorry, for being slow... I checked the summary files prepared by @TomasTorsvik (thank you!). There were indeed a few essential ocean files that had group |
Beta Was this translation helpful? Give feedback.
-
|
Just to say, some of the users on this list have data there (irismuz and janko at least) which are on purpose not included in the noresm group to not have large datasets that don't need to be copied over to the input data |
Beta Was this translation helpful? Give feedback.
-
|
@jgriesfeller @gold2718 Has we synced again the inputdata directory from Betzy to nird as many others have changed group in last couple of weeks; as for Olivia testing it is required on PRIORITY basis.. |
Beta Was this translation helpful? Give feedback.
-
|
Hi, |
Beta Was this translation helpful? Give feedback.
-
|
@jgriesfeller please could you copy following files from Betzy to nird: |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
goal: Save betzy NorESM input files to nird /projects/NS9560K/www/inputdata
steps:
Identify "NorESM input files" on betzy:/cluster/shared/noresm/inputdata of group owner noresm
which are NOT in NCAR copy : /datalake/NS12077K/CESM-input-data
Exclude some files: svn, lock, etc
Rsync those "NOT-in-NCAR NorESM input files" to /projects/NS9560K/www/inputdata
2b) identify problems with permissions - communicate with sigma2 to make all files readable for NS9560K
2c) rsync will only copy new files on www/inputdata
2d) Only the rsync script should add files on www/inputdata
Users on betzy can add files to this process by making files and directories owned by group "noresm"
Make a repository under NorESMhub with scripts for steps 1+2 @jgriesfeller
Run script regularly (tbd) (Jan)
5b) Manually correct files on /projects/NS9560K/www/inputdata on demand with Data management group
Update NCAR files now and then (Marit?)
Look at input files from other machines , on demand
Move eventually www/inputdata to a specific input data project
Please comment @maritsandstad @gold2718 @tylov @jgriesfeller @DirkOlivie @monsieuralok @matsbn @lisesg
Beta Was this translation helpful? Give feedback.
All reactions