feat: SST1RSoXSDB: flexible tool for finding run numbers #31

pbeaucage · 2022-06-05T22:30:48Z

One major obstacle to the use of the DataBroker loader is fishing the run you want out of the sea of all 40,000+ runs (as of summer 2022) on the instrument.

One approach to this is to use a hybrid sort of scheme with, say, a Pandas frame that contains basic metadata, from which scans could be loaded.

API would be vaguely like summarize_run(proposal=None,saf=None,user=None,institution=None,project=None,sample=None,plan=None) which could return a pd.Dataframe with columns in the start document.

The user could then select down that data frame, even using pandas tools like:
df.sample_id.drop_duplicates() to view unique sample id's, say.

Closing this loop would probably involve actually doing isel, sel, or where calls to get a reasonable number of scans, then passing that frame back into a loader function that would make one big xarray out of those scans.

The text was updated successfully, but these errors were encountered:

pbeaucage · 2022-09-21T13:40:19Z

Remaining needs on this:

documentation
a smooth api to reduce a dataframe returned by summarize_run to a list of run numbers for feat: multiple-scan / cross-scan loading in SST1RSoXSDB #38

Addresses feat #31 Expanded functionality of the summarize_run function in SST1RSiXSDB.py (#31). I believe the signature matches that of the old version, so existing code should remain functional. New Features: Slightly expanded set of preset keyword search terms, made many case-insensitive, made many regex-based. Allowed for additional search terms to be provided as keyword arguments, specifying the match method If the catalog is reduced to zero, the user is notified which search term failed to match. Expanded the variety of metadata that is output to the dataframe and provided a set of preset collections of metadata through the outputType parameter, including scan numbers only Allowed for additional output metadata fields to be requested through the userOutputs keyword argument Implemented failing gracefully at multiple stages Implemented limited 'troubleshooting tips' on bad user input See signature docstring for full documentation and example functions.

pbeaucage · 2022-10-12T00:21:23Z

I'm going to reopen this one with the documentation-goodfirstissue labels solely because it would be really nice to have docs and test coverage for this function. Anybody that wants to take @BijalBPatel's outstanding, best-in-class example docstring for SST1RSoXSDB.summarizeRun() and start a Sphinx page for it, please do so!

Feel free to reach out to me or @pdudenas if anyone would like a hand getting started w Sphinx docs.

pbeaucage added a commit that referenced this issue Jun 5, 2022

Initial attempt #31: Add summarize_run to SST1RSoXS client

4bc1324

pbeaucage added a commit that referenced this issue Jun 5, 2022

Rudimentary docs for summarize_run #31

4901cd0

pbeaucage added a commit that referenced this issue Jun 5, 2022

Further toward #31, including parallelizing calls and more fields

b9c9bff

pbeaucage added documentation Improvements or additions to documentation good first issue Good for newcomers labels Sep 21, 2022

pbeaucage mentioned this issue Oct 8, 2022

Feat31 sst1 summarize bp #51

Merged

BijalBPatel linked a pull request Oct 9, 2022 that will close this issue

Feat31 sst1 summarize bp #51

Merged

BijalBPatel closed this as completed in #51 Oct 12, 2022

pbeaucage reopened this Oct 12, 2022

andrewjlevin self-assigned this Sep 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: SST1RSoXSDB: flexible tool for finding run numbers #31

feat: SST1RSoXSDB: flexible tool for finding run numbers #31

pbeaucage commented Jun 5, 2022

pbeaucage commented Sep 21, 2022

pbeaucage commented Oct 12, 2022 •

edited

Loading

feat: SST1RSoXSDB: flexible tool for finding run numbers #31

feat: SST1RSoXSDB: flexible tool for finding run numbers #31

Comments

pbeaucage commented Jun 5, 2022

pbeaucage commented Sep 21, 2022

pbeaucage commented Oct 12, 2022 • edited Loading

pbeaucage commented Oct 12, 2022 •

edited

Loading