Skip to content

[DATASET ROADMAP] convert public bases #219

@bstaber

Description

@bstaber

Let's try to make a curated list of datasets that we could convert to PLAID. Here's a first draft:

Add @fabiencasenave list as checklist

I think we should converge on the following points before mass conversion:

MANDATORY:

OPTIONAL

  • Clarify the status of the time in feature_identifiers. time=all means all time steps are considered by the feature ? Or no mention for trying to access all existing time steps for this feature ? What about sample.get_all_features_identifiers() ?
  • Enforce the use of feature_identifiers in problem_definitions (in the yaml as well) and replace node feature by 'mesh' (the complete support, with nodes, elements and tags, at a particular base and zone). Proposal: flattened keys (already implemented in the Huggingface bridge) for feature ids ?
  • [ARCHITECTURE UPDATE] update data organisation #241 should also be addressed for the plaid part - with the mandatory part above, it will be implemented totally for the HF format

Important note

We just tneed to converge on a representation on HuggingFace to start converting other bases.

Metadata

Metadata

Assignees

No one assigned

    Labels

    help wantedExtra attention is needed

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions