Skip to content

[DATASET ROADMAP] convert public bases #219

@bstaber

Description

@bstaber

Let's try to make a curated list of datasets that we could convert to PLAID. Here's a first draft:

Add @fabiencasenave list as checklist

I think we should converge on the following points before mass conversion:

MANDATORY:

OPTIONAL

  • Clarify the status of the time in feature_identifiers. time=all means all time steps are considered by the feature ? Or no mention for trying to access all existing time steps for this feature ? What about sample.get_all_features_identifiers() ?
  • Enforce the use of feature_identifiers in problem_definitions (in the yaml as well) and replace node feature by 'mesh' (the complete support, with nodes, elements and tags, at a particular base and zone). Proposal: flattened keys (already implemented in the Huggingface bridge) for feature ids ?
  • [ARCHITECTURE UPDATE] update data organisation #241 should also be addressed for the plaid part - with the mandatory part above, it will be implemented totally for the HF format

Important note

We just tneed to converge on a representation on HuggingFace to start converting other bases.

Metadata

Metadata

Assignees

No one assigned

    Labels

    help wantedExtra attention is needed

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions