-
Notifications
You must be signed in to change notification settings - Fork 4
[DATASET ROADMAP] convert public bases #219
Copy link
Copy link
Open
Labels
help wantedExtra attention is neededExtra attention is needed
Milestone
Description
Let's try to make a curated list of datasets that we could convert to PLAID. Here's a first draft:
- Vortex shedding (MGN paper)
- Airfoil dataset (MGN paper)
- Deforming plate (MGN paper)
- Flag dataset (MGN paper)
- DrivAerNet [Elrefaie et al., 2024]
- Ahmed body
- Datasets from the GEO-FNO paper available here: elasticity, plasticity, airfoil naca, pipeflow.
- Datasets used in OTNO paper
- The FlowBench 3D Lid-Driven Cavity dataset hosted on HuggingFace
- Datasets used in this benchmark paper
- PDEArena
- BubbleML
- BLASTNet
- PDEBench
- The Well
- PINNacle
- May be some interesting cases here
- CFDBench
- Dataset from Fourier Neural Operator with Learned Deformations for PDEs on General Geometries
Add @fabiencasenave list as checklist
I think we should converge on the following points before mass conversion:
MANDATORY:
- globals in CGNS trees to replace scalars and time series -> PR ♻️ introduce global in CGNS trees to replace scalars and time_series mechanism #229
- Assess the links/paths mechanism -> PR ♻️feat(links/paths) remove links/paths and CGNS linking capacities #239
- Use of Hugging Face's native split -> PR ♻️ Modernize the Hugging Face bridge #240 , still time variable samples to implement.
- Problem_definition: indicate in and out split concerned by the regression task and name the score function for the moment (maybe later we should find a way to define an implementation), see next comment.
OPTIONAL
- Clarify the status of the time in
feature_identifiers.time=allmeans all time steps are considered by the feature ? Or no mention for trying to access all existing time steps for this feature ? What aboutsample.get_all_features_identifiers()? - Enforce the use of
feature_identifiersin problem_definitions (in the yaml as well) and replacenodefeature by 'mesh' (the complete support, with nodes, elements and tags, at a particular base and zone). Proposal: flattened keys (already implemented in the Huggingface bridge) for feature ids ? - [ARCHITECTURE UPDATE] update data organisation #241 should also be addressed for the plaid part - with the mandatory part above, it will be implemented totally for the HF format
Important note
We just tneed to converge on a representation on HuggingFace to start converting other bases.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
help wantedExtra attention is neededExtra attention is needed