Not duplicating meta data when we upload a derived dataset to SciCat. #27
Replies: 2 comments 4 replies
-
There is a subset of metadata that we can and should copy to the derived dataset; the sample in particular. |
Beta Was this translation helpful? Give feedback.
-
I generally agree. But the frontend is unfortunately a long way off supporting this properly. Also think about the metadata in the file: We will still write the sample information into it (where possible). So, to make it findable, that metadata should also be in the SciCat dataset. I agree that we cannot blindly copy everything. All I am saying is that we should duplicate some metadata. And that we have to make informed decisions about it when writing workflows. |
Beta Was this translation helpful? Give feedback.
-
Context
I was in a meeting with SWAT team about imaging workflows.
Data analysis software needs to search the derived dataset(reduced data) on SciCat based on the meta data of input datasets.
It's because data-analysis tools will need to use some physical characteristics or description about the sample, and those information will be in the metadata.
And I just talked to Max in SIMS team, and he said they can provide a simple query that can search
derived
dataset based oninput-dataset-metadata
.Discussion Point
One of the suggested solutions is to just copy+paste
input-dataset-metadata
intoderived
dataset so that it's self-containing.But we concluded that it should be avoided to copy the metadata, since metadata can be updated relatively easily, and then there is a risk that the metadata between
input dataset
and thederived dataset
don't match.I thought it's worth documenting this here since it might be relevant for all instruments.
Beta Was this translation helpful? Give feedback.
All reactions