Since many properties could identify input, output, and output_annotations, I think it is best to use hashes to identify inputs, outputs, and list_of_example_annotations.
I imagine that each data entry will have a hash:
input
- input_idx # determines the dataset, split, and particular example, and how the example was preprocessed/rendered by factgenie, etc.
output
- input_idx # reference to the exact input which was used for generation
- output_idx # uniquely identifying the output
annotations_list
- output_idx # uniquely identifying which output was annotated
- annotations_idx # uniquely identifying the annotation list
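To make the idea concrete, here is a minimal sketch of how the three hashes could be chained. This is only an illustration, not how factgenie currently works: the helper names (`content_hash`, `make_input`, `make_output`, `make_annotations_list`) and the fields `dataset`, `split`, `example_idx`, `rendering`, `model`, `text`, and `annotations` are all hypothetical. It assumes every entry can be serialized to JSON and uses SHA-256 over a canonical JSON string.

```python
import hashlib
import json


def content_hash(payload: dict) -> str:
    """Return a SHA-256 hex digest of a JSON-serializable dict.

    Keys are sorted so that the same content always yields the same hash,
    regardless of the order in which the keys were inserted.
    """
    canonical = json.dumps(
        payload, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def make_input(dataset: str, split: str, example_idx: int, rendering: str) -> dict:
    # Hypothetical fields: everything that determines the exact input.
    entry = {
        "dataset": dataset,
        "split": split,
        "example_idx": example_idx,
        "rendering": rendering,
    }
    return {"input_idx": content_hash(entry), **entry}


def make_output(input_idx: str, model: str, text: str) -> dict:
    # The output hash covers input_idx, so it depends on the exact input.
    entry = {"input_idx": input_idx, "model": model, "text": text}
    return {"output_idx": content_hash(entry), **entry}


def make_annotations_list(output_idx: str, annotations: list) -> dict:
    # The annotation-list hash covers output_idx, so it depends on the exact output.
    entry = {"output_idx": output_idx, "annotations": annotations}
    return {"annotations_idx": content_hash(entry), **entry}


if __name__ == "__main__":
    inp = make_input("some_dataset", "test", 0, "default")
    out = make_output(inp["input_idx"], "some_model", "Generated text.")
    ann = make_annotations_list(out["output_idx"], [{"start": 0, "type": "error"}])
    print(inp["input_idx"], out["output_idx"], ann["annotations_idx"], sep="\n")
```

Because each hash includes the hash of the entry it was derived from, changing the preprocessing of an input changes `input_idx`, which in turn changes the `output_idx` and `annotations_idx` of anything regenerated from it.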
Our dataset management can be illustrated by the dependencies between entries, that is, by how each entry is generated from another.