In the implementation of `checkSpatialTemporal` (once #71 is merged), we determine whether a column is spatial or temporal (its semantic type) from the first `ProfilerConfig.NUM_RECORD_READ` records. If all of those values are null, the column's semantic type is set to `NONE`, even though non-null values may exist later in the table.
We could either drop all null values before passing the data into the `PreAnalyzer`, or modify the `estimateSemanticType` function so that it retries determining a column's semantic type when every value it read was null.
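To make the first option concrete, here is a minimal sketch that samples the first few non-null values of a column instead of the first few records. It is not the project's code: `NUM_RECORD_READ`, `is_null`, `sample_non_null`, `estimate_semantic_type`, and `toy_classify` are hypothetical stand-ins written for illustration, and the definition of "null" (None, NaN, or a blank string) is an assumption.

```python
from datetime import datetime
from itertools import islice

# Hypothetical stand-in for ProfilerConfig.NUM_RECORD_READ.
NUM_RECORD_READ = 3


def is_null(value):
    """Assumed null definition: None, NaN, or a blank string."""
    if value is None:
        return True
    if isinstance(value, float) and value != value:  # NaN is never equal to itself
        return True
    return isinstance(value, str) and value.strip() == ""


def sample_non_null(column, limit=NUM_RECORD_READ):
    """Return up to `limit` non-null values, scanning past leading nulls."""
    return list(islice((v for v in column if not is_null(v)), limit))


def toy_classify(sample):
    """Toy classifier: TEMPORAL if every value parses as YYYY-MM-DD."""
    try:
        for value in sample:
            datetime.strptime(str(value), "%Y-%m-%d")
    except ValueError:
        return "NONE"
    return "TEMPORAL"


def estimate_semantic_type(column, classify=toy_classify, limit=NUM_RECORD_READ):
    """Classify from non-null values; NONE only if the whole column is null."""
    sample = sample_non_null(column, limit)
    if not sample:
        return "NONE"
    return classify(sample)


column = [None, None, None, "2020-01-01", "2020-02-01"]

# Looking only at the first NUM_RECORD_READ records leaves nothing to classify.
head = [v for v in column[:NUM_RECORD_READ] if not is_null(v)]
print(head)  # []

# Sampling non-null values finds the dates further down the column.
print(estimate_semantic_type(column))  # TEMPORAL
```

Because the generator is consumed lazily, the scan stops as soon as enough non-null values are found, so the extra cost is only paid for columns that start with nulls. A column that is entirely null is still scanned to the end before it is reported as `NONE`.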
Has this issue already been resolved? I tried to reproduce it on a CSV file by setting NUM_RECORD_READ to 1 (every value in the first row of the file is null), but the columns were still labeled without any problem. Do you have a way to reproduce the issue?