/similarity/frequency incorrectly returns zero for "owl:Thing" and corpus taxa #380
Comments
This currently leads to a test failure in Rphenoscape (phenoscape/rphenoscape#153).
@balhoff what's a reasonable ETA for getting this fixed?
The special semantics of
Yes, I think we can safely filter it out of any semantic similarity-related results, including the subsumer matrix. It is far from the only upper-level term that is commonly in the union for subgraph similarity. (However, this means you should not include it in your own Jaccard calculation either, or our scores will start to diverge.) If someone does pass it into
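To make the point about keeping the two Jaccard calculations in sync concrete, here is a minimal sketch. It assumes subsumers are represented as plain sets of IRI strings. The function name `jaccard` and the term names in the usage note are made up for illustration and are not taken from the KB or Rphenoscape code.

```python
# Hypothetical sketch: Jaccard similarity over subsumer sets, with
# owl:Thing dropped from both sides before computing the ratio.
OWL_THING = "http://www.w3.org/2002/07/owl#Thing"


def jaccard(subsumers_a, subsumers_b, exclude=frozenset({OWL_THING})):
    """Return |A ∩ B| / |A ∪ B| after removing the excluded terms."""
    a = set(subsumers_a) - exclude
    b = set(subsumers_b) - exclude
    union = a | b
    if not union:
        # Nothing left to compare; treat as no similarity (an assumption).
        return 0.0
    return len(a & b) / len(union)
```

With made-up subsumer sets {owl:Thing, X, Y} and {owl:Thing, Y, Z}, excluding owl:Thing gives 1/3, while including it (passing `exclude=frozenset()`) gives 2/4. That difference is the divergence that appears if only one side filters the term.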
Should I post this as a separate (generic) issue?
That sounds good. I will plan to fix at least this one this week, and see what else needs changing.
We are filtering out owl:Thing from return values: see #383. That should prevent this from being an issue in the future.
"http://www.w3.org/2002/07/owl#Thing" is one of the subsumers returned in the subsumer matrix. However, when it is used as input for /similarity/frequency, the count returned is zero rather than the corpus size.
This needs to be fixed on the server end. On the client side I shouldn't have to guess whether zero is somehow a magic number meaning the size of the corpus, or whether the occurrence count really is zero.
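For illustration, here is a toy sketch of why the expected count is the corpus size. It assumes the frequency of a term is the number of corpus items that have that term among their subsumers. The function name `term_frequency`, the taxon labels, and the term IRIs are all invented for this example and do not reflect the server implementation.

```python
# Hypothetical sketch: frequency of a term in a corpus, counted as the
# number of corpus items whose subsumer set contains the term.
OWL_THING = "http://www.w3.org/2002/07/owl#Thing"


def term_frequency(term, corpus_subsumers):
    """Count corpus items that have `term` among their subsumers."""
    return sum(1 for subsumers in corpus_subsumers.values() if term in subsumers)


# Made-up corpus: each taxon maps to the subsumers of its annotations.
# Every class is subsumed by owl:Thing, so it appears in every set.
corpus = {
    "taxon_1": {OWL_THING, "http://example.org/term/A"},
    "taxon_2": {OWL_THING, "http://example.org/term/A", "http://example.org/term/B"},
    "taxon_3": {OWL_THING, "http://example.org/term/C"},
}

thing_count = term_frequency(OWL_THING, corpus)
```

Under these assumptions `thing_count` equals `len(corpus)`, so a returned zero cannot be interpreted as a real occurrence count.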