-
Notifications
You must be signed in to change notification settings - Fork 8
Documenting current consensus on possible technical solutions
John's and Miloš's diagrams serve as far points that need to be brought together before spec writing can start
The situation is really a triangle with Lemon<>Lex-0 zero being more fine grained, Miloš and Lemon being more prescriptive in allowed relationships -> better for industry interoperability
Should not be modelled as subsenses, should be its own headwords. Needs a cut between example sentences and idioms that constitute their own entries..
Should not be modelled as subsenses either, a referencing method at the headword/entry level should be developed. Possibly similar to Wikipedia disambiguation page..
DMLex could be homograph agnostic, this information is addressed automatically via the identity/identification Need a cut between homonymy and polysemy..
Link exists at Entry level, has source and target selector, also an attribute for type of relationship
Instead of free form text, values driven by an authority should be exchanged
lex:NMTOKEN
lex:
will be the prefix reserved for LEXIDMA TC DMLex
The general structure of the POS information should be:
[authority]:NMTOKEN
possible authority prefix examples:
[universal part of speech tagging]
upos:
clarin:
lemon:
Morphology object with possible value pairs
@POS="[authority]:NMTOKEN"
@formsLink="[formsURI]"
example: @relationshipType="imperative"