You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
, and I think just searching there are a number of cases in OAK where these are occur, seems dangerous to me. @joeflack4 just uncovered a case where we passed in a oboInOwl prefix to semsql, which resulted in lexmatch no longer being able to understand that oboInOwl:hasExactSynonym (which was used in the ontology) is, in fact, the same as oio:hasExactSynonym. There are various ways to solve this problem:
No code exists where curies are defined that cant be overwritten by the user. In the ex
A standardised "oak context" exists to which all incoming information is standardised before processing, or, the other way around, OAK entities in the code are standardized (using curies.Converter.standardize()) against an incoming prefix map.
We could require that incoming semsql ontologies must be standardised against the OAK context (prefix map).
None of this is particularly easy - (3) is probably easiest, but we would have to give some tool support, like
runoak normalise-prefixes -i ont.db.
The text was updated successfully, but these errors were encountered:
Currently, you can pass a prefix map (currently only non-EPM bimap supported; prefixes.csv) when creating a SemSQL DB. Are we saying that this prefix map can have additional entries not already in the OAK context so long as there is no conflict (i.e. a URI prefix which is assigned in the to a different prefix than OAK has assigned)?
Couldn't we just interpret such conflicts as prefix synonyms and maybe throw a warning to the user?
prefixes.csv is actually a "one way epm in disguise" the same prefix can be mapped to multiple URL prefixes. See my comments in #699 for what I think the best solution would be. The key issue here is not the EPM - it is that prefix assumptions are hardcoded in the code. All entities in the code should be cycled through a standard epm before being used (say, "curies.Converter.standardise("oio:hasDbXref")" or something similar. Ideally, --epm can always be passed in to all oak commands to replace the default epm, which re-serialises the built-in curies prior to usage.
Having pieces of code like
ontology-access-kit/src/oaklib/utilities/lexical/lexical_indexer.py
Line 52 in d139e99
oboInOwl:hasExactSynonym
(which was used in the ontology) is, in fact, the same asoio:hasExactSynonym
. There are various ways to solve this problem:None of this is particularly easy - (3) is probably easiest, but we would have to give some tool support, like
runoak normalise-prefixes -i ont.db
.The text was updated successfully, but these errors were encountered: