Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multiple synonyms with only capitalization differences #8495

Open
joeflack4 opened this issue Dec 16, 2024 · 1 comment
Open

Multiple synonyms with only capitalization differences #8495

joeflack4 opened this issue Dec 16, 2024 · 1 comment
Assignees
Labels

Comments

@joeflack4
Copy link
Collaborator

joeflack4 commented Dec 16, 2024

Overview

There are cases where a Mondo term has 2+ synonyms that differ only in capitalization. This shouldn't happen. We need to merge these into a single synonym declaration.

Examples

  1. Capitalized word(s). No acronyms.
id: MONDO:0005764
synonym: "follicular Dendritic cell sarcoma" EXACT [DOID:6262, NCIT:C9281]
synonym: "follicular dendritic cell sarcoma" EXACT [MONDO:patterns/location]
  1. Abbreviation
id: MONDO:0010916
name: polycystic kidney disease 3 with or without polycystic liver disease
synonym: "PKD3" EXACT ABBREVIATION [MONDO:Lexical, OMIM:600666]
synonym: "Pkd3" EXACT [DOID:0110860]
  1. Acronym within multi-word synonym
id: MONDO:0011225
name: severe combined immunodeficiency due to DCLRE1C deficiency
synonym: "SCID due to ARTEMIS deficiency" EXACT [Orphanet:275]
synonym: "SCID due to artemis deficiency" EXACT [DOID:0090012]

Additional info

Context in which this has come up:

@matentzn
Copy link
Member

Possible QC query:

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX oboInOwl: <http://www.geneontology.org/formats/oboInOwl#>
PREFIX MONDO: <http://purl.obolibrary.org/obo/MONDO_>

SELECT DISTINCT ?mondo_id ?label ?synonym1 ?synonym2 WHERE {
  VALUES ?synonym_type { oboInOwl:hasExactSynonym oboInOwl:hasRelatedSynonym oboInOwl:hasBroadSynonym oboInOwl:hasNarrowSynonym }
  
  ?mondo_id rdfs:subClassOf <http://purl.obolibrary.org/obo/MONDO_0000001> ;
            rdfs:label ?label ;
            ?synonym_type ?synonym1;
            ?synonym_type ?synonym2 .
  
  FILTER (lcase(str(?synonym1)) = lcase(str(?synonym2)) && ?synonym1 != ?synonym2)
}

Over 2000 hits: https://api.triplydb.com/s/qXVEsJ27J

@joeflack4 joeflack4 assigned twhetzel and unassigned matentzn Dec 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants