Using ontologies to standardize
rare disease data collection
Nicole Vasilevsky
University of Colorado Anschutz Medical Campus
June 15, 2022
Overview of biomedical ontologies
How ontologies can be used for standardizing and integrating data and downstream analyses
Why you should contribute to ontology development efforts
How to contribute to ontologies
Overview
Documentation: tislab.org/ontologycontributor
Overview of biomedical ontologies
Prevailing clinical diagnostic pipelines leverage only a tiny fraction of the available data
Ontologies are systematic representations of knowledge that can be used to integrate and analyze large amounts of heterogeneous data
DOI: 10.1056/NEJMra1615014
Image credit: https://kyndi.com/blog/creating-knowledge-and-maximizing-the-value-of-data-with-ontologies/
Defining entities and the relationships between them in a way that allows computational logical reasoning
What
How
What is an ontology?
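A minimal sketch of what "computational logical reasoning" over defined entities and relationships can look like. It assumes a toy is_a hierarchy written as a Python dictionary; the term names and the helper functions `ancestors` and `is_subclass_of` are illustrative only and are not taken from any real ontology or library.

```python
# Toy is_a hierarchy; the term names are illustrative, not real ontology content.
IS_A = {
    "exercise-induced asthma": ["asthma"],
    "asthma": ["respiratory disease"],
    "respiratory disease": ["disease"],
    "disease": [],
}

def ancestors(term, is_a=IS_A):
    """Return every term reachable from `term` by following is_a edges."""
    seen = set()
    stack = list(is_a.get(term, []))
    while stack:
        parent = stack.pop()
        if parent not in seen:
            seen.add(parent)
            stack.extend(is_a.get(parent, []))
    return seen

def is_subclass_of(term, candidate, is_a=IS_A):
    """True if `candidate` is `term` itself or one of its inferred ancestors."""
    return term == candidate or candidate in ancestors(term, is_a)

if __name__ == "__main__":
    # Only the direct parent is asserted; the remaining ancestors are inferred.
    print(sorted(ancestors("exercise-induced asthma")))
    print(is_subclass_of("exercise-induced asthma", "disease"))
```

Only one parent per term is asserted here, yet a query about "disease" can still retrieve "exercise-induced asthma". That inference is what makes data annotated at different levels of detail comparable.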
A community of ontologists committed to a shared set of principles to build open biomedical ontologies.
http://obofoundry.org/
https://www.ebi.ac.uk/ols/index
Which is the right ontology to use?
Image credit: https://solutionsreview.com/business-process-management/why-is-process-improvement-so-important/
How ontologies can be used for standardizing and integrating data and downstream analyses
What is the Monarch Knowledge Graph?
doi.org/10.1093/nar/gkz997
Human Phenotype Ontology (HPO)
Over 15,000 phenotype terms
Ontologies are used in disease diagnostics
doi.org/10.1093/nar/gkz997
Legend
Perfect Match
Fuzzy Match
No Match
Differential diagnosis with similar but non-matching phenotypes is difficult
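The perfect, fuzzy, and no-match categories in the legend can be sketched in a few lines. This is a simplified illustration, not the matching algorithm used by any particular tool: it assumes a toy single-parent phenotype hierarchy (`PARENTS`, hypothetical) and treats two terms as a fuzzy match when one is an ancestor of the other.

```python
# Toy phenotype hierarchy (child -> parent); labels are illustrative only.
PARENTS = {
    "nuclear cataract": "cataract",
    "cataract": "abnormal lens",
    "abnormal lens": "abnormal eye",
    "short stature": "abnormal growth",
    "abnormal eye": None,
    "abnormal growth": None,
}

def lineage(term):
    """Return the term followed by its chain of parents up to the root."""
    chain = []
    while term is not None:
        chain.append(term)
        term = PARENTS.get(term)
    return chain

def classify(patient_term, disease_terms):
    """Classify one patient phenotype against a disease's phenotype set."""
    if patient_term in disease_terms:
        return "perfect"
    for disease_term in disease_terms:
        # Fuzzy: one term is a more general or more specific version of the other.
        if patient_term in lineage(disease_term) or disease_term in lineage(patient_term):
            return "fuzzy"
    return "none"

if __name__ == "__main__":
    disease_profile = {"cataract", "short stature"}
    for phenotype in ["cataract", "nuclear cataract", "abnormal growth"]:
        print(phenotype, "->", classify(phenotype, disease_profile))
```

Without the hierarchy, "nuclear cataract" and "cataract" would be unrelated strings. With it, a patient described in more specific or more general terms can still be matched to a candidate disease.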
Mondo Disease Ontology
https://www.medrxiv.org/content/10.1101/2022.04.13.22273750v3
Aligning disease knowledge across sources
Mondo aggregates synonyms and provides semantic mappings to source ontologies
How many rare diseases are there?
Overlap and unique rare disease concepts in 5 selected knowledge sources
Only 333 shared disease concepts in all five sources
Many diseases are in only one source
Intersection size
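Once each source's disease concepts have been mapped to a shared identifier, the overlap analysis reduces to set operations. The sketch below assumes that mapping has already been done and uses three hypothetical sources with placeholder identifiers ("D1", "D2", ...); it does not reproduce the five real sources or the figures quoted above.

```python
from collections import Counter

# Hypothetical sources, each reduced to a set of harmonized disease identifiers.
sources = {
    "source_a": {"D1", "D2", "D3"},
    "source_b": {"D1", "D3", "D4"},
    "source_c": {"D1", "D5"},
}

def shared_by_all(sources):
    """Concepts present in every source."""
    return set.intersection(*sources.values())

def unique_to_one(sources):
    """Concepts that appear in exactly one source."""
    counts = Counter(c for concepts in sources.values() for c in concepts)
    return {c for c, n in counts.items() if n == 1}

if __name__ == "__main__":
    print("Shared by all sources:", sorted(shared_by_all(sources)))
    print("In only one source:", sorted(unique_to_one(sources)))
```

The counts are only meaningful after harmonization: two sources that name the same disease differently would otherwise be counted as holding two unique concepts.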
Why you should contribute to ontology development efforts
Image credit: Nomi Harris, Monarch Initiative
Ontology development is a community effort
Ontologies are continuously iterated upon and improved
tislab.org/ontologycontributor
How to contribute to ontologies
Sign up for free at: www.github.com
Ontology issue trackers
A tracker is a place to put a formal ontology request
Trackers have long been used in the software community for keeping track of bugs, feature requests, etc.
Advantages:
Tracker IDs can be referenced in ontology metadata, such as in an editor note or definition annotation
How to request new terms (Mondo IDs) & changes
Synonym types
https://oboacademy.github.io/obook/reference/synonyms-obo/
Scope
Exact: An exact match. E.g. hereditary Wilms' tumor; exact synonym: familial Wilms' tumor
Related: A word or phrase that has been used synonymously with the primary term name in the literature, but the usage is not strictly correct. E.g. AGAT deficiency; related synonym: disorder of glycine amidinotransferase activity
Narrow: A more specific term. E.g. asthma; narrow synonym: exercise-induced asthma
Broad: A more general term. E.g. autoimmune hepatitis; broad synonym: autoimmune liver disease
Type
Excluded: Some synonyms are annotated with EXCLUDE, e.g. “NOS” (not otherwise specified) synonyms. It is useful to have these in the edit version, but they are filtered out on release.
Deprecated: We may also mark synonyms with DEPRECATED. E.g. all occurrences of “mental retardation” should be “intellectual disability”.
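The synonym scopes and types above can be modelled as a small data structure. The sketch below is an illustration under assumptions: the `Synonym` class and `release_synonyms` helper are hypothetical, the "asthma NOS" entry is invented for the example, and the serialized line follows the general OBO-format pattern of quoted text, scope, optional type, and a cross-reference list. The exact type identifiers a given ontology uses may differ.

```python
from dataclasses import dataclass
from typing import Optional

SCOPES = {"EXACT", "RELATED", "NARROW", "BROAD"}

@dataclass(frozen=True)
class Synonym:
    text: str
    scope: str
    syn_type: Optional[str] = None  # e.g. "EXCLUDE" or "DEPRECATED" (assumed names)

    def __post_init__(self):
        if self.scope not in SCOPES:
            raise ValueError(f"unknown scope: {self.scope}")

    def to_obo(self):
        """Serialize as an OBO-style synonym line with an empty xref list."""
        parts = [f'synonym: "{self.text}"', self.scope]
        if self.syn_type:
            parts.append(self.syn_type)
        parts.append("[]")
        return " ".join(parts)

def release_synonyms(synonyms):
    """Drop synonyms marked EXCLUDE, mirroring the release filtering described above."""
    return [s for s in synonyms if s.syn_type != "EXCLUDE"]

if __name__ == "__main__":
    asthma_synonyms = [
        Synonym("exercise-induced asthma", "NARROW"),
        Synonym("asthma NOS", "EXACT", "EXCLUDE"),  # hypothetical example entry
    ]
    for synonym in release_synonyms(asthma_synonyms):
        print(synonym.to_obo())
```

Keeping scope and type as separate fields reflects the distinction drawn above: scope says how the synonym relates to the term, while type records editorial status such as exclusion from the release.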
Requesting changes to an OBO ontology
Request new term (or changes) on GitHub
Curator adds the term to the ontology and creates a pull request (PR)
Pull requests undergo review and are merged; changes are added to the ontology-edit.obo file
Term is available in next release (varies between ontologies)
OLS is updated approximately 7 days after release
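The first step of this workflow, requesting a new term on GitHub, goes more smoothly when the request arrives in a consistent shape. The sketch below assembles a title and body for such a ticket. The field names and the `build_term_request` helper are assumptions made for illustration; each ontology's issue tracker may provide its own template, which should take precedence.

```python
def build_term_request(label, definition, parent, synonyms=(), references=()):
    """Assemble a title and body for a new term request ticket.

    The fields are a hypothetical minimum; check the ontology's own
    issue template before submitting.
    """
    lines = [
        f"Preferred term label: {label}",
        f"Definition: {definition}",
        f"Parent term: {parent}",
        "Synonyms: " + (", ".join(synonyms) or "none"),
        "References: " + (", ".join(references) or "none"),
    ]
    return {"title": f"New term request: {label}", "body": "\n".join(lines)}

if __name__ == "__main__":
    # Placeholder values; replace with the details of the real request.
    request = build_term_request(
        label="<term label>",
        definition="<textual definition>",
        parent="<parent term label or ID>",
        synonyms=["<synonym 1>", "<synonym 2>"],
    )
    print(request["title"])
    print(request["body"])
```

The resulting text can be pasted into the tracker's new issue form. Stating the parent term and references up front gives the curator what is needed to create the term and open the pull request.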
✔
Community and expert advice
YOU!
Application of ontologies in RDCA-DAP
Use of ontologies in the RDCA-DAP
Disparate data types
Data standardization using ontologies
Standardized data is accessible in a cloud interface
Actionable rare disease drug development solutions
RDCA-DAP Workflow
Summary
Overview of biomedical ontologies
Structured knowledge covering a specific domain
Ontologies can be used to standardize and integrate data
E.g. phenotype, disease, genotype data for rare diseases
Why you should contribute to ontologies
Community resource, need expertise from various areas
How you should contribute to ontologies
GitHub - open a free account
Chris Mungall
Lawrence Berkeley National Lab
Acknowledgements
Melissa Haendel
University of Colorado
David Osumi-Sutherland
European Bioinformatics Institute
Nico Matentzoglu
Semanticly
This content is available at: https://oboacademy.github.io/obook/
Anne Thessen
University of Colorado
Thanks!
You can find me at:
nicole@tislab.org
@n_vasilevsky
Documentation:
https://oboacademy.github.io/obook/pathways/ontolgoy-contributor-c-path/
10% of the US population has a rare disease
80% of rare disease cases are genetic
Phenotype to disease annotations can be used for rare disease diagnoses
Disease to phenotype annotations
155,624 rare disease - phenotype annotations
136,268 common disease - phenotype annotations
Phenotype-driven Exome Analysis
Validated for the most difficult GEL diagnoses; top candidate correct in 67% of cases and executes in under 1 minute.
Exomiser
V 10.0
March 2018
bit.ly/exomiser-10
Recommendations for GitHub tickets/new term requests
General Recommendations:
We appreciate your contributions to extending and improving our ontologies
Formatting:
Writing Ontology Definitions
https://philpapers.org/archive/SEPGFW.pdf
Term search and request workflow
Adapted from:
https://douroucouli.wordpress.com/2021/07/03/how-select-and-request-terms-from-ontologies/
Search using search strings
Return term list
Assume no results…
Examine list
Confirm, these are analogous to what you need
Check the parent ontology
Examine tickets
This is the term you are looking for
Check definitions and parent ontologies
Reads ticket
Make new term locally
Search for similar terms
Sibling terms list
Search for relevant discussion
Return tickets
Post term request
You
Portal
GitHub
Curator
New term ID
Check
Term ID
Acknowledge
Make new ontology release