Closing the gap between effective biocuration and meaningful ontology development
Annual General Meeting of the International Society for Biocuration, 18th October 2023
Nicolas Matentzoglu
The role of ontologies in Biocuration
The role of ontologies in Biocuration
MP:0009906
The role of ontologies in Biocuration
MP:0009906
MP:0009906
The role of ontologies in Biocuration
increased tongue size
MP:0009906
abnormal tongue morphology
MP:0000762
What genes are associated with abnormal tongue morphology?
The role of ontologies in Biocuration
Example of phenotypic profile matching, see doi: 10.1002/cphg.92
Mutual dependence
needs to deal with changes to ontology, needs updates to terms
needs feedback and integrated data to be truly valuable
Database
Ontology
The Gap
We need 5000 new terms for protein level measurements in urine, e.g.
“glucose level in urine”.
Ehm, until when?
Asap.
The Gap
Ahhhh, until when?
In 4 weeks.
We are going to obsolete 500 classes.
Closing the gap
What can we do to make direct contributions easier?
Push ontology curation as far down the expert hierarchy as possible
12
4
Domain Experts
All Biocurators
3
Ontology Engineers
1
Ontology Developers
availability
Required familiarity with ontology engineering
2
Thank you Sue Bello for advising!
The amazingness of standardised open ontology development systems
Generate standard git repository
editors file
release files
imports
Social workflows:
CI/CD:
Executable workflows:
Design Patterns and spreadsheets
defined_class | cargo | membrane | start | end |
GO:0098713 | leucine | plasma membrane | extracellular membrane | cytosol |
Change languages and widgets in curation interfaces
1
2
3
Embed curation widgets directly in ontology browsers!
Any proposal opens an issue on the ontologies issue tracker!
The issue gets automatically translated into a pull request.
https://incatools.github.io/kgcl/
Using hackathons with domain experts as “sprints” to curate sections of the ontology
“Based on the survey results it seems that most people did not contribute because no one had asked them.” (R. Mazumder)
The Socio-technological side…
From [isb-biocuration] Examples of successful community curation models for databases? (Fri, 6 Oct
OBO Academy: Training materials for bio-ontologists
Running online seminar series and ontology trainings to increase confidence
https://bit.ly/obo-academy
Seed funding from:
ISB also has similar great resource: https://www.biocuration.org/dissemination/biocuration-training-materials/
Raising awareness for FAIR, open data and ontologies and its impact on the world
“[w]e neither have staff dedicated to this initiative only, nor any specific funding. We are all intrinsically motivated to improve the situation” (R. Giessmann)
Thanks to Sabrina Toro, Sue Bello, Nicole Vasilevsky, Zoe Pendlington, Ray Stefancsik, Chris Mungall for your help researching for this talk. Mistakes are all my own.
Thank you,
and the amazing
Open Ontology
(General Concepts)
Open FAIR data annotated using ontologies
Open community of contributors and users
Slide adapted from Chris Mungall, “Pistoia 2023.10.11 - Open Ontologies in the Biomedical Domain” - The triangle of success
Community curation: strategies for eliciting engagement
“[w]e neither have staff dedicated to this initiative only, nor any specific funding. We are all intrinsically motivated to improve the situation” (R. Giessmann)
“The DisProt team are also developing Apicuron [...] to provide community curators recognition for contributions at ORCID.” (V. Wood)
“Many people will curate if they understand the benefits, and it is made as easy as possible to do. Communication is key.” (V. Wood)
“some funding, awards, travel fellowships, authorship in publications.” (Taner Z. Sen)
“Based on the survey results it seems that most people did not contribute because no one had asked them.” (R. Mazumder)
From [isb-biocuration] Examples of successful community curation models for databases?
(Fri, 6 Oct)
Do we really need a term for “abnormally increased levels of Athenian particulate matter in the lung”? - Shared semantic schemas
abnormal
Modifier
increased amount
particulate matter
located in
Characteris.
Entity
lung
originating from
Athens
This is a stupid example, but do we really need terms for abnormally increased or decreased levels for the entirety of ChEBI? Or UniProt?