12-14 November 2024, Costa Rica and Online
RDA 23rd Plenary Meeting (RDA P23) | Sustainable Science
Vocabulary Services IG �(aka VSSIG, Vocabulary and Semantic Services IG): �All About Terms and Ontologies (You Can Help!)
Collaborative Notes: https://docs.google.com/document/d/1MhGaGDISwufM_vMEAY9XJV2GFRqs54XA �Vocabulary and Semantic Services IG: https://www.rd-alliance.org/groups/vocabulary-services-interest-group
RDA 23rd Plenary Meeting
www.rd-alliance.org
Agenda
RDA 23rd Plenary Meeting
www.rd-alliance.org
Housekeeping for all attendees
Please complete the table Attendee Check-in to indicate your attendance in the Collaborative Notes: here
Reminder: The session is being recorded
There will be time for questions after each talk, so please:
Participants in the room ask questions using the microphone
Virtual participants ask questions on the chat starting with @nameOfPresenter and following with your question
RDA 23rd Plenary Meeting
www.rd-alliance.org
VSSIG group: Who we are
by Alexandra Kokkinaki and the other VSSIG co-chairs:
Yann Le Franc, John Graybeal, Juliane Schneider
RDA 23rd Plenary Meeting
www.rd-alliance.org
Semantic artefacts and FAIR guiding principles
I2: (Meta)data use vocabularies that follow the FAIR principles
task forces, groups and projects focusing on
#semantic interoperability #semantics#ontologies #vocabularies #semantic artefacts
RDA 23rd Plenary Meeting
www.rd-alliance.org
The VSSIG past & present
RDA 23rd Plenary Meeting
www.rd-alliance.org
Vision
Our vision is to harmonize our ways of working/current practices with Semantic Web technologies through Knowledge and Expertise exchange across all domains.
Join the group: https://www.rd-alliance.org/groups/vocabulary-services-interest-group/work-statement/
RDA 23rd Plenary Meeting
www.rd-alliance.org
Groups
Working group
Task Groups
BoFs
RDA 23rd Plenary Meeting
www.rd-alliance.org
Collaboration
Focus on collaboration
where semantics are involved
RDA 23rd Plenary Meeting
www.rd-alliance.org
I-ADOPT: Latest achievements and Activities�
Gwenaëlle Moncoiffé
12-14 November 2024, Costa Rica and Online
RDA 23rd Plenary Meeting (RDA P23) | Sustainable Science
Motivation:
Led by a core group made of 4 co-chairs (Barbara Magagna, Gwenaëlle Moncoiffé, Anusuriya Devaraju and Maria Stoica) plus Sirko Schindler and Alison Pamment, with regular contributions from many others.
“InteroperAble Descriptions of Observable Property Terminologies”
The RDA I-ADOPT Working Group�Active work: 2019-2021 - Status: Maintaining Deliverables
RDA I-ADOPT recommendations�DOI: 10.15497/RDA00071�
The I-ADOPT Ontology �https://w3id.org/iadopt
Highlights of latest activities�
When? Challenge 1 (16-29 Sept 2024), Challenge 2 (14-18 Oct 2024)
How many participated? 24 people/16 groups/different domains, but all data scientists
What was the purpose?�Reveal similarities and differences in the way different people use the I-ADOPT Framework to describe the same set of variables (30 variables in total)
What are the outcomes?
The I-ADOPT Variable Modeling Challenge�https://i-adopt.github.io/challenge.html�
First Challenge:
provided variables: 30
counted variables: 10 best
winner principle: highest scores
challenge period: two weeks
analysis: the more people model the better is their result
Second Challenge:
provided variables: 10 (highest variance)
counted variables: 5 of high quality > 12pts
winner principle: all with 5 quality variables
challenge period: 5 days
Add. material: DO’s and DON’Ts
analysis: Modelling tips help to make less errors, but the result is better the more is modelled
First versus Second Challenge
The I-ADOPT Variable Modeling Questionnaire�Answers from the participants�
“some terminologies don’t resolve”
“providing meaningful labels”
“several options to model”
Why is I-ADOPT useful?
I-ADOPT Core team:
- Barbara Magagna (GO FAIR Foundation, NL)
- Gwenaëlle Moncoiffé (National Oceanography Centre - British Oceanographic Data Centre, UK)
- Anusuriya Devaraju (CSIRO, AU)
- Maria Stoica (University of Colorado, Boulder, US)
- Sirko Schindler (German Aerospace Center, DE)
- Alison Pamment (Centre for Environmental Data Analysis/Sience and Technology Facilities Council, UK)
External reviewers:
- Markus Stocker (TIB - Leibniz Information Centre for Science and Technology, GER)
- John Graybeal (Information Technology Consultant, US)
- Rob Atkinson (Open Geospatial Consortium, AU)
The I-ADOPT Challenge: Reviewer Team�
Challenge material:
- An introduction video to the I-ADOPT Framework
- A step-by-step guide for creating an I-ADOPT compliant variable description (video)
- Instructions for submitting modeled variables including participation and scoring rules (slides)
- The list of variables to be selected
- A questionnaire for background information about participants
Submission formats based on templates:
- human readable: Excel, text�- machine-readable: nanopublication, turtle
The I-ADOPT Challenge: Material prepared
Evaluation sheet includes:
I-ADOPT Challenge variables visualzed in the I-ADOPT Variable Examples repository
GitHub issues for discussing variable models
The I-ADOPT Challenge: Results
First Challenge:
1st prize - €400 plus 2 virtual attendance fees for RDA 23: �Anne Fouilloux and Jean Iaquinta (212.75 points)
2nd prize - €300 plus 1 virtual attendance fee for RDA 23: �Andrea Tarallo, Martina Pulieri, Christina Di Muri, and Jessica Titocci (208.5 points)
3rd prize - €200: �Jörg Klausen, Morgan Silverman and Gao Chen (170.6 points)
The RDA TIGER Programme sponsored the attendance fees for the winners.
Qualified for the Second Challenge getting each €100:
Margherita Martorana, Samantha Blakeman, Petra ten Hoopen, Naouel Karam, �Juliana Menger, Saurav Kumar, and Norbert van Dijk
The I-ADOPT Challenge: Winners!
Metadata Task Group (Clement Jonquet)
Metadata for Ontology Description and Publication Ontology
Main focus of this task group was/is to develop MOD, as standard vocabulary to describe semantic artefacts.
⇒ https://github.com/FAIR-IMPACT/MOD
Work done since 2022 in the context of the EOSC project FAIR-IMPACT
RDA 23rd Plenary Meeting
www.rd-alliance.org
MOD
RDA 23rd Plenary Meeting
www.rd-alliance.org
MOD example
RDA 23rd Plenary Meeting
www.rd-alliance.org
Now MOD v3.2
RDA 23rd Plenary Meeting
www.rd-alliance.org
MOD-API - specification of an API for semantic artefact catalogues
RDA 23rd Plenary Meeting
www.rd-alliance.org
A Simple Standard for Ontological Mappings 2024:
A quick guide for getting started with publishing better entity mappings @ RDA’s 23rd plenary
Contributors: Alasdair Gray, Alex Wagner, Amelia L. Hoyt, Andrew Williams, Anita Caron, Anne Thessen, Benjamin M. Gyori, Bill Baumgartner, Cassia Trojahn, Charlie Hoyt, Chris Mungall, Chris T. Evelo, Christopher Chute, Clement Jonquet, Damien Goutte-Gattat, Damion Dooley, Davera Gabriel, David Osumi-Sutherland, Emily Hartley, Ernesto Jimenez-Ruiz, Harold Solbrig, Harry Caufield, Harshad Hegde, Henriette Harmse, HyeongSik Kim, Ian Braun, Ian Harrow, James Malone, James McLaughlin, James Overton, James P. Balhoff, James Stevenson, Javier Millán Acosta, Jiao Dahzi, Joe Flack, John Graybeal, Jooho Lee, Julie McMurry, Kori Kuzma, Kristin Kostka, Lauren Chan, Melissa Haendel, Melissa Haendel, Monica Munoz-Torres, Nicolas Matentzoglu, Nicole Vasilevsky, Nomi Harris, Núria Queralt-Rosinach, Sabrina Toro, Sebastian Koehler, Shahim Essaid, Sierra Moxon, Simon Jupp, Sophie Aubin, Sue Bello, Sujay Patil, Sven Hertling, Thomas Liener, Tiffany Callahan, Tim Putman, Vinicius de Souza, William Duncan
https://w3id.org/sssom
What are entity mappings?
28
“Friedreich's Ataxia”
OMOP:441554
Entities are symbols, such as codes in a terminology, classes in an ontology, permissible values in a data model, identifiers in a database or simply strings in a text field that are intended to refer to a real world thing.
The anatomy of a SSSOM-style semantic entity mapping
are insufficient
29
SUBJECT
PREDICATE
OBJECT
�subject_id:
EFO:10000070
�object_id:
MONDO:0006071�
�object_label:
adenofibroma�
�subject_label:
Adenofibroma�
�predicate_id:
skos:exactMatch
JUSTIFICATION
mapping_justification: semapv:LexicalMatching
subject_match_field: rdfs:label�object_match_field: oio:hasExactSynonym
match_string: adenofibroma
mapping_date: 2022-12-13
reviewer_id: orcid:0000-0002-7356-1779
mapping_tool: wikidata:Q64360017
confidence: 0.8
JUSTIFICATION
mapping_justification: semapv:ManualMappingCuration
author_id: orcid:0000-0002-7356-1779
confidence: 0.8
SSSOM aims to
Provide a simple model to capture mapping evidence and provenance
30
30
mapping_set_id: https://w3id.org/sssom/commons/mouse-human/mappings/mp_hp_mgi_all.sssom.tsv
mapping_set_title: All mappings of MP terms to HPO terms generated by MGI
mapping_set_description: "Consolidated list of all HPO to MP mappings done by MGI…."
creator_id:
- orcid:0000-0003-4606-0597
license: https://creativecommons.org/licenses/by/4.0/
object_source: obo:hp
subject_source: obo:mp
curie_map:
HP: http://purl.obolibrary.org/obo/HP_
MP: http://purl.obolibrary.org/obo/MP_
orcid: https://orcid.org/
obo: http://purl.obolibrary.org/obo/
Mapping Table
What entity mappings are covered?
31
MONDO:0006071
Type 1: lexical token - identifier
Type 2: identifier - identifier
Type 3: complex
EFO:1000070
MONDO:0006071
adenofibroma
Hypertensive heart disease without congestive heart failure
modifies
Not
Congestive heart failure
AND
Hypertensive heart disease
Experimental
SSSOM is not for data structure mappings / schema crosswalks
New kid on the blog: LinkML-Map for �FAIR schema crosswalks!�https://linkml.io/linkml-map/
PERSON:001 ;
rdf:type my:Person ;
my:Name “Chris Mungall” .
PERSON:001 ;
rdf:type schema:Person ;
schema:givenName “...?...” ;
schema:familyName “...?...” .
But of course we can use SSSOM metadata to annotate crosswalks!
subject_id | predicate_id | object_id |
my:Name | skos:narrowMatch | schema:givenName |
my:Name | skos:narrowMatch | schema:familyName |
???
We will integrate this with RDA FAIR Mappings effort!
The FAIR mappings community aims to�Promote the creation of interoperable FAIR mapping registries
33
m1.sssom.tsv
m2.sssom.tsv
m3.sssom.tsv
b.sssom.tsv
Registry
Shared QC, � automatic reconciliation
Wrong mapping!
Collaborative curation
SSSOM 1.0: The data model is now stable
With support from 79 community members contributing 1230 comments, 154 pull requests and 13 releases over the past 4 years, SSSOM 1.0 was finally released in August 2024.
34
2 functional, independent and interoperable implementations:
Database (Oxford), Volume 2022, baac035, �https://doi.org/10.1093/database/baac035
Acknowledgements and call to action
35
Funding
Phenomics First (NIH / NHGRI #1RM1HG010860-01): Spec, sssom-py CLI��Monarch (NIH / OD #5R24OD011883): outreach, knowledge graph integration
Bosch Gift to LBNL: sssom-py IO, testing, converters, tutorials
DARPA: Young Faculty Award W911NF2010255�(PI: Benjamin M. Gyori)
�
�Community contributions: https://w3id.org/sssom
Learn about SSSOM!
1
Spread the word
4
Set up a SSSOM Mapping Registry for your community
3
Publish your entity mappings in SSSOM format and share with RDA FAIR Mappings WG
2
FAIR Mapping WG
Yann Le Franc, PhD
CEO e-Science Data Factory/ Head of EUDAT Secretariat
RDA 23rd Plenary Meeting
www.rd-alliance.org
Where did we start?
Two workshops about mappings and crosswalks during the RDA P21/International Data Week conference
RDA 23rd Plenary Meeting
www.rd-alliance.org
Where did we start?
Two workshops about mappings and crosswalks during the RDA P21/International Data Week conference
LOADS OF COMMUNITY FEEDBACKS
RDA 23rd Plenary Meeting
www.rd-alliance.org
Where are we now?
RDA 23rd Plenary Meeting
www.rd-alliance.org
Which problems do we address ?
RDA 23rd Plenary Meeting
www.rd-alliance.org
What are our planned outputs?
RDA 23rd Plenary Meeting
www.rd-alliance.org
Working Session: Identifying and Prioritizing Evaluation Methods for Choosing Terms and Ontologies (John Graybeal & OntoChoice Team)
RDA 23rd Plenary Meeting
www.rd-alliance.org
About the OntoChoice Project: �Evaluations for Choosing Terms and Ontologies
Project Goals
Work to Date
RDA 23rd Plenary Meeting
www.rd-alliance.org
Introducing Evaluations Team and Facilitators
RDA 23rd Plenary Meeting
www.rd-alliance.org
Planned Activity: What’s Happening Here?
RDA 23rd Plenary Meeting
www.rd-alliance.org
Overview of Working Document
🟢 0A. Introduction
🟢 0B. Categories of Evaluation Criteria (Relevance, Popularity/Reuse, Best Practices, Governance/Intl)
🔶⭕ 1. Category Descriptions and Detailed Criteria: Relevance Group (John)
🟢🔶 2. Category Descriptions and Detailed Criteria: Popularity and Reuse Group (Anna Maria)
⭕ 3. Category Descriptions and Detailed Criteria: Best Practices and Analytics Grp (Hande)
🔶🟢 4. Category Descriptions and Detailed Criteria: Governance & Internationalization (Asiyah)
🔶 5. Existing Evaluation Systems, Technologies, and Models
⭕ 6. Recommended Evaluation Facets (from the list in captures 1-4)
🔶 7. Use Cases to Discuss
⭕ 8. Evaluation Guidance
🔶 10. Resources and Bibliography
🟢 : first-draft complete. (Additions/improvements still welcome.) 🔰 : useful text and guidance but needs work. 🔶 : outline topics/example only. ⭕ : little useful content.
Our�Focus�Today
RDA 23rd Plenary Meeting
www.rd-alliance.org
The Categories of Evaluation Criteria
RDA 23rd Plenary Meeting
www.rd-alliance.org
Breakout Plan
RDA 23rd Plenary Meeting
www.rd-alliance.org
Today’s Goals
✅ Expose participants to the project and its progress
Create content and feedback about this document!
RDA 23rd Plenary Meeting
www.rd-alliance.org
What You Will Do (in 25 minutes!)
RDA 23rd Plenary Meeting
www.rd-alliance.org
BREAKOUT SESSIONS AND LINKS
Starred (*) breakouts are facilitated. Two-star (**) breakouts may be facilitated.
These links go to the breakout copy of that chapter. And choose your Zoom breakout session with the same title. (See this slide at link bit.ly/ontochoice.)
RDA 23rd Plenary Meeting
www.rd-alliance.org
RDA 23rd Plenary Meeting
www.rd-alliance.org
Return from Breakout: How was that for you?
RDA 23rd Plenary Meeting
www.rd-alliance.org
THANK� YOU
RDA 23rd Plenary Meeting
www.rd-alliance.org