ADC-IARPC Vocabularies and Semantics Working Group
Minutes
- Participants
- Rebekah Ingram, Paul Thompson, Chantelle Verhey, Bill Manley, Øystein Godøy, Pier Luigi Buttigieg, Mark Schildhauer
- Adoption of the agenda
- Adopted
- Review of the minutes of the previous meeting
- 2021-06-15 (The meeting in August was cancelled)
- Accepted, to be published.
- Review of open action items
- Action: Pier Luigi will put the open letter on Zenodo once signed.
- Pending on finalisation of the letter.
- Action: SiriJodha is missing from signatories, Øystein to poke him.
- SiriJodha has signed.
- Action: Øystein will close the open letter for new signatures 2021-06-30.
- Øystein closing this now.
- Action: Chantelle and Bill will give an update on the merged documents with semantic resources in the next meeting.
- Scheduled for presentation in this meeting.
- Action: Peter to demonstrate the online site containing the matrix of metadata harvesting relationships in the next meeting.
- Scheduled for presentation in this meeting.
- Peter not present, but Rebekah reported.
- Update from the Polar Data Forum IV (Rebekah/Chantelle)
- Rebekah updated that the letter was discussed in the hackathon. It has been announced that it will be published on Zenodo.
- The PDF4 semantics agenda
- https://docs.google.com/document/d/1YEUib7k44HdK5xa-KUzepjWbz0ThBK20vw0o4J-GujQ/edit
- A lot of discussions on not reinventing the wheel, i.e. utilising existing activities and processes.
- Bill found the hackathon and generation of the overview of semantic resources useful. People need guidance on how to approach this issue.
- During the hackathon there was a gap analysis initiated.
- Brief status on distribution of the open letter with signatures
- Spreadsheet for tracking distribution
- Usage pending publication of the letter.
- Øystein to close the open letter and leave the document to Pier Luigi for publishing in Zenodo.
- This allows for future updates using meta DOI.
- Details of the metadata to attach will be discussed using Slack and email.
- Pier Luigi will initiate a discussion using Slack on which metadata to use.
- Decided to complete the process around the letter using Slack and email. No need to wait until next meeting. Øystein to follow up and push to Pier Luigi.
- Status on merging documents with semantic resources (Chantelle and Bill)
- Chantelle has merged resources from Bill into her document.
- This was further discussed during the PDF IV hackathon and some updates were done there.
- The merged document is available at https://docs.google.com/spreadsheets/d/1mTvCH2995l73uqJa2Lt3KaIqsGxdy8CdraF7wPQc1D4/edit#gid=753078636
- Chantelle has continued to update the document following the hackathon.
- The document contains several tabs representing different updates (including the original document) and includes linkages to I-ADOPT activities.
- Ontologies that are not actively maintained should be identified/tagged in the document.
- Pier Luigi suggested adding the list to the Semantic Resources for Earth and Environment (SeREEn) branch to get ESIP involved. This could be a polar branch of the SeREEn activity.
- It's just an idea now (with a repo) that will emulate the OBO registry, but for Earth science
- it's in YAML, so easy to edit
- https://github.com/ESIPFed/SeREEn
- Marking expressivity level could be a good way to enrich the list Chantelle has created. Maintenance using Git will make downstream use easier.
- Mark commented
- Many of the listed “vocabularies” are not necessarily specialized for polar-specific terms, and many aspects of polar will be represented in more general vocabs. E.g. I don’t see schema.org on our list, yet that is a vocab that will be useful for broad scale data interoperability despite having no arctic-specific terms.
- How to continue and actively progress in this field should be further discussed in the next meeting.
- Action: Øystein to add this to the agenda.
- If moving to Git, discussions on how to work and not lose part of the community is needed.A procedure could involve using Google forms, issues and PR. To be further discussed.
- Some relevant links
- The OBO registry http://www.obofoundry.org/ which is auto-generated by the YAML in their repo.
- this is the YAML
- https://github.com/OBOFoundry/OBOFoundry.github.io/blob/master/registry/ontologies.yml
- Demonstration of the online representation of the matrix of harvesting relationships (Rebekah)
- Both the actual metadata harvesting and the presentation of the matrix of harvesting relationships are In prototyping, probably able to show something during 1-2 months.
- Rebekah is actively working on crosswalks between metadata standards and also translation of vocabualries using SKOS/OWL.
- Mark gave an update on how DataOne is doing crosswalks. He also referred to Science on schema.org.
- This topic is to be addressed in future meetings.
- Øystein to add to agenda.
- Following the discussion on AI during the last meeting, Pier Luigi suggested that this group should engage in https://en.unesco.org/artificial-intelligence.
- See e.g. https://en.unesco.org/artificial-intelligence/ethics for ethics aspects.
- AI is questioned in some contexts but is also something we need to relate to. Pier Luigi mentioned that the ethical perspectives have to be strengthened and representativeness sought. This group is open to participation. Pier Luigi mentioned that the Arctic community could give valuable input to this activity as most communities are concerned with machine learning and not knowledge representation. This group could bring the indigenous perspective into the discussion. SiriJodha indicated some interest in the previous meeting.
- Discussion continued from the last meeting...
- This was not covered in the meeting, keeping it for the next meeting.
- Awareness updates (roundtable)
- No time.
- On hold, to be addressed if time
- Polar Semantics Planning Matrix updates - Peter and Ruth
- The matrix is available at Matrix of metadata harvesting relationships
- The matrix and other materials from this family of meetings are being documented in two papers:
- PDPS White Paper Draft - Harvesting Portion (Peter)
- Matrix with additional fields as discussed during a WG meeting in 2020 for consideration on the new triple store to support this activity.
- https://docs.google.com/spreadsheets/d/1m_xBrpW6EqqZ4oRzc5kZkpt4c8h6qqGmJIVcd9BJCpc/edit#gid=1643486333
- Update on status
- PDPS White Paper Draft - Recommendations Portion (Rut)
- Update on status
- Discussion of the concepts data, information and knowledge and whether existing definitions we are working with applies in a broad sense, including local and traditional/indigenous knowledge. - Rebekah
- Discussion of issues raised in the 2020-12-15 meeting.
- Concepts being discussed now are:
- Data
- A set of values, symbols, or signs (recorded on any type of medium) that represent one or more properties of an entity. For example, the numbers generated by a sensor, values derived from a model or analysis, text entered into a survey, or the raw text of a document
- Information
- Products derived from data that lead to a greater understanding of an entity. For example, (i) the interpretation of a range of data from an array of conductivity sensors across the Arctic Ocean that informs us about that ocean’s salinity range or (ii) the narrative text of a report on harmful algal blooms that informs the reader on the timing of these blooms.
- Knowledge
- An abstract representation (i.e. a mental model) of an entity which: (i) is constructed from a substantial collection of information, (ii) grants its bearer reliable familiarity with that entity, and (iii) can be used to reason and take action about that entity. For example, an expert with knowledge about the salinity range of the Arctic Ocean (constructed from large amounts of information on the topic) would be able to reason that a salinity value of 43% is a likely error, rather than a real measurement.
- It was in the previous meeting we agreed that a living process that can be adapted as we continue engagement is required. How do we address this?
- It was in the previous meeting suggested that a joint statement towards e.g. Arctic Council outlining activities, goals, objectives, requests etc. is developed together with other relevant groups. How do we address this?
- Following discussions on the 2020-08-18 meeting, discuss definitions of data and datasets that could be used in the Arctic Community - Rebekah/Pier Luigi
- Background material
- The UN Decade of Ocean Science for Sustainable Development Implementation Plan v2.0. (See glossary for definitions of data, information, knowledge, and digital knowledge ; note we had to tone down the technical aspects as this document goes to the GA) : https://www.oceandecade.org/assets/uploads/documents/Ocean-Decade-Implementation-Plan-Version-2-0-min_1596634145.pdf
- IWG-SODIS https://iode.org/index.php?option=com_content&view=article&id=598:inter-sessional-working-group-to-propose-a-strategy-on-ocean-data-and-information-stewardship-for-the-un-ocean-decade-iwg-sodis&catid=65&Itemid=89
- A dedicated channel in the Slack space has been set up to capture discussions between meetings.
- A paper/report on the Polar Vocabularies Questionnaire - Pier Luigi
- Pier Luigi is exploring this with support from Ruth, Mark and Øystein.
- Link to responses: Polar Vocabularies (Responses)
- See previous minutes for details.
- Update on status
- Regular interoperability workshops debriefs and plans - Rebekah/Øystein
- Lessons learned in the September 2nd workshop that should be further discussed
- https://github.com/POLDER-Crew
- Polar practices recommendations for implementing schema.org is the short term goal
- Next meeting
- Agreed to do the third Tuesday of the month, at 19:00 UTC (21:00 CEST, 15:00 EST, 13:00 MST).
- Next meeting will be Tuesday 19th October.
- Øystein will create a calendar invite.
- Since not everyone is using Slack, emails should be circulated for each meeting.