1 of 37

2022 Vocabulary Symposium and Workshop

Report to AVSIG Meeting. December 6, 2022.

Lesley Wyborn, Megan Wong*, Steve McEachern, Kheeran Dharmawardena, Simon Cox, Rowan Brownlee*

2 of 37

Acknowledgement of Country

We acknowledge and celebrate the First Australians on whose traditional lands we meet, and pay our respect to the elders past, present and emerging.

3 of 37

Overview

  • Organisational partners
    • Collaboration underpinning these events. CODATA, ARDC & ADA
  • Symposium
    • What was it about?
    • Who attended & where did they come from?
    • Presentations & keynotes
    • Q & A, vocab list, links to outputs
  • Workshop
    • The focus of each day’s activities informing the foci of a roadmap
    • Planning the roadmap, staring it, taking it forward (next steps proposed by Workshop participants)

4 of 37

Organisational partners

  • Australian Data Archive (ADA)
  • Australian Research Data Commons (ARDC)
  • CODATA (Committee on Data of the International Science Council)

Organising Committee, Members from:

  • Australian Data Archive, Australian National University;
  • Australian Research Data Commons;
  • CSIRO Environmental Informatics;
  • Department of Climate Change, Energy, the Environment and Water and Cytrax;
  • Federation University Centre of eResearch and Digital Innovation (CeRDI)
  • Support team ANU and ARDC - thank you!

5 of 37

Vocabulary Symposium

  • Open access
  • Onsite at ANU & online
  • 2 Days of presentations & discussion
  • AU & international
  • What’s happening within & across domains?
  • What may we learn, reuse, apply and develop?

6 of 37

Who attended?

  • 59 onsite registrations
  • 181 online registrations

7 of 37

Who attended?

  • Universities & other research organisations
  • Cultural heritage organisations
  • NCRIS facilities
  • State & Federal Government

8 of 37

Who attended?

  • Science, health & medicine, indigenous, social sciences, earth & environmental, humanities, arts
  • New Zealand, Japan, Iraq, Spain, Italy, Indonesia, Ireland, Slovenia, Finland, France, Germany, Israel, India, Canada, USA

9 of 37

Presentations

  • 2 keynote presentations
    • 1 was online. 1 was onsite
  • 17 session presentations
    • 11 were online. 6 were onsite

10 of 37

What did the presentations cover?

  • International Vocabularies and services
  • Vocabularies in Australia
  • Finding, managing and reusing vocabularies
  • Systems and tools for publication and access
  • Applying vocabularies within and across domains
  • Interoperability frameworks
  • Co-creation of inclusive vocabularies
  • Vocabularies in support of privacy, security, consent
  • Vocabulary governance

11 of 37

Keynotes

CODATA - Simon Hodson

  • International Scientific Unions & Terminologies
  • Governance & sustainability
  • GOSC, WorldFAIR, FIPS, FERS & interoperability frameworks

ARDC - Adrian Burton

  • Knowledge infrastructure for impactful research
  • Catalogs, identifiers & vocabularies
  • Data connections across organisations & sectors, spanning inputs, activities & outputs

12 of 37

We asked questions of the participants

13 of 37

We asked questions of the participants

14 of 37

We asked questions of the participants

15 of 37

We asked questions of the participants

How would you describe the current vocabulary landscape in Australia?

16 of 37

We asked questions of the participants

What are the main challenges that make meeting your, or your stakeholders’ vocabulary needs difficult ?

17 of 37

We asked questions of the participants

What is missing from the Australian vocabulary landscape in 2022?

18 of 37

We asked what vocabularies they knew about

19 of 37

Links to Symposium outputs

  • Symposium website and program.
    • Program includes links to presentation abstracts, slides and recordings
  • Link to the vocabulary list
  • Updates to the AVSIG discussion list
  • Updates also via ARDC, ADA & CODATA communication channels

** Wrap-up discussion point. What of those communities and data that are left out by current approaches (have not been connected or included)? Requires discussion

20 of 37

Workshop Aim: Develop a program for how we move from the current state of Australian vocabularies toward a future state that meets next generation vocabulary requirements (including FAIR and well governed).

Vocabulary Workshop. Held November 16 - 18 2022

Research School of Social Sciences, Australian National University, Canberra

21 of 37

Focus questions

Orientation around 5 question

  • What is our current vocabulary landscape and what does it need to look like in the future?
  • What would constitute the optimum future state?
  • How we can work towards the future state,
  • What we need to do to be able to harmonise and converge our thinking
  • How might we achieve sustainability and longevity of the critical services within the vocabulary landscape so that these will be there in 10 years time.

22 of 37

Participants -

  • Invited participants from across thematic areas (based on ARDC Thematic Research Data Commons areas of People, Planet, HASS and Indigenous )

  • Also representatives from generic (cross domain) areas such as service, tooling provision and geospatial

  • 21 participants with organisational or research affiliations including (but not exhaustive):
    • AIATSIS, ARDC, AODN, ADA, ALA, APPF, ANU, AURIN, AgReFed, ANSIS, CSIRO Land and Water, CSIRO Australian e-Health Research Ctre, Cytrax, CSIRO, CODATA, DDI, DAWE, FedUni, Indigenous Data Commons, IMOS, Kurrawong, NCI, New Zealand, Permanent Committee on Place Names, Qld geological survey, Statistics University of Melbourne, TERN.

23 of 37

Workshop structure

Day 1: Wednesday 16th NovemberContext Setting and Mapping the Current Landscape

Day 2: Thursday 17th NovemberPlanning to roadmap

Day 3: Friday 18th NovemberThe Path Forward

Roughly 8 sessions each day

Its a hands on workshop

24 of 37

Day 1 Context Setting and Mapping the Landscape

  • to collectively map the Australian vocabulary ecosystem (and relevant international engagements).

  • intent of this mapping
    • To identify and map what is available now, services and relationships, and how they meet, or could meet, Australia’s ‘next generation vocabulary requirements’ . This included domain and generic vocabularies, infrastructures, relevant documents and communities.

25 of 37

Activities and outputs of Day 1: Context Setting and Mapping the Landscape

Activity 1. Introductions

Activity 2a Context setting. Presentation by ARDC. Adrian Burton

Activity 2b. Presentation by Andrew Hancock.

  • StatsNZ, Tatauranga Aotearoa - Principal Analyst
  • Chair, UN Committee of Experts on International Statistical Classifications
  • Presented on Modernisation of Statistical Classifications
  • See Andrew’s talk on topic recorded from AVSIG meeting Sept 2022 on YouTube

26 of 37

Activities and outputs of Day 1: Context Setting and Mapping the Landscape

Thematic areas of People, Planet, HASS and Indigenous, as well as ‘Generic’ (ie broadly applicable cross-theme, including geospatial)

A session focused on vocabularies. Each thematic group reviewed-

  • The responses to questions posed at eResearch BOF and the Vocabulary Symposium
  • Where are Australians utilising International Vocabulary resources?
  • When should we have local, national vs international vocabularies. Is there a consensus?

List of vocabs viewed throughout Critical ones missing Or b. Multiple versions of important vocabularies

27 of 37

Activity 4. Mapping the landscape of vocabulary. Focus: Infrastructures

Each thematic group reviewed the area of infrastructures. Questions poised -

How many vocabulary repositories are there - how many of their assets are FAIR?

  • How many portals are there - are they FAIR?
  • How many vocabulary aggregators are there in Australia? RVA, others?
  • How many systems are there to edit/manage vocabularies? What is their stability/sustainability?
  • What are the connections of any of these efforts with international equivalents?
  • What gaps we may have compared to international infrastructures?

28 of 37

How many systems are there to edit/manage vocabularies? What is their stability/sustainability?

  • Not reviewed in detail. See previous workshop output spreadsheet

  • ARDC would support an output that has demonstrated community support/ownership/value.

  • Home for if of interest to community? Eg - Lesley previously mentioned a proposed VSSIG session - future activities/focus of VSSIG

  • Ideas welcome if from creator of this list and those with suggestions or interest in taking forward

29 of 37

Report back, e.g.

30 of 37

Day 2 Aim

  • Groups were across domain
  • Discussion and activities aimed to:
    • Inform road map planning for day 3

31 of 37

Day 2 Toward planning for a roadmap: Landscape - documents

  • Reviewed and Scoped Documents that influence Vision and roadmap (support or constrain)

  • groups/communities of practice that are also working on vocabularies were identified.

  • Then, the participants were asked if they knew of any other documents that could either
    • Inspire the roadmap
    • Constrain the roadmap

  • All identified likely to support rather than constrain

  • Some may also inform vocabulary best practice (Privacy Acts, Codes of Ethics)

32 of 37

Just some of the influential documents…..

33 of 37

Day 2 Toward planning for a roadmap: Landscape, continued

Current planning documents in Australia

  • RVA review and RVA Roadmap 2019-2024

https://archive.ardc.edu.au/resource/review-of-research-vocabularies-australia/

Communities of Practice

Existing in Australia

  • Australian Vocabulary Special Interest Group (AVSIG)
  • AGLDWG (Australian Government Linked Data Working Group)

Those that could be actively engaged with

34 of 37

35 of 37

Day 2.Stakeholder Mapping

36 of 37

Planning to Roadmap approaches to roadmap anvassed, key components identified

37 of 37

A path forward…..

Day 3. Activity 2. Beginning to Roadmap

Cross-domain groups begun to flesh out components, report back

DAY 3: Activity 3. The Path Forward (Friday, half-day)

focused on establishing an agenda for furthering the vocabulary ecosystem.

This included:

    • Interest in developing a road map for vocabularies and semantic resources in Australia was established
    • Initial working group to continue rough draft - workshop attendees Dec 2022 (ambitious!). With draft to go to communities for comment from first quarter 2023
    • Timeframe Roadmap to cover - 1 yr outcomes, 5 yr outcomes
    • Need for an advocacy group for this roadmap - AVSIG was flagged as most appropriate - broader Stakeholders than AGLDWG (Government Agency focus)