1 of 36

Research workflows and digital data sets:

traceability, reproducibility, comparability issues illustrated on spatio-acoustic data acquisition protocols

Iwona DUDEK

UPR 2002 CNRS MAP

FR-IT Bilateral Cooperation in Heritage Science – 7th Edition

DIVING INTO DIGITAL DATA FOR HERITAGE SCIENCE: DIAGNOSTICS, UNDERWATER HERITAGE, AND SOUND

Marseille, Centre National de la Recherche Scientifique (CNRS), Campus Joseph Aiguier 

February 7th, 2024

2 of 36

Workflow descriptions imply choice of a language

“HARD“

LANGUAGES

“SOFT”

LANGUAGES

ethnic languages

approximately 4000

poesy

dance

abstract

painting

programming

languages

formal

logic

polyinterpretation

S. LEM, Tajemnica chińskiego pokoju, Kraków 1996

A. KOBRZYCKI, The role off the language in a perceptual process, New York 1951

“context-free” languages

the meaning is highly context-dependent

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

3 of 36

Workflow descriptions imply choice of a language

“HARD“

LANGUAGES

“SOFT”

LANGUAGES

poesy

dance

abstract

painting

programming

languages

formal

logic

polyinterpretation

S. LEM, Tajemnica chińskiego pokoju, Kraków 1996

A. KOBRZYCKI, The role off the language in a perceptual process, New York 1951

“context-free” languages

the meaning is highly context-dependent

ethnic languages

approximately 4000

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

4 of 36

Workflow descriptions imply choice of a language

poesy

dance

abstract

painting

programming

languages

formal

logic

polyinterpretation

ethnic languages

approximately 4000

diagrammatic representations based on drawing conventions

conceptual modelling

controlled vocabulary

establishing rules or parameters

determining /identifying the essential qualities or meaning of concepts

A schematic representation explains parts, operations and relationships between elements.

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

5 of 36

Structure of the presentation

  • brief outline of the Memoria IS objectives
  • illustration of these concepts by means of examples
  • discussion about advantages and limits of the system
  • clarification of the underlying concepts

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

6 of 36

exploratory web-based information system

Projet SESAMES

ANR-18-CE38-0009-01

Mémorisation de ressources numériques et d‘activités

Record-keeping of digital resources and activities

traceability

reproducibility

replicability

comparability

verifiability

Ministère de la Culture

DRESTDépartement de la Recherche, de l’Enseignement Supérieur et de la Technologie, Ministère de la Culture

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

7 of 36

main objectives

allow for the description of research results and their production history

output

publication

composition

metadata

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

8 of 36

main objectives

allow for the description of research results and their production history

different types of results

sequences of actions that led to their creation

paradata

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

9 of 36

chain - a succession of activities over time (displaying their order)

photographic documentation

followed by selection of photographs for storage

their annotation (metadata)

digital archiving of the entire collection

...

underlying concepts

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

10 of 36

excavation carried out simultaneously in three trenches

parallel (sequences of) activities - activities that run concurrently in time

underlying concepts

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

11 of 36

knot of activities - activities that take place simultaneously

or occur without any apparent sequential order

combined with subsurface observation

hand excavation

underlying concepts

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

12 of 36

iterative activities - sequence of operations is repeated on the same object

until the desired result is obtained

combined with subsurface observation

hand excavation

followed by photographic documentation

and inventory of stratigraphic units

(measurements and documentation)

The whole sequence is repeated until an artefact is discovered

underlying concepts

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

13 of 36

repetitive activities - repetition of the same operations on multiple objects

(done once for each item)

site cover up and backfilling

carried out successively trench by trench

underlying concepts

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

14 of 36

data collection and acquisition

data filtering and treatment

data analysis

added value procedural activities

finalisation

underlying concepts

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

15 of 36

data collection and acquisition

data filtering and treatment

data analysis

added value procedural activities

finalisation

archaeological excavation sub-group

added to our system to explore its potential in archaeology

underlying concepts

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

16 of 36

Organising activities in a workflow diagram > drag-and-drop web interface

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

17 of 36

Description of individual activities

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

Once an activity is characterised, a background colour is added to its icon .

18 of 36

Paradata depicting each activity is stored within in the workflow structure

freely accessible, searchable, editable

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

19 of 36

Spatio-acoustic data acquisition

acoustic data acquisition

spatio-acoustic data acquisition

SESAMES project (Villeneuve lez Avignon, Jean Salusse room)

Sesames seminar (Villeneuve lez Avignon, Jean Salusse room)

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

http://memoria-dev.gamsau.archi.fr/is/enter.php?show=process&_op=set&id=113

http://memoria-dev.gamsau.archi.fr/is/enter.php?show=process&_op=set&id=116

20 of 36

Spatio-acoustic data acquisition

archaeological fieldworks

process described on the basis of an archaeological report

Chapelle Notre‐Dame de Bethléem,

Commune de Bras, Var, 2017

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

https://sandbox.memoria.map.cnrs.fr/is/enter.php?show=process&_op=set&id=118

http://memoria-dev.gamsau.archi.fr/is/enter.php?show=process&_op=set&id=116

acoustic data acquisition

Sesames seminar (Villeneuve lez Avignon, Jean Salusse room)

21 of 36

Spatio-acoustic data acquisition

preparation of spatio-acoustic acquisition

Sesames seminar (Villeneuve lez Avignon)

design and production of online panoramas (360°)

with sound tracks

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

acoustic data acquisition

Sesames seminar (Villeneuve lez Avignon, Jean Salusse room)

http://memoria-dev.gamsau.archi.fr/is/enter.php?show=process&_op=set&id=102

http://memoria-dev.gamsau.archi.fr/is/enter.php?show=process&_op=set&id=115

http://memoria-dev.gamsau.archi.fr/is/enter.php?show=process&_op=set&id=116

22 of 36

SESAMES project

(Villeneuve lez Avignon, Gothic room)

interdisciplinary acquisition process carried out by 5 people working in sub-groups

Spatio-acoustic data acquisition

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

Villeneuve lez Avignon 2022

23 of 36

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

results

acoustic data acquisition

documentation

instrument positioning

metric and visual data acquisition

Spatio-acoustic data acquisition

actions

Villeneuve lez Avignon 2022

interdisciplinary acquisition protocol

24 of 36

instruments positioning

Memoria workflow diagram

Spatio-acoustic data acquisition

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

25 of 36

documentation

instruments positioning

Spatio-acoustic data acquisition

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

Memoria workflow diagram

26 of 36

acoustic data acquisition

and pre-processing

documentation

instruments positioning

Spatio-acoustic data acquisition

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

Memoria workflow diagram

27 of 36

acoustic data acquisition

and pre-processing

documentation

instruments positioning

laser distance measurement

photogrammetric acquisition

Spatio-acoustic data acquisition

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

Memoria workflow diagram

28 of 36

Spatio-acoustic data acquisition

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

29 of 36

Information Visualisation

acoustic recordings

video report

photogrammetric image set

disto measurements

photographic documentation

outputs

sweep production

protocol preparation

inputs

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

workflow diagram

activities' proportion ring

30 of 36

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

Information Visualisation

process variability chord diagram

project A

project B

31 of 36

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

Information Visualisation

process variability chord diagram

project A

project C

32 of 36

Potential of the system

MEMORIA is an exploratory web-based prototype, an attempt to provide a methodological framework to meet the requirements of scientific integrity.

  • documentation framework enables : tractability of results, verifiability of research protocols, comparability and analysis of different protocols, ... replicability and reproducibility of results

  • support of the planning of interdisciplinary research protocols based on past experiences

  • theoretical/tacit knowledge-sharing environment

  • innovative visual analytical framework

  • new documentation framework

by requiring more reflection, it can help us to work more effectively,

more consciously and less routinely (human inertia)

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

33 of 36

Main constraints and limits of the system:

MEMORIA remains a human-centred system

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

Acceptability constrains:

  • quality and reliability of the visualisations depend strongly on the data entered

falsehood can be inferred from incomplete or partly unreliable data

Technical constrains:

  • time-consuming documentation process, steep learning curve for some people

differences in individual capacities, difficulty in determining what is important to document,

lack of skills in graphic interpretation, ...

  • unreadiness to openly share information about our mistakes, errors, shortcomings

- so that they cannot be turned against us

If you shut the door to all errors, truth will be shut out.

Rabindranath Tagore

34 of 36

Main constraints and limits of the system:

Technical constrains:

MEMORIA remains a human-centred system

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

  • sustainability of web-based information systems

the constant evolution of the technology requires continuous mobilisation to ensure functionality of such systems - maintenance, development, user support and funding

  • costly and complex development steps

35 of 36

future works

we'll be happy to discuss with you later today

36 of 36

Memoria Information System

for UMR 3495 CNRS/MC MAP

(open consultation)

Memoria sandbox

familiarization platform

(user account available without restrictions)

Thanks for your attention!

Iwona DUDEK

UPR 2002 CNRS MAP

iwona.dudek@map.cnrs.fr

I. Dudek (UPR2002 CNRS MAP) FR-IT Bilateral Cooperation in Heritage Science – 7th Edition,, Marseilles 07/02/2024

MEMORIA - classification of activities

online PDF , 334 pages

(in English)

http://memoria-dev.gamsau.archi.fr/projet/index.php