1 of 26

Open science at EMBL: Introduction and Overview

30 November 2022

Victoria Yan (ORCID: 0000-0003-1982-9145)

Bastian Drees (ORCID: 0000-0003-3508-602X)

EMBL Office for Scientific Information Management

EMBL Open Science training for new Group Leaders

2 of 26

What is not Open Science?

Slide adapted from Anna Kreshuk

3 of 26

Why Open Science?

Problems and Crises:

  • Reproducibility
  • Affordability (serials crisis)
  • Reusability
  • Transparency
  • Metrics and assessment
  • Communication gap (science – society)
  • ...

The Turing Way Community, & Scriberia. (2021). Illustrations from the Turing Way book dashes. Zenodo. https://doi.org/10.5281/zenodo.5706310

4 of 26

What is Open Science?

Gallagher, et al. Open Science principles for accelerating trait-based science across the Tree of Life. Nat Ecol Evol 4, 294 (2020). https://doi.org/10.1038/s41559-020-1109-6

UNESCO Recommendations on Open Science (2021)

5 of 26

EMBL aims to be a leader and innovator in open science in terms of the scientific results it produces as well as in the way research in molecular biology is performed.

Open Science aims to make scientific research accessible and transparent, and to remove barriers.

EMBL research is open

6 of 26

Open Science policy at EMBL

  • Research articles, data, software

  • Show progress and create spirit

  • Spirit of the policy: transparent, open, trustworthy, FAIR

  • Supporting guidelines

  • Policy in line with funders

7 of 26

Office for Scientific Information Management

Szilárd Library

EMBL Archive and Records Management

Open Science Support

8 of 26

Office for Scientific Information Management (OSIM)

https://www.embl.org/internal-information/news/topic/we-are-embl

https://www.embl.org/news/lab-matters/welcome-maria-papanikolaou/

9 of 26

EMBL Open Science Policy - Publications

Publish all articles initially as a preprint.

Publish Open Access with a CC-BY licence.

Link publications to ORCID during submission.

Deposit in EPMC within 6 months of publication.

Standard EMBL affiliation acknowledgment

10 of 26

Credit and visibility of all research outputs

  • Register on ORCID, add your iD to the SAP HR system
  • Use ORCID iDs during submissions
  • Claim past publications on EuropePMC
  • Claim preprints, datasets, methods, and other research outputs

11 of 26

Central Article Processing Charge (APC) Budget

Open Access

  • EMBL lead author
  • no grant can cover APCs
  • CC-BY licence (no NC-ND)
  • APC discounts and publisher agreements
  • Check Open Science Policy Compliance
  • (Advise on exceptions)
  • (Publisher policy and licence agreements)

Invoice

Shopping Cart

(112 Off-prints)

Check OS Compliance

Approval

12 of 26

Open Access Publishing

OA publishing agreements

  • Publish and read agreements
    • Springer DEAL
    • Wiley DEAL
    • CoB
    • Nature P&R (2023)
    • RSC R&P (2023)
    • RUP R&P (2023)
  • Prepayment schemes
    • IUCr
  • OA membership
    • NAR
    • PLOS
    • Royal Society

13 of 26

Projekt DEAL

  • Access to 1,600 Wiley and 1,900 Springer journals (1997 – present)

  • Publish OA in 1,420 Wiley and 1,900 Springer journals for free

  • 20% discount in 110 Wiley and 600 Springer fully OA journals

  • No agreement with Elsevier:
    • ~ 200 German institutions canceled subscriptions
    • 42 scientists resigned from editorial activities

No one complained

14 of 26

EMBL Open Science Policy – Open Source Software

Analysis

Services

Methods

Education

Open Source by default.

Made available in open community software repositories.

15 of 26

Research Data Management

  • Introduce Research Data Management

  • What is a Data Management Plan (DMP)?

  • Resources and support at EMBL
    • EMBL's DMP template
    • Data Management App
    • STOCKS

16 of 26

EMBL Open Science Policy - Data

FAIR Principles – for all projects from the start

  • Findable: Permanent ID; Indexed, Documented
  • Accessible: Who, where, retrieval, authorization
  • Interoperable: Formats, standards
  • Reusable: Metadata, Licence

Data Management Plan

Requirements / Benefits

17 of 26

Research Data Management

What is Research Data?

  • All research data that was USED or PRODUCED
    • Simulation, observational data
    • Code, scripts
    • AV data
    • Raw and processed

What is Metadata?

  • Data on data
    • Administrative, Descriptive, Structural
    • Who, when, how, why
    • Resources, reagents
    • Experimental information, treatment, conditions
    • Licence, location, Persistent identifiers => reuse

18 of 26

Standards and databases for data, metadata

  • Administrative: relevant to managing it
  • Descriptive/citation: assists with discovery/identity
  • Structural: how the data came about & is structured
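The three metadata categories can be illustrated with a small machine-readable record. This is a minimal sketch, not an EMBL template or a community standard: every field name and value below is a hypothetical placeholder, and the helper `missing_groups` is invented for illustration.

```python
import json

# Hypothetical example record; all field names and values are placeholders.
metadata = {
    "administrative": {  # relevant to managing the data
        "contact": "researcher@example.org",
        "licence": "CC-BY-4.0",
        "storage_location": "/path/to/dataset",
    },
    "descriptive": {  # assists with discovery and identity
        "title": "Example imaging dataset",
        "creators": ["Example Researcher"],
        "persistent_identifier": "placeholder-identifier",
    },
    "structural": {  # how the data came about and is structured
        "file_format": "TIFF",
        "instrument": "example microscope",
        "treatment": "example condition",
    },
}


def missing_groups(record: dict) -> list:
    """Return the metadata categories that are absent or empty in a record."""
    required = ("administrative", "descriptive", "structural")
    return [group for group in required if not record.get(group)]


if __name__ == "__main__":
    print(json.dumps(metadata, indent=2))
    print("Missing groups:", missing_groups(metadata))
```

In practice the field names would follow whichever community standard or database applies to the data type, rather than an ad hoc dictionary like this one.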

Slide adapted from Lisanna Paladin (EMBL BioIT)

19 of 26

DMP template

Data management plan template

  • Data Management guide (Wiki - Ellenberg lab)

A DMP is a formal living document that outlines what you will do with the data during and after the project.

Required at EMBL at the project level.

Who can support you with DMPs?

  • OSIM@embl.org
  • Jean-Karim Heriche (heriche@embl.de)
  • EMBL Bio-IT (bio-it@embl.org)

20 of 26

STOCKS

STOCKS is a web platform for fundamental research data-management, featuring an inventory system coupled to an Electronic Lab Notebook (ELN).

A relational database to connect reagents to equipment, to protocol, to experiment, to data.

Who can support you with STOCKS?

  • STOCKS team (gbcs@embl.de)
  • Bio-IT STOCKS Training (bio-it@embl.org)

21 of 26

Data Management App (DMA)

The Data Management Application (DMA) is a flexible and customizable tool to assist you as an EMBL researcher in managing your data and documenting your data lifecycle from production to archiving.

Makes DMP creation easier: delegate the documentation and description of how you will track your data at the file-system level to the DMA docs.

Out-of-the-box tracking of file-system operations (sharing, archival, deletion, etc.).

Who can support you with EMBL's DMA?

  • DMA team (dma@embl.de)

Slide adapted from DMA team presentation and documentation

22 of 26

Add value and interactivity to Open Data

http://www.digitalembryo.org/

Gene Expression Atlas of the Phallusia mammillata embryo - Pierre Neveu Group

Mitotic Cell Atlas - Jan Ellenberg Group

https://www.mitocheck.org/

EMBL's Computational Support for Web Apps

https://shiny-portal.embl.de

23 of 26

Research Assessment in Academia

Impact factors have been shown to:

  • Be an unreliable measure of citations
  • Promote adverse behaviours in the research community
  • Be inapplicable to research outputs other than publications

Academics often rely on journal impact factors and H-indexes to evaluate the quality of research

Research Assessment Reform is an active initiative by researchers, research-performing institutes and funders

The aim is to improve the ways in which researchers and the outputs of scholarly research are evaluated

24 of 26

What is DORA?

Declaration On Research Assessment

  • Consider the value and impact of all research outputs (including datasets and software)

  • Consider a broad range of impact measures

  • Be explicit about the criteria used to reach hiring, tenure, and promotion decisions

https://sfdora.org

25 of 26

Four recommendations on Research Assessment implementation

  1. Clearly state EMBL’s commitment to the principles of DORA. We will include the following text on all job advertisements:

"EMBL supports fair and responsible research assessment, which includes its recruitment and performance assessment processes. We recognise a range of research outputs, discourage inappropriate use of proxies such as journal impact factors, and value research outputs based on their intrinsic merit. EMBL is a signatory of the San Francisco Declaration on Research Assessment (DORA)."

  2. Update the type of research outputs requested of applicants.

Candidates will be asked to include different research outputs (publications, datasets, software, patents, etc.) in their CV, along with a narrative describing the significance of key research outputs.

  3. Provide guidance to internal and external decision-making committees and evaluators (about funding, hiring, assessment, tenure, or promotion).

Change to explicit SAC instructions and refactoring of the Unit review dossier form to include alternative research outputs such as software and data.

  4. Communicate EMBL’s Research Assessment practices.

26 of 26

Thank you!