SciLifeLab/NBIS and Services for Sensitive Data
Bengt Persson, Director of NBIS
20 March 2025, AIDA Data Science Platform Launch Event
Contents
What is SciLifeLab?
Founded in 2010 by Karolinska Institutet, KTH Royal Institute of Technology, Stockholm University and Uppsala University
National hub enabling life science research that would otherwise not be possible
Government appointed mission as a national research infrastructure
Research community gathering scientists across universities and disciplines
Today, activities at all major Swedish universities with sites launched in Linköping, Lund, Gothenburg and Umeå
… and collaborations with healthcare, industry, other governmental agencies and international organizations
Areas of activities
Provide excellent and impactful life science infrastructure
10 service areas and 40 units
1,600 users and 3,500 projects yearly
600 technology experts
Strengthen research communities, capabilities, and global partnerships
300 group leaders across all sites
Capabilities: Precision Medicine, Pandemic Laboratory Preparedness, Planetary Biology
Drug Discovery & Development
International collaborations e.g. EMBL
Innovation and bridge-building for the benefit of society
Collaborations across sectors and borders, with industry and healthcare
Attract scientific excellence and provide advanced training
SciLifeLab and DDLS Fellows program
Training hub
PhD and postdoc training
Facilitate the transformation of life science data into knowledge
SciLifeLab & Wallenberg National Program for Data-Driven Life Science (DDLS)
Computational and data science base for open, real-time FAIR data sharing
AI and data science expertise in life science
SciLifeLab Strategy
As Sweden's National Infrastructure for Molecular Life Sciences, SciLifeLab aims to:
Infrastructure user statistics
Academic users
Non-academic users
Infrastructure staff
Capabilities
Strategic capabilities around which SciLifeLab gathers infrastructure technology, research & expertise
Planetary Biology
Studying life in the environmental context
Pandemic Laboratory Preparedness
Building laboratory capacity to assist in future pandemics
Precision Medicine
Bringing cutting-edge tech. and first-class expertise towards patient benefit
SciLifeLab and Wallenberg National Programme for Data-Driven Life Science (DDLS)
12 years, 3.7 GSEK (~340 MEUR), 11 partners, coordinated by SciLifeLab
Overall 12-year plan for the DDLS programme
39 DDLS Fellows
78 PhDs and 78 postdocs
140 PhDs in academia and 45 industry PhDs
90 postdocs and 45 industry postdocs
210 MSEK WASP
35 MSEK WASP-HS
235 MSEK
670 MSEK
4 research areas
National Bioinformatics Infrastructure Sweden
NBIS staff at our recent retreat at Ystad Saltsjöbad, 12 March 2025
Umeå
Göteborg
Lund
Linköping
Stockholm
Uppsala
Three pillars
Support
Infrastructure
Training
NBIS Units/Teams
Support: currently ~55 national staff
https://www.nbis.se/services
Cryo-EM and structural biology
AI in medical imaging
Bioimaging
Erik Ylipää
Tim Schulte, Piotr Draczkowski, Claudio Mirabello
Frontend/backend
Visualizations
Code review
Software/workflows
Anna Klemm and team
Data Management & Human Data
Data management & Data publication support
Human Data
The Swedish ELIXIR node
1+MG Declaration of cooperation starting 2018
24 countries and 4 observers
Signatory countries
Observers
1+MG Roadmap 2018 - 2027
Credit: Karen Arnott/EMBL
Institutes from Finland, Germany, Norway, Spain and Sweden are the first nodes of the Federated European Genome-phenome Archive, one of the largest international networks for discovery of sensitive human data
“Before the EGA, data from a research study were generated once, analysed once, and often ‘locked away’ on the institute’s servers.
The Federated EGA expands the benefits of data reuse across national borders and increases the value and impact of the data.”
Mallory Freeberg
EGA Coordinator
at EMBL-EBI
FEGA – Federated European Genome-phenome Archive
Swedish node in production since Feb 2024
Federation of human genomic data
Many national datasets from human research participants need to be stored locally (European Genome-phenome Archive – EGA)
ELIXIR is developing a federation with shared metadata (FAIR) and local data stores (secure), based on a suite of interoperable, reusable, adopted, and fit-for-purpose standards
Linking local EGA to national clouds and international access (ELIXIR-AAI - Authentication and Authorisation Infrastructure)
17/25 ELIXIR Nodes are funded in the FHD community
Use case: COVID-19
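The federation idea above (metadata shared, data kept in a secure local store) can be illustrated with a minimal Python sketch. Everything here is hypothetical: the class names `LocalNode` and `CentralCatalogue`, the node code "SE", the dataset identifier and title, and the boolean `authorised` flag are illustration-only assumptions and do not describe the actual FEGA or ELIXIR-AAI software.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DatasetMetadata:
    """Shareable description of a dataset; it carries no participant data."""
    dataset_id: str
    node: str
    title: str


class LocalNode:
    """Hypothetical national node: payloads stay here, only metadata leave."""

    def __init__(self, name):
        self.name = name
        self._payloads = {}  # dataset_id -> bytes, never exported
        self._titles = {}    # dataset_id -> title, exported as metadata

    def deposit(self, dataset_id, title, payload):
        self._payloads[dataset_id] = payload
        self._titles[dataset_id] = title

    def export_metadata(self):
        return [DatasetMetadata(d, self.name, t) for d, t in self._titles.items()]

    def read(self, dataset_id, authorised):
        # Stand-in for a real authentication/authorisation check.
        if not authorised:
            raise PermissionError(f"no access to {dataset_id} at node {self.name}")
        return self._payloads[dataset_id]


class CentralCatalogue:
    """Hypothetical shared catalogue that holds metadata only."""

    def __init__(self):
        self._entries = []

    def harvest(self, node):
        self._entries.extend(node.export_metadata())

    def search(self, term):
        term = term.lower()
        return [e for e in self._entries if term in e.title.lower()]


# Usage: discovery happens centrally, retrieval happens at the owning node.
node_se = LocalNode("SE")
node_se.deposit("ds-1", "Example cohort A", b"ACGT")
catalogue = CentralCatalogue()
catalogue.harvest(node_se)
hits = catalogue.search("cohort")
```

The design point of the sketch is that `CentralCatalogue` never sees a payload: a search result only says which node holds a dataset, and the node itself decides whether to release it.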
GDI – European Genomic Data Infrastructure
What is GDI setting out to do?
Support the 1+Million Genomes (1+MG) initiative ambition to enable secure access to high-quality genomics and the corresponding clinical data across Europe for better research, personalised healthcare and health policy making
Establishing a federated, sustainable and secure infrastructure based on open community standards to access genomic and related phenotypic and clinical data across Europe
Building on the Beyond 1 Million Genomes (B1MG) project outputs
Countries’ commitment to GDI by 2026
Fully operational and integrated into 1+MG infrastructure: Belgium, Czech Republic, Denmark, Estonia, Finland, France, Germany, Italy, Luxembourg, Portugal, Slovenia, Spain, Sweden, The Netherlands, Norway
Fully operational national node but not yet integrated in the 1+MG infrastructure: Bulgaria, Latvia, Lithuania
Onboarding: Croatia, Cyprus, Hungary, Ireland, Malta, Romania
GDI project receives funding from the European Union’s Digital Europe Programme under grant agreement number 101081813.
Current work
Governance model - EDIC
Infrastructure products
Exploring federated learning
Data mgmt policy
Make it work
Make it useful
Make it last
P1
P2
P3
NBIS co-lead
Project coordination: ELIXIR hub
GDI
Overview of the GDI major components
Data Discovery
Data Access Management
Storage & Interfaces
Data Reception
Data Processing
5 FUNCTIONALITIES
Find applicable datasets based on phenotype
Authenticate yourself
Search mutation and/or phenotype
Apply for data access; the DAC evaluates and approves the request
Data made available
Access data
Perform analysis
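The user journey listed above can be read as an ordered workflow with one gate: nothing after the access application proceeds until the DAC has approved. The following Python sketch models that reading. The stage order is taken from the list as it appears on the slide, which is an assumption; the names `Stage` and `AccessJourney` are hypothetical and are not part of any GDI component.

```python
from enum import IntEnum


class Stage(IntEnum):
    FIND_DATASETS = 1        # find applicable datasets based on phenotype
    AUTHENTICATE = 2         # authenticate yourself
    SEARCH = 3               # search mutation and/or phenotype
    APPLY_FOR_ACCESS = 4     # apply for data access
    DATA_MADE_AVAILABLE = 5  # data made available
    ACCESS_DATA = 6          # access data
    PERFORM_ANALYSIS = 7     # perform analysis


class AccessJourney:
    """Hypothetical tracker that enforces stage order and the DAC gate."""

    def __init__(self):
        self.completed = []
        self.dac_approved = False

    def record_dac_decision(self, approved):
        # A decision only makes sense once an application has been filed.
        if not self.completed or self.completed[-1] is not Stage.APPLY_FOR_ACCESS:
            raise ValueError("no pending access application")
        self.dac_approved = approved

    def advance(self, stage):
        if len(self.completed) == len(Stage):
            raise ValueError("journey already complete")
        expected = Stage(len(self.completed) + 1)
        if stage is not expected:
            raise ValueError(f"expected {expected.name}, got {stage.name}")
        if stage > Stage.APPLY_FOR_ACCESS and not self.dac_approved:
            raise PermissionError("DAC has not approved the request")
        self.completed.append(stage)


# Usage: walk the journey up to the application, then wait for the DAC.
journey = AccessJourney()
for s in (Stage.FIND_DATASETS, Stage.AUTHENTICATE, Stage.SEARCH, Stage.APPLY_FOR_ACCESS):
    journey.advance(s)
```

Calling `journey.advance(Stage.DATA_MADE_AVAILABLE)` at this point raises `PermissionError`; after `journey.record_dac_decision(True)` the remaining stages can be completed in order.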
Difference between EHDS and 1+MG
Comparison between EHDS and 1+MG
[Diagram comparing EHDS and 1+MG. Labels: Query, Response, EC SPE, HDAB (three instances), 1+MG EDIC Infrastructure]
NBIS is leading WP4 on Data Infrastructure
Anna Hagwall
anna.hagwall@nbis.se, UU
WP4 lead
Anna-Lena Ellasdotter
anna-lena.ellasdotter@nbis.se, UU
WP4 co-lead
Funded by the European Union’s Digital Europe Programme, grant agreement #101168231. Part of the 1+MG Initiative.
GoE kickoff meeting
30th-31st October 2024
Task 1: Identify infrastructure gaps and implement advanced GoE use cases
Identify gaps between GoE needs and GDI infrastructure
Communication partner on advanced use cases
Susanna Repo and Tuuli Järvinen
Task 2: Data management
Expert network
- Data manager job profile
- Knowledge transfer to other data stewards
Submit data at GoE partners
- Synthetic
- Real
Niclas Jareborg and Karin Granström
Thank you for your attention!
Questions? Comments?