Open Science in data collection and analysis
06/05/2025
SciLifeLab Training Hub
Ineke Luijten, PhD
How to implement Open Science
The Open Science workflow
Plan &
Design
Collect &
Analyse
Publish &
Spread
Access &
Reuse
Evaluate &
Build
Communicate & Involve
Reuse data from data repositories
Use open-source software for method documentation & data analysis
Publish preprints
Publish Open Access
Publish all research outputs
Practice Open peer review
Deposit all outputs in open repositories
Use open licensing
Attach persistent identifiers
Add rich metadata
Use responsible research metrics
Adopt qualitative research assessment
Track Open Science contributions
Share key insights through (social) media
Encourage Citizen Science
Turn research into MOOCs or OERs
Preregister hypotheses, study protocols, analysis
workflows
Create a data management plan
How to implement Open Science
The Open Science workflow
Plan &
Design
Collect &
Analyse
Publish &
Spread
Access &
Reuse
Evaluate &
Build
Communicate & Involve
Reuse data from data repositories
Use open-source software for method documentation & data analysis
Publish preprints
Publish Open Access
Publish all research outputs
Practice Open peer review
Deposit all outputs in open repositories
Use open licensing
Attach persistent identifiers
Add rich metadata
Use responsible research metrics
Adopt qualitative research assessment
Track Open Science contributions
Share key insights through (social) media
Encourage Citizen Science
Turn research into MOOCs or OERs
Preregister hypotheses, study protocols, analysis
workflows
Create a data management plan
How to implement Open Science
Collect &
Analyse
The Open Science workflow
Collect &
Analyse
Document methods and analyze data with open-source tools
Work according to Good Research Practice standards
Reuse data from data repositories
Collect &
Analyse
Method documentation
What: provide a clear, detailed, and accessible record of how your data was collected, processed, and analyzed
Why: others can understand, evaluate, and reproduce your research decisions
How: Protocols.io (sharing and collaboratively editing research protocols)
Jupyter Notebooks (mixes code, results and narrative)
Github (version-control)
Open Science Framework (bombine protocol, data, and code in one public space)
eLabFTW (open-source ELN, inventory and experiment tracking)
Openness in methodology
Image Protocol by NutJannah Noun Project CC-BY 3.0
Openness in data analysis
Collect &
Analyse
Analyse data with open-source tools
What: use tools that are freely available and transparent in their functioning to process, visualize and analyse data
Why: no expensive licenses required, exact replication of analyses, fast-evolving
How: OpenRefine (clean and transform messy survey or archival datasets )
R/Python (scripting language for data wrangling, statistics, and visualization)
OpenSesame (tool for building behavioral experiments)
JASP (graphical user interface but still open and reproducible)
Select interoperable formats
What: use non-proprietary file formats that are platform-independent
Why: data can be read and reused across platforms, tools, and time
How: CSV (Comma-Separated-Values for tabular data)
JSON (JavaScript Object Notation for hierarchical or structured data)
XML (eXtensible Markup Language for structured, machine-readable data with metadata)
TXT, Markdown, YAML (for plain-text documentation and config files)
Image file format by Fauzan AdiimaNoun Project CC-BY 3.0
Open Science in practice
Reuse data from repositories�Saves time and resources
Find well-documented datasets in trusted repositories, check licensing and ethics, and clearly cite and document how you use them.
Good Research Practice�Ensures ethical, credible outcomes
Conduct research that is rigorous, transparent, ethical, and fully documented so that it can be understood, verified, and reused by others.
Collect &
Analyse