The Metascience Lab
Day 1
https://researchonresearch.org/project/a-f-i-r-e/
Welcome From Tom Stafford
Professor of Cognitive Science
& University Research Practice Lead
University of Sheffield
https://tomstafford.github.io/
Senior Research Fellow,
Research on Research Institute
Metascience Lab @ MS2025
- in partnership with Open Philanthropy and RoRI’s AFiRE programme
- three linked sessions will facilitate matchmaking and networking for experimentation
- all areas of metascience, with a focus on interventions to support higher quality, lower cost and more impactful research.
- Each session will showcase metascience principles, methods or examples of experimentation, as well as providing a platform for co-developing new project ideas by participants. Researchers, funders, universities, publishers and other actors in the research ecosystem are invited to propose experiments and matchmake with potential collaborators.
- The Abundance and Growth Fund at Open Philanthropy is happy to consider proposals that emerge from this process
- Topics you’d like considered? Please get in touch
Three days, three themes, three formats
Why and How to experiment
Funder experiments
Building institutional capacity
Welcome From Matt Clancy
Senior Program Officer, Abundance & Growth, Open Philanthropy
https://www.openphilanthropy.org/about/team/matt-clancy/
Senior fellow, Institute for Progress
Today’s plan (DAY ONE)
1400 Chair’s introduction
1405 Matt Clancey, Open Philanthropy: Why metascience needs new experiments
1410 Fiona Booth: "T0255: What is the appropriate ethics and governance framework for meta-research?”
1420 Albert Bravo-Biosca. “T0485 Exploring the use of randomised experiments in metascience”
1430 Response from Misha Teplitskiy
1440 Activity: matchmaking (facilitators: George Richardson, Amanda Kvarven, Youyou Wu, Albert Bravo-Biosca)
1510 Plenary: new idea pitches and challenge suggestions
What is the appropriate ethics and governance framework for meta-research?
Fiona Booth (University of Bristol)
Neil Jacobs (UK Reproducibility Network)
Marcus Munafò (University of Bath)
Pen-Yuan-Hsing (University of Bristol)
Local Community
Wider society
Anticipated harm
Unanticipated
harm
Kaupapa Māori Research 1,2
Wamba et al 20242
“….our story also illustrates the harm and challenges that can occur when researchers do not prioritize developing a foundation of trust with their participants.”
Manage relationships when starting and ending research with human participants (Joel Wambua (Busara), Anisha Singh (London School of Economics and Political Science), Kelvin Kihindas (Common Goal Research Center), Irene Gachungi (DIME, The World Bank) and Patrick S. Forscher (Busara) (2024). In P.S. Forscher & M. Schmidt (eds), A better how: notes on developmental meta-research (pp 161-166). Busara. DOI: doi.org/10.62372/ISCI6112
Trialling narrative CVs
Use of LLMs to screen conference abstracts versus manual review
CONSENT TO PARTICIPATE, FREEDOM TO WITHDRAW WITHOUT PENALTY
Unintended consequences
Safeguards for Meta-Research
Foundation of Trust
Consent (and withdrawal of it)
Cautious interpretation
“For whom is the study worthy & relevant?”
“Who says so?”
“To whom is the researcher accountable?”
References
Matchmaking activity
1. Are you a problem owner or a researcher?
2. Take three post-its
Problem owners: Yellow
Researcher: Green
3. Ask yourself this question: “if you are a problem owner, what is the most urgent problem you need a solution to? if you are a researcher, what metascientific area are you most excited about researching?”
4. Now write a two word phrase on a post-its along with your name, replicating the same thing on three other post-its
5. From a pair, ideally with someone with a different colour post-it
6. Explain your post-it note in a short pitch, then hand it to the other person in your pair
Suggestions ->
1 minute on this activity
Volunteer to pitch!
Peer review workbench
Plenary
Where are the biggest gains in improving efficiency or effectiveness in the research system?
What is something you learnt about someone else’s problem or perspective?
Topics for tomorrow?
Got an idea? https://forms.gle/E7hBDqwnbtHW9Bki9
Art of Funding @ MS2025
Come join a small group of funders to discuss the “Art of Funding”. Topics may include advancing new ideas within your organization, overcoming bottlenecks, efficiencies, and logistics of making and monitoring awards.
Bring your questions and ideas to share. Drinks will be served.
Please RSVP https://forms.gle/aHhpVWWc2gs7Fpux8
Discussions will continue afterwards at a restaurant of your choice.
Desk Rejection EoI
Funders! We are interested in speaking to research funders who use, or are considering using, quality-review based desk rejection
https://forms.gle/fWL4sa2ZdkU9tEwt8
TOMORROW: Funder experiments
2
Suggestions ->
The Metascience Lab
Day 2
https://researchonresearch.org/project/a-f-i-r-e/
Welcome From Tom Stafford
Professor of Cognitive Science
& University Research Practice Lead
University of Sheffield
https://tomstafford.github.io/
Senior Research Fellow,
Research on Research Institute
Metascience Lab @ MS2025
- in partnership with Open Philanthropy and RoRI’s AFiRE programme
- three linked sessions will facilitate matchmaking and networking for experimentation
- all areas of metascience, with a focus on interventions to support higher quality, lower cost and more impactful research.
- Each session will showcase metascience principles, methods or examples of experimentation, as well as providing a platform for co-developing new project ideas by participants. Researchers, funders, universities, publishers and other actors in the research ecosystem are invited to propose experiments and matchmake with potential collaborators.
- The Abundance and Growth Fund at Open Philanthropy is happy to consider proposals that emerge from this process
- Topics you’d like considered? Please get in touch
Three days, three themes, three formats
Suggestions
Why and How to experiment
Funder experiments
Building institutional capacity
What is an experiment?
PURITY
PLURALISM
RCTs
Planned
Principled
Public
More: https://researchonresearch.org/project/a-f-i-r-e/
Theodore Hodapp
Program Director, Science
Gordon and Betty Moore Foundation
co-chair
AFIRE Programme
Research on Research Institute
researchonresearch.org/project/a-f-i-r-e/
Today’s plan (DAY TWO)
1400 Chair’s introduction
1405 Ted Hodapp, Gordon and Betty Moore Foundation: What a research funder wants
1410 Stephen Pinfield “T0362 Evaluating Distributed Peer Review at the Volkswagen Foundation”
1420 Rhys Thomas and Adrian Barnett. “T0408 Did the switch to using partial randomisation at The British Academy change the characteristics of applicants?”
1430 Eric Brewe ”T0388 Evaluating scientific impact: A control group study at the Gordon and Betty Moore Foundation”
1440 Activity : topic discussions (facilitators: George Richardson, Amanda Kvarven, Youyou Wu, Albert Bravo-Biosca)
1510 Plenary: new idea pitches and challenge suggestions
Evaluating Distributed Peer Review at the Volkswagen Foundation��Anna Butters, Melanie Benson Marshall, Tom Stafford & Stephen Pinfield (Research on Research Institute and University of Sheffield);�Hanna Denecke, Alexander Bondarenko, Barbara Neubauer, Robert Nuske & Pierre Schwidlinski (Volkswagen Foundation)
Distributed Peer Review (DPR)
DPR Experiment at the Volkswagen Foundation
Internal Shortlisting
70 shortlisted
Quick Assessment
45 with 1+ A-, A, A+
Panel discussion
42 discussed
11 proposals recommended for funding
Proposal Matching
323 reviewers
Peer Review
1387 reviews
Proposal ranking
Trimmed mean method
10 proposals recommended for funding
140 proposals submitted
Panel Review
Distributed Peer Review and Panel Review - Parallel Processes
18 proposals funded
3 recommended by both processes
60% overlap
47% overlap
DPR
Panel selected proposals are found across the full range of DPR scores
DPR selected proposals are found across all Panel stages
Some headlines and moving forward
Areas of concern, particularly:
Our work is focusing on how these concerns can be addressed
Please let us know your thoughts!��researchonresearch.org�@RoRInstitute�
Did the switch to using partial randomisation at The British Academy change the characteristics of applicants?
Metascience Lab (II): Brokering experiments
Rhys.Thomas@dph.ox.ac.uk
1st July 2025
Presented by Rhys Llewellyn Thomas
Dr Rhys Llewellyn Thomas (University of Oxford), Dr Ken Emond (The British Academy), Professor Philip Clarke (University of Oxford), Professor Adrian Barnett (Queensland University of Technology)
The pattern may be replaced by an image, if preferred. When replacing the cover image, don't forget to �"send to back"
Slide
/ 9
Background
Small Grant Scheme
Conditional Lottery
40
To adjust the slide number total:
Slide
/ 9
Data
41
Slide
/ 9
Empirical Strategy
Empirical Strategy
Control Group
Outcomes
42
Slide
/ 9
43
43
Slide
/ 9
44
44
Slide
/ 9
45
45
Slide
/ 9
46
46
Slide
/ 9
Discussion and Conclusion
Mechanism of the effect
Conclusion
47
Slide
/ 9
Evaluating scientific impact: A control group study at the Gordon and Betty Moore Foundation
Eric Brewe, Meagan Sundstrom, Theodore Hodapp, Catherine Mader, Manolis Antonoyiannakis, Heidi Williams, Sheen S. Levine
48
Acknowledgements
Drexel PER Network
Meagan Sundstrom
Justin Gambrell
Maxwell Franklin
Colin Green
Ibukun Bukola
Ian Olivant
Gordon and Betty Moore Foundation
Tess Labbe
Richard Margoluis
49
Background
50
Experimental Physics Investigator Initiative (EPI)
Research Question
51
Do people who receive grant funding have more scientific impact than their equal-potential counterparts who do not get funded?
Measuring Scientific Impact
52
Network Normalized Citation Index
53
Citation Network
Normalize by average citations of papers in same year.
Random Assignment of Participants
54
Preregistering study of Ĉ
55
Preregistering study of Ĉ
56
| Role | |
| Investigator | Comparison |
Number of papers | 734 | 434 |
Average Ĉ | 1.86 | 2.09 |
Standard Dev. Ĉ | 1.98 | 2.22 |
Bayes Factor = 0.34 Null model is ~3x as likely
Thank You!
57
Activity - until 1510
Suggestions
1. Find a table according the “science production” process stage you are interesting
(ideally spread yourselves out across all tables)
2. The mission: identify an opportunity for improvement and agree on an IF..THEN.. sentence which captures an intervention (the IF) and the outcome measure (the THEN) in a simple sentence.
3. We will be sharing these at the end.
Suggestions
Suggestion form:
1. Have your details circulated to all attendees (and receive these details + the slides)
2. Suggest topics
3. Record your IF THEN idea
4. Volunteer to pitch
Plenary
Suggestions
1. Sharing our IF - THEN ideas
2. Suggestions for topics for tomorrow / pitches?
Got an idea? https://forms.gle/E7hBDqwnbtHW9Bki9
Art of Funding @ MS2025
Come join a small group of funders to discuss the “Art of Funding”. Topics may include advancing new ideas within your organization, overcoming bottlenecks, efficiencies, and logistics of making and monitoring awards.
Bring your questions and ideas to share. Drinks will be served.
Please RSVP https://forms.gle/aHhpVWWc2gs7Fpux8
Discussions will continue afterwards at a restaurant of your choice.
Desk Rejection EoI
Funders! We are interested in speaking to research funders who use, or are considering using, quality-review based desk rejection
https://forms.gle/fWL4sa2ZdkU9tEwt8
TOMORROW:
Building institutional capacity
3
Suggestions ->
The Metascience Lab
Day 3
https://researchonresearch.org/project/a-f-i-r-e/
Welcome From Tom Stafford
Professor of Cognitive Science
& University Research Practice Lead
University of Sheffield
https://tomstafford.github.io/
Senior Research Fellow,
Research on Research Institute
Metascience Lab @ MS2025
- in partnership with Open Philanthropy and RoRI’s AFiRE programme
- three linked sessions will facilitate matchmaking and networking for experimentation
- all areas of metascience, with a focus on interventions to support higher quality, lower cost and more impactful research.
- Each session will showcase metascience principles, methods or examples of experimentation, as well as providing a platform for co-developing new project ideas by participants. Researchers, funders, universities, publishers and other actors in the research ecosystem are invited to propose experiments and matchmake with potential collaborators.
- The Abundance and Growth Fund at Open Philanthropy is happy to consider proposals that emerge from this process
- Topics you’d like considered? Please get in touch
Three days, three themes, three formats
Suggestions
Why and How to experiment
Funder experiments
Building institutional capacity
McKenzie Leier
Policy Manager
Abdul Latif Jameel Poverty Action Lab | MIT
Science for Progress Initiative
Science for Progress Initiative (SfPI)
McKenzie Leier
J-PAL Has Funded Over 2,200 RCTs Across the Globe
72
Agriculture
Crime, Violence, �& Conflict
Education
Environment & Energy
Finance
Firms
Political Economy �& Governance
Social Protection
Gender
Health
Labor Markets
GROWTH IN J-PAL RCTS OVER TIME
2,223
2024
1,367
2018
792
2013
327
2008
103
2003
J-PAL | Metascience 2025
Relevant research questions for SfPI
What contracts, incentives, and institutions work best when funding scientific research?
��How can we ensure that the most talented individuals – including younger researchers, those entering science from non-traditional career paths, and those from underrepresented groups – are not discouraged from pursuing science?
��How best should we encourage the diffusion of socially valuable scientific discoveries out of labs and academic papers, so as to encourage innovation and economic growth?
73
High-skilled immigration RCT: A case study in failure
High-risk/high-reward project:
The project does end up failing – the researchers realize their design is not feasible after the pilot. However, this was a worthy failure in our eyes and worth taking a risk on.
74
Call for Researchers + Proposals + Contact information
75
Today’s plan (DAY THREE)
1130 Chair’s introduction
1135 McKenzie Leier, Poverty Action Lab: Effective partnerships
1145 Tom Stafford “T0354 Can AI be used for better matching of proposals to reviewers? Feasibility and formal evaluation with the Metascience 2025 conference”
1155 Hannelore Vanhaverbeke “T0164: Leveraging Success: How KU Leuven’s Internal Grants Boost External Funding Acquisition”
1205 Pitch consultancy (facilitators: George Richardson, Amanda Kvarven, Youyou Wu, James Phipps)
1220 Plenary: new idea pitches and challenge suggestions
1255 Jordan Dworkin, Open Philanthropy: Closing remarks
Can AI be used for better matching of proposals to reviewers? Feasibility and formal evaluation with the Metascience 2025 conference
Tom Stafford, Amanda Kvarven & The MS2025 Programme Committee
2025-06-27
https://researchonresearch.org/project/a-f-i-r-e/
Meta-metascience
Observation is not enough - we have to try things
- feasibility
- causal inference
Finding (enough, good) reviewers is a conceptual and practical problem
The “shadow” experiment
Consent from those submitting and reviewers
All analyses done after final programme decisions
All analyses local - no data left the conference
441 submissions: Title, Abstracts
25 reviewers: assigned to submissions via keywords
1323 reviews: scores & suitability
(each proposal seen by 3 reviewers)
Research Questions
1. Can language models help match proposals to reviewers?
2. Is it feasible for something like a conference to adopt/adapt this technology
3. Can it be done securely/privacy respecting?
Average Suitability was good
Matching - via embedding
Reviewer keywords & proposal title+abstract -> embedding space
Code from SNSF: https://github.com/snsf-data/snsf-grant-similarity
- thanks Gabriel Osaka and SNSF data team!
Model: SPECTER2: BERT model pre-trained on scientific texts and augmented by a citation graph
Actual proposal-reviewer matching far outperformed random matching
Optimal proposal-reviewer matching outperforms actual matching
You can predict suitability from matching score
…and from this you can predict gain in suitability from using the optimal match
Research Questions
1. Can language models help match proposals to reviewers?
2. Is it feasible for something like a conference to adopt/adapt this technology
3. Can it be done securely/privacy respecting?
Maybe - evidence for meaningful improvements beyond human matching
�Definitely yes
Definitely yes
Caveats:
- restricted range: just metascience & metascientists
- are there better models?
- will predicted gains in suitability pan out in metrics like p(accepts review) or review quality?
Thanks to all participants!
Join the conversation - sign up to the RoRI mailing list for updates on AFIRE projects
researchonresearch.org
@RoRInstitute
Funder Peer learning workshop (online):
Practicalities of implementing
language models locally
8th of July, 2pm BST / 3pm CEST
t.stafford@researchonresearch.org
Leveraging Success ��KU Leuven’s Internal Grants �Boost External Funding Acquisition
KU Leuven – Research Office
Hannelore Vanhaverbeke, Klara Gijsbers & Levent Bingöl
Research Coordination Office
KU Leuven (Belgium) - Research Office – Data Management & Analysis Unit
Reorganisation – broadening scope to metascience
Showcase
Network/learn
Research Coordination Office
KU Leuven (Belgium) - Research Office – Data Management & Analysis Unit
Reorganisation – broadening scope to metascience
Showcase
Network/learn
Government
Vicerector
Policy makers
RMA colleagues
Team (3.2 FTE)
Reports& Lists
Analyses & dashboards
Workflow optimalisation
New ideas
Government
Vicerector
Policy makers
RMA colleagues
Team (3.2 FTE)
Reports& Lists
Analyses & dashboards
Workflow optimalisation
New ideas
External panel review of internal funding mechanisms
Funds
Allocation
Mixed sources
Flemish government (80%)
Directly to institutions
Special Research Fund
Industrial Research Fund
Indirectly & competitively to researchers
Government-subsidized funding agencies
projects
projects
Counterfactual analysis
Matched pairs/Difference in difference
Significance of observed difference
Approach
Known issues
Control group: formation
Matthew Effect: how to avoid/reduce?
Causality: how to prove?
Research Coordination Office
Nearest Neighbour
Control group
formation
Research Coordination Office
Data: 2015-2023; 9 cohorts based on start of the C1/C2 grant
Age
Gender
Nationality
% Employment
Years since PhD
Years tenured
Science group
Internal
funding
No internal
funding
Some researchers have obtained prior funding - others have limited/no budget = difference at the start
< 5 k€
5 - 55 k€
55 - 120 k€
120 - 250 k€
>= 250 k€
Matthew effect reduction
Research Coordination Office
198 researcher pairs
Are these really matches?
Research Coordination Office
Budget size | Nr of matched pairs | p value |
< 5 k€ | 74 | 0.54 |
5 - 55 k€ | 52 | 0.36 |
55 - 120 k€ | 28 | 0.03 |
120 - 250 k€ | 16 | 0.68 |
>250 k€ | 28 | 0.23 |
Leverage effect: expectation
Research Coordination Office
internal
external
Leverage effect: when in evidence?
Research Coordination Office
+ 1 year
start date = (duration C1 or C2)/2
end date = ((duration C1 or C2)/2 + (duration C1 or C2) + 1 year)
Start project
End project
Mid-term project
2015
2018
2017
2020
2021
Leverage effect: results
Null hypothesis: there is no difference between the funded target group and the control group in terms of acquiring external funding
Research Coordination Office
Budget size | Nr of matched pairs | p value | Effect size |
< 5 k€ | 74 | < 0.0001 | 0.7468 (medium) |
5 - 55 k€ | 52 | 0.0067 | 0.5492 (large) |
55 - 120 k€ | 28 | 0.0001 | 0.9067 (large) |
120 - 250 k€ | 16 | 0.3942 |
|
>250 k€ | 28 | 0.0480 |
|
Difference-in-Difference: Mann-Whitney U hypothesis testing confirms significant difference between both groups with a medium effect size
Leverage effect: results
Research Coordination Office
Significant leverage effect for researchers with starting budgets under 120 k€
No significant differences observed for those with starting budgets between 120 – 250 k€
however: smallest group in the analysis, caution
Starting budget over 250 k€ initially showed a significant leverage effect, but this dissolved after Bonferroni correction
For budget classes < 5 k€ and 55 - 120 k€, the size effect was significantly large
Negligible or very weak correlations between the amount of external funding and the initial budget size
Looking for input on collaboration
Expertise on data structure, semantics, limitations, …
Experience with ‘standard’ analyses
Input needed esp. on novel approaches
Towards a RMA – researcher collaboration: advice?
Pitch consultancy activity - until 1220
Suggestions
1. Groups of Three People: A (pitching), B & C (consultants)
2. A pitches for <2 minutes, B & C don’t interrupt!
B&C take notes on how to improve the pitch
3. B & C discuss pitch idea, A doesn’t interrupt!
A takes notes on how pitch landed
4. A shares what they learnt
Notes: https://learninginnovation.ca/wp-content/uploads/2020/05/3WayPitch.pdf
Pitches
Mandated external partners: Maria Aleksandrova
Research capacity training: Habeeb Kolade
Communicating Robustness: Alexandra Sarafoglou
Redfining Significance: Jack Fitzgerald
Sharing marginal near hits between funders: Noam Tal-Parry
Funders of Clinical Trials: Maia Salholz-Hillel
Open Research Sabbaticals: Corinne Jola
Targeted Capacity Development and Mentorship for Researchers in the Global South: Interventions to drive employability and higher quality research outputs
Habeeb Kolade | ResearchRound Institute | habeeb@researchround.com
Metascience Conference 2025 | UCL | July 2, 2025
RESEARCH PROJECTS
MENTORING
TARGETED TRAINING
Provide hands-on practice with real-time feedback during mentorship sessions.
Design and deliver various research classes on foundational research skills and interdisciplinary topics.
Support researchers to complete research projects for critical thinking development and
empowering researchers to design and analyze
THEN
PIPELINES OF MOTIVATED SCIENTISTS WITH TRANSFERABLE SKILLS
STRONGER CONFIDENCE IN DOING RESEARCH
HIGHER QUALITY RESEARCH
IF
Suggestions
Suggestion form:
1. Have your details circulated to all attendees (and receive these details + the slides)
4. Volunteer to pitch
Seed Grants
Who we fund
Examples of activities we’re looking to fund
Funded by the Alfred P. Sloan Foundation, IGL Seed Grants support researchers in piloting innovative experimental ideas and activities that yield the potential to carry out Randomised Controlled Trials (RCTs) that generate high-quality evidence for innovation, science, and productivity.
Funding Range: Awards ranging up to $8,000 USD.
Timeline: Call Opens: 15 September 2025; Deadline for Proposals: 15 October 2025
https://www.innovationgrowthlab.org/seed-grants
Jordan Dworkin
Suggestions
Senior Program Associate, Innovation Policy
Open Philanthropy
jordan.dworkin@openphilanthropy.org