TY - JOUR
T1 - The Impact of Research Data Infrastructures
T2 - The Case of the AlphaFold Database
AU - Romasanta, Angelo Kenneth
AU - Wareham, Jonathan
AU - Priego, Laia Pujol
N1 - Publisher Copyright:
© The author/s, 2025.
PY - 2025
Y1 - 2025
N2 - While the scientific output of research infrastructures is well documented, the broader effects of their secondary outputs, such as computational resources and datasets, remain poorly understood. To better understand the benefits of these public resources, this study explores the AlphaFold (AFDB) database, a collaboration between DeepMind and the European Molecular Biology Laboratory (EMBL) that democratizes access to protein structure data. Employing a quantitative case study strategy using bibliometric analysis, this study compares publications indexed in the Web of Science Core Collection citing the original AF paper (Jumper et al., 2021) (n=13,049) with those citing the AlphaFold database (Varadi et al., 2022) (n=659), covering publications up to August 2024. We examine the impact of the EMBL AlphaFold database on research themes, collaboration patterns, and scientific impact. Our exploratory analysis identifies several impacts: studies leveraging the AF database investigate application-focused themes and require collaboration between fewer institutions. This research highlights the wide-ranging impacts of research infrastructures, emphasizing the need for comprehensive impact assessments to inform future research policy and funding decisions.
AB - While the scientific output of research infrastructures is well documented, the broader effects of their secondary outputs, such as computational resources and datasets, remain poorly understood. To better understand the benefits of these public resources, this study explores the AlphaFold (AFDB) database, a collaboration between DeepMind and the European Molecular Biology Laboratory (EMBL) that democratizes access to protein structure data. Employing a quantitative case study strategy using bibliometric analysis, this study compares publications indexed in the Web of Science Core Collection citing the original AF paper (Jumper et al., 2021) (n=13,049) with those citing the AlphaFold database (Varadi et al., 2022) (n=659), covering publications up to August 2024. We examine the impact of the EMBL AlphaFold database on research themes, collaboration patterns, and scientific impact. Our exploratory analysis identifies several impacts: studies leveraging the AF database investigate application-focused themes and require collaboration between fewer institutions. This research highlights the wide-ranging impacts of research infrastructures, emphasizing the need for comprehensive impact assessments to inform future research policy and funding decisions.
KW - AlphaFold
KW - bibliometrics
KW - research evaluation
KW - Research infrastructure
KW - scientific impact
UR - http://www.scopus.com/inward/record.url?scp=105006666634&partnerID=8YFLogxK
U2 - 10.23726/cij.2025.1597
DO - 10.23726/cij.2025.1597
M3 - Article
AN - SCOPUS:105006666634
SN - 2413-9505
VL - 9
SP - 42
EP - 48
JO - CERN IdeaSquare Journal of Experimental Innovation
JF - CERN IdeaSquare Journal of Experimental Innovation
IS - 1
ER -