End-to-End Relation Extraction of Pharmacokinetic Estimates from the Scientific Literature

Ferran Gonzalez Hernandez, Victoria C. Smith, Quang Nguyen, José Antonio Cordero, Maria Rosa Ballester, Màrius Duran, Albert Solé, Palang Chotsiri, Thanaporn Wattanakul, Gill Mundin, Watjana Lilaonitkul, Joseph F. Standing, Frank Kloprogge

Producció científica: Capítol de llibreContribució a congrés/conferènciaAvaluat per experts

Resum

The lack of comprehensive and standardised databases containing Pharmacokinetic (PK) parameters presents a challenge in the drug development pipeline. Efficiently managing the increasing volume of published PK Parameters requires automated approaches that centralise information from diverse studies. In this work, we present the Pharmacokinetic Relation Extraction Dataset (PRED), a novel, manually curated corpus developed by pharmacometricians and NLP specialists, covering multiple types of PK parameters and numerical expressions reported in open-access scientific articles. PRED covers annotations for various entities and relations involved in PK parameter measurements from 3,600 sentences. We also introduce an end-to-end relation extraction model based on BioBERT, which is trained with joint named entity recognition (NER) and relation extraction objectives. The optimal pipeline achieved a micro-average F1-score of 94% for NER and over 85% F1-score across all relation types. This work represents the first resource for training and evaluating models for PK end-to-end extraction across multiple parameters and study types. We make our corpus and model openly available to accelerate the construction of large PK databases and to support similar endeavours in other scientific disciplines..

Idioma originalAnglès
Títol de la publicacióBioNLP 2024 - 23rd Meeting of the ACL Special Interest Group on Biomedical Natural Language Processing, Proceedings of the Workshop and Shared Tasks
EditorsDina Demner-Fushman, Sophia Ananiadou, Makoto Miwa, Kirk Roberts, Junichi Tsujii
EditorAssociation for Computational Linguistics (ACL)
Pàgines144-154
Nombre de pàgines11
ISBN (electrònic)9798891761308
Estat de la publicacióPublicada - 2024
Esdeveniment23rd Meeting of the ACL Special Interest Group on Biomedical Natural Language Processing, BioNLP 2024 - Bangkok, Thailand
Durada: 16 d’ag. 2024 → …

Sèrie de publicacions

NomBioNLP 2024 - 23rd Meeting of the ACL Special Interest Group on Biomedical Natural Language Processing, Proceedings of the Workshop and Shared Tasks

Conferència

Conferència23rd Meeting of the ACL Special Interest Group on Biomedical Natural Language Processing, BioNLP 2024
País/TerritoriThailand
CiutatBangkok
Període16/08/24 → …

Fingerprint

Navegar pels temes de recerca de 'End-to-End Relation Extraction of Pharmacokinetic Estimates from the Scientific Literature'. Junts formen un fingerprint únic.

Com citar-ho