TY - JOUR
T1 - Review
T2 - A Roadmap to Use Nonstructured Data to Discover Multitarget Cancer Therapies
AU - Scoarta, Silvia
AU - Küçükosmanoglu, Asli
AU - Bindt, Felix
AU - Pouwer, Marianne
AU - Westerman, Bart A.
PY - 2023/4/1
Y1 - 2023/4/1
N2 - Therapy resistance to single agents has led to the realization that combination therapies could become the cornerstone of cancer treatment. To operationalize the selection of effective and safe multitarget therapies, we propose to integrate chemical and preclinical therapeutic information with clinical efficacy and toxicity data, allowing a new perspective on the drug target landscape. To assess the feasibility of this approach, we evaluated the publicly available chemical, preclinical, and clinical therapeutic data, and we addressed some potential limitations while integrating the data. First, by mapping available structured data from the main biomedical resources, we noticed that there is only a 1.7% overlap between drugs in chemical, preclinical, or clinical databases. Especially, the limited amount of structured data in the clinical domain hinders linking drugs to clinical aspects such as efficacy and side effects. Second, to overcome the abovementioned knowledge gap between the chemical, preclinical, and clinical domain, we suggest information extraction from scientific literature and other unstructured resources through natural language processing models, where BioBERT and PubMedBERT are the current state-of-the-art approaches. Finally, we propose that knowledge graphs can be used to link structured data, scientific literature, and electronic health records, to come to meaningful interpretations. Together, we expect this richer knowledge will lower barriers toward clinical application of personalized combination therapies with high efficacy and limited adverse events.
AB - Therapy resistance to single agents has led to the realization that combination therapies could become the cornerstone of cancer treatment. To operationalize the selection of effective and safe multitarget therapies, we propose to integrate chemical and preclinical therapeutic information with clinical efficacy and toxicity data, allowing a new perspective on the drug target landscape. To assess the feasibility of this approach, we evaluated the publicly available chemical, preclinical, and clinical therapeutic data, and we addressed some potential limitations while integrating the data. First, by mapping available structured data from the main biomedical resources, we noticed that there is only a 1.7% overlap between drugs in chemical, preclinical, or clinical databases. Especially, the limited amount of structured data in the clinical domain hinders linking drugs to clinical aspects such as efficacy and side effects. Second, to overcome the abovementioned knowledge gap between the chemical, preclinical, and clinical domain, we suggest information extraction from scientific literature and other unstructured resources through natural language processing models, where BioBERT and PubMedBERT are the current state-of-the-art approaches. Finally, we propose that knowledge graphs can be used to link structured data, scientific literature, and electronic health records, to come to meaningful interpretations. Together, we expect this richer knowledge will lower barriers toward clinical application of personalized combination therapies with high efficacy and limited adverse events.
UR - http://www.scopus.com/inward/record.url?scp=85159241760&partnerID=8YFLogxK
U2 - https://doi.org/10.1200/CCI.22.00096
DO - https://doi.org/10.1200/CCI.22.00096
M3 - Review article
C2 - 37116097
SN - 2473-4276
VL - 7
SP - e2200096
JO - JCO clinical cancer informatics
JF - JCO clinical cancer informatics
ER -