Harnessing full-text publications for deep insights into C. elegans and Drosophila biomaps

In the rapidly expanding domain of scientific research, tracking and synthesizing information from the rapidly increasing volume of publications pose significant challenges. To address this, we introduce a novel high-throughput pipeline that employs ChatGPT to systematically extract and analyze conn...

Full description

Saved in:
Bibliographic Details
Main Authors: Arulprakasam, Karthick Raja, Toh, Janelle Wing Shan, Foo, Herman, Kumar, Mani R., Kutevska, An-Nikol, Davey, Emilia Emmanuelle, Mutwil, Marek, Thibault, Guillaume
Other Authors: School of Biological Sciences
Format: Article
Language:English
Published: 2025
Subjects:
Online Access:https://hdl.handle.net/10356/182064
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-182064
record_format dspace
spelling sg-ntu-dr.10356-1820642025-01-13T15:32:32Z Harnessing full-text publications for deep insights into C. elegans and Drosophila biomaps Arulprakasam, Karthick Raja Toh, Janelle Wing Shan Foo, Herman Kumar, Mani R. Kutevska, An-Nikol Davey, Emilia Emmanuelle Mutwil, Marek Thibault, Guillaume School of Biological Sciences Mechanobiology Institute, NUS Medicine, Health and Life Sciences Genetic interactions Text mining In the rapidly expanding domain of scientific research, tracking and synthesizing information from the rapidly increasing volume of publications pose significant challenges. To address this, we introduce a novel high-throughput pipeline that employs ChatGPT to systematically extract and analyze connectivity information from the full-texts and abstracts of 24,237 and 150,538 research publications concerning Caenorhabditis elegans and Drosophila melanogaster, respectively. This approach has effectively identified 200,219 and 1,194,587 interactions within the C. elegans and Drosophila biomaps, respectively. Utilizing Cytoscape Web, we have developed a searchable online biomaps that link relevant keywords to their corresponding PubMed IDs, thus providing seamless access to an extensive knowledge network encompassing C. elegans and Drosophila. Our work highlights the transformative potential of integrating artificial intelligence with bioinformatics to deepen our understanding of complex biological systems. By revealing the intricate web of relationships among key entities in C. elegans and Drosophila, we offer invaluable insights that promise to propel advancements in genetics, developmental biology, neuroscience, longevity, and beyond. We also provide details and discuss significant nodes within both biomaps, including the insulin/IGF-1 signaling (IIS) and the notch pathways. Our innovative methodology sets a robust foundation for future research aimed at unravelling complex biological networks across diverse organisms. The two databases are available at worm.bio-map.com and drosophila.bio-map.com. Ministry of Education (MOE) Published version This work was supported by funds from the Singapore Ministry of Education Academic Research Fund Tier 1 (RG96/22 to G.T.) and Tier 3 (MOET32022-0002 to M.M.) as well as the Research Scholarship to K.R.A [predoctoral fellowship from Singapore Ministry of Education Academic Research Fund Tier 3 (MOE-MOET32020-0001)]. 2025-01-07T01:13:21Z 2025-01-07T01:13:21Z 2024 Journal Article Arulprakasam, K. R., Toh, J. W. S., Foo, H., Kumar, M. R., Kutevska, A., Davey, E. E., Mutwil, M. & Thibault, G. (2024). Harnessing full-text publications for deep insights into C. elegans and Drosophila biomaps. BMC Genomics, 25(1), 1080-. https://dx.doi.org/10.1186/s12864-024-10997-6 1471-2164 https://hdl.handle.net/10356/182064 10.1186/s12864-024-10997-6 39538127 2-s2.0-85209171186 1 25 1080 en RG96/22 MOET32022-0002 MOE-MOET32020-0001 BMC Genomics © 2024 The Author(s). Open Access. This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creati vecommons.org/licenses/by-nc-nd/4.0/. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Medicine, Health and Life Sciences
Genetic interactions
Text mining
spellingShingle Medicine, Health and Life Sciences
Genetic interactions
Text mining
Arulprakasam, Karthick Raja
Toh, Janelle Wing Shan
Foo, Herman
Kumar, Mani R.
Kutevska, An-Nikol
Davey, Emilia Emmanuelle
Mutwil, Marek
Thibault, Guillaume
Harnessing full-text publications for deep insights into C. elegans and Drosophila biomaps
description In the rapidly expanding domain of scientific research, tracking and synthesizing information from the rapidly increasing volume of publications pose significant challenges. To address this, we introduce a novel high-throughput pipeline that employs ChatGPT to systematically extract and analyze connectivity information from the full-texts and abstracts of 24,237 and 150,538 research publications concerning Caenorhabditis elegans and Drosophila melanogaster, respectively. This approach has effectively identified 200,219 and 1,194,587 interactions within the C. elegans and Drosophila biomaps, respectively. Utilizing Cytoscape Web, we have developed a searchable online biomaps that link relevant keywords to their corresponding PubMed IDs, thus providing seamless access to an extensive knowledge network encompassing C. elegans and Drosophila. Our work highlights the transformative potential of integrating artificial intelligence with bioinformatics to deepen our understanding of complex biological systems. By revealing the intricate web of relationships among key entities in C. elegans and Drosophila, we offer invaluable insights that promise to propel advancements in genetics, developmental biology, neuroscience, longevity, and beyond. We also provide details and discuss significant nodes within both biomaps, including the insulin/IGF-1 signaling (IIS) and the notch pathways. Our innovative methodology sets a robust foundation for future research aimed at unravelling complex biological networks across diverse organisms. The two databases are available at worm.bio-map.com and drosophila.bio-map.com.
author2 School of Biological Sciences
author_facet School of Biological Sciences
Arulprakasam, Karthick Raja
Toh, Janelle Wing Shan
Foo, Herman
Kumar, Mani R.
Kutevska, An-Nikol
Davey, Emilia Emmanuelle
Mutwil, Marek
Thibault, Guillaume
format Article
author Arulprakasam, Karthick Raja
Toh, Janelle Wing Shan
Foo, Herman
Kumar, Mani R.
Kutevska, An-Nikol
Davey, Emilia Emmanuelle
Mutwil, Marek
Thibault, Guillaume
author_sort Arulprakasam, Karthick Raja
title Harnessing full-text publications for deep insights into C. elegans and Drosophila biomaps
title_short Harnessing full-text publications for deep insights into C. elegans and Drosophila biomaps
title_full Harnessing full-text publications for deep insights into C. elegans and Drosophila biomaps
title_fullStr Harnessing full-text publications for deep insights into C. elegans and Drosophila biomaps
title_full_unstemmed Harnessing full-text publications for deep insights into C. elegans and Drosophila biomaps
title_sort harnessing full-text publications for deep insights into c. elegans and drosophila biomaps
publishDate 2025
url https://hdl.handle.net/10356/182064
_version_ 1821279342392508416