Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model

© 2017, Springer International Publishing AG. We are observing an exponential growth of scientific literature since the last few decades. Tapping on the advancement of web-enabled tools and technologies, millions of articles are stored and indexed in the digital libraries. Among this archived scient...

Full description

Saved in:

Bibliographic Details
Main Authors:	Iqra Safder, Junaid Sarfraz, Saeed Ul Hassan, Mohsen Ali, Suppawong Tuarob
Other Authors:	Information Technology University
Format:	Conference or Workshop Item
Published:	2018
Subjects:	Computer Science
Online Access:	https://repository.li.mahidol.ac.th/handle/123456789/42427
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Mahidol University

id	th-mahidol.42427
record_format	dspace
spelling	th-mahidol.424272019-03-14T15:03:28Z Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model Iqra Safder Junaid Sarfraz Saeed Ul Hassan Mohsen Ali Suppawong Tuarob Information Technology University Mahidol University Computer Science © 2017, Springer International Publishing AG. We are observing an exponential growth of scientific literature since the last few decades. Tapping on the advancement of web-enabled tools and technologies, millions of articles are stored and indexed in the digital libraries. Among this archived scientific literature, thousands of newly emerging algorithms, mostly illustrated with pseudo-codes, are published every year in the area of Computer Science and other related computational fields. Previously, an array of techniques has been deployed to retrieve information related to these algorithms by indexing their pseudo-codes and metadata from a vast pool of scholarly documents. Unfortunately, existing search engines are only limited to indexing a textual description of each pseudo-code and are unable to provide simple algorithm-specific information such as run-time complexity, performance evaluation (such as precision, recall, or f-measure), and the size of the dataset it can effectively process, etc. In this paper, we propose a set of algorithms that extract information pertaining to the performance of algorithm(s) presented and/or discussed in the research article. Specifically, sentences in the paper that convey information about the efficiency of the corresponding algorithm are identified and extracted, using the Recurrent Convolutional Neural Network (RCNN) model. To evaluate the performance of our algorithm, we have collected a dataset of 258 manually annotated scholarly documents by four experts, originally downloaded from CiteseerX. Our proposed RCNN based model achieves encouraging 77.65% f-measure and 76.35% accuracy. 2018-12-21T07:23:38Z 2019-03-14T08:03:28Z 2018-12-21T07:23:38Z 2019-03-14T08:03:28Z 2017-01-01 Conference Paper Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol.10647 LNCS, (2017), 30-40 10.1007/978-3-319-70232-2_3 16113349 03029743 2-s2.0-85034018669 https://repository.li.mahidol.ac.th/handle/123456789/42427 Mahidol University SCOPUS https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85034018669&origin=inward
institution	Mahidol University
building	Mahidol University Library
continent	Asia
country	Thailand Thailand
content_provider	Mahidol University Library
collection	Mahidol University Institutional Repository
topic	Computer Science
spellingShingle	Computer Science Iqra Safder Junaid Sarfraz Saeed Ul Hassan Mohsen Ali Suppawong Tuarob Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model
description	© 2017, Springer International Publishing AG. We are observing an exponential growth of scientific literature since the last few decades. Tapping on the advancement of web-enabled tools and technologies, millions of articles are stored and indexed in the digital libraries. Among this archived scientific literature, thousands of newly emerging algorithms, mostly illustrated with pseudo-codes, are published every year in the area of Computer Science and other related computational fields. Previously, an array of techniques has been deployed to retrieve information related to these algorithms by indexing their pseudo-codes and metadata from a vast pool of scholarly documents. Unfortunately, existing search engines are only limited to indexing a textual description of each pseudo-code and are unable to provide simple algorithm-specific information such as run-time complexity, performance evaluation (such as precision, recall, or f-measure), and the size of the dataset it can effectively process, etc. In this paper, we propose a set of algorithms that extract information pertaining to the performance of algorithm(s) presented and/or discussed in the research article. Specifically, sentences in the paper that convey information about the efficiency of the corresponding algorithm are identified and extracted, using the Recurrent Convolutional Neural Network (RCNN) model. To evaluate the performance of our algorithm, we have collected a dataset of 258 manually annotated scholarly documents by four experts, originally downloaded from CiteseerX. Our proposed RCNN based model achieves encouraging 77.65% f-measure and 76.35% accuracy.
author2	Information Technology University
author_facet	Information Technology University Iqra Safder Junaid Sarfraz Saeed Ul Hassan Mohsen Ali Suppawong Tuarob
format	Conference or Workshop Item
author	Iqra Safder Junaid Sarfraz Saeed Ul Hassan Mohsen Ali Suppawong Tuarob
author_sort	Iqra Safder
title	Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model
title_short	Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model
title_full	Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model
title_fullStr	Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model
title_full_unstemmed	Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model
title_sort	detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model
publishDate	2018
url	https://repository.li.mahidol.ac.th/handle/123456789/42427
_version_	1763495907838394368

Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model

Similar Items