Citation analysis on Google Scholar

Google Scholar, Scopus and Web of Science are some of the most commonly used online databases for scholarly work. The mentioned databases vary in their coverage and accuracy of citation counts. This purpose of this report is to conduct citation analysis on Google Scholar. H-index, among other statis...

Full description

Saved in:
Bibliographic Details
Main Author: Vidur Puliani
Other Authors: Xiao Xiao Kui
Format: Final Year Project
Language:English
Published: 2015
Subjects:
Online Access:http://hdl.handle.net/10356/62821
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-62821
record_format dspace
spelling sg-ntu-dr.10356-628212023-03-03T20:26:51Z Citation analysis on Google Scholar Vidur Puliani Xiao Xiao Kui School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Mathematics of computing::Numerical analysis Google Scholar, Scopus and Web of Science are some of the most commonly used online databases for scholarly work. The mentioned databases vary in their coverage and accuracy of citation counts. This purpose of this report is to conduct citation analysis on Google Scholar. H-index, among other statistics, is a widely used measure to evaluate the number of citations of an author. H-index is often used to evaluate the impact of an author’s work on his or her peers and used as an evaluation tool for grants and promotions. Given the importance of h-index as a measure, it is important to identify and extract any possible distortions to give an unbiased measure of an author’s influence. The total citation counts used to calculate the h-index includes self-citations. Self-citations are citations where the author of the citing paper and cited paper are the same. A higher number of self-citations might correlate with higher h-index, which does not necessarily imply a greater influence of an author’s work. Therefore, this report aims to analyse the effect of self-citation on h-index by calculating two h-index values, one using the total number of citations and the other excluding the self-citations. A python crawler was developed to collect the citation data for three authors from Google Scholar and store it in a local database for analysis. The citation analysis shows that the h-index value without self-citation decreases, albeit the effect was limited and non-uniform. Bachelor of Engineering (Computer Science) 2015-04-29T07:40:13Z 2015-04-29T07:40:13Z 2015 2015 Final Year Project (FYP) http://hdl.handle.net/10356/62821 en Nanyang Technological University 37 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Mathematics of computing::Numerical analysis
spellingShingle DRNTU::Engineering::Computer science and engineering::Mathematics of computing::Numerical analysis
Vidur Puliani
Citation analysis on Google Scholar
description Google Scholar, Scopus and Web of Science are some of the most commonly used online databases for scholarly work. The mentioned databases vary in their coverage and accuracy of citation counts. This purpose of this report is to conduct citation analysis on Google Scholar. H-index, among other statistics, is a widely used measure to evaluate the number of citations of an author. H-index is often used to evaluate the impact of an author’s work on his or her peers and used as an evaluation tool for grants and promotions. Given the importance of h-index as a measure, it is important to identify and extract any possible distortions to give an unbiased measure of an author’s influence. The total citation counts used to calculate the h-index includes self-citations. Self-citations are citations where the author of the citing paper and cited paper are the same. A higher number of self-citations might correlate with higher h-index, which does not necessarily imply a greater influence of an author’s work. Therefore, this report aims to analyse the effect of self-citation on h-index by calculating two h-index values, one using the total number of citations and the other excluding the self-citations. A python crawler was developed to collect the citation data for three authors from Google Scholar and store it in a local database for analysis. The citation analysis shows that the h-index value without self-citation decreases, albeit the effect was limited and non-uniform.
author2 Xiao Xiao Kui
author_facet Xiao Xiao Kui
Vidur Puliani
format Final Year Project
author Vidur Puliani
author_sort Vidur Puliani
title Citation analysis on Google Scholar
title_short Citation analysis on Google Scholar
title_full Citation analysis on Google Scholar
title_fullStr Citation analysis on Google Scholar
title_full_unstemmed Citation analysis on Google Scholar
title_sort citation analysis on google scholar
publishDate 2015
url http://hdl.handle.net/10356/62821
_version_ 1759853598869553152