Citation analysis on Google Scholar
Google Scholar, Scopus and Web of Science are some of the most commonly used online databases for scholarly work. The mentioned databases vary in their coverage and accuracy of citation counts. This purpose of this report is to conduct citation analysis on Google Scholar. H-index, among other statis...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2015
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/62821 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-62821 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-628212023-03-03T20:26:51Z Citation analysis on Google Scholar Vidur Puliani Xiao Xiao Kui School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Mathematics of computing::Numerical analysis Google Scholar, Scopus and Web of Science are some of the most commonly used online databases for scholarly work. The mentioned databases vary in their coverage and accuracy of citation counts. This purpose of this report is to conduct citation analysis on Google Scholar. H-index, among other statistics, is a widely used measure to evaluate the number of citations of an author. H-index is often used to evaluate the impact of an author’s work on his or her peers and used as an evaluation tool for grants and promotions. Given the importance of h-index as a measure, it is important to identify and extract any possible distortions to give an unbiased measure of an author’s influence. The total citation counts used to calculate the h-index includes self-citations. Self-citations are citations where the author of the citing paper and cited paper are the same. A higher number of self-citations might correlate with higher h-index, which does not necessarily imply a greater influence of an author’s work. Therefore, this report aims to analyse the effect of self-citation on h-index by calculating two h-index values, one using the total number of citations and the other excluding the self-citations. A python crawler was developed to collect the citation data for three authors from Google Scholar and store it in a local database for analysis. The citation analysis shows that the h-index value without self-citation decreases, albeit the effect was limited and non-uniform. Bachelor of Engineering (Computer Science) 2015-04-29T07:40:13Z 2015-04-29T07:40:13Z 2015 2015 Final Year Project (FYP) http://hdl.handle.net/10356/62821 en Nanyang Technological University 37 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering::Mathematics of computing::Numerical analysis |
spellingShingle |
DRNTU::Engineering::Computer science and engineering::Mathematics of computing::Numerical analysis Vidur Puliani Citation analysis on Google Scholar |
description |
Google Scholar, Scopus and Web of Science are some of the most commonly used online databases for scholarly work. The mentioned databases vary in their coverage and accuracy of citation counts. This purpose of this report is to conduct citation analysis on Google Scholar. H-index, among other statistics, is a widely used measure to evaluate the number of citations of an author. H-index is often used to evaluate the impact of an author’s work on his or her peers and used as an evaluation tool for grants and promotions. Given the importance of h-index as a measure, it is important to identify and extract any possible distortions to give an unbiased measure of an author’s influence. The total citation counts used to calculate the h-index includes self-citations. Self-citations are citations where the author of the citing paper and cited paper are the same. A higher number of self-citations might correlate with higher h-index, which does not necessarily imply a greater influence of an author’s work. Therefore, this report aims to analyse the effect of self-citation on h-index by calculating two h-index values, one using the total number of citations and the other excluding the self-citations. A python crawler was developed to collect the citation data for three authors from Google Scholar and store it in a local database for analysis. The citation analysis shows that the h-index value without self-citation decreases, albeit the effect was limited and non-uniform. |
author2 |
Xiao Xiao Kui |
author_facet |
Xiao Xiao Kui Vidur Puliani |
format |
Final Year Project |
author |
Vidur Puliani |
author_sort |
Vidur Puliani |
title |
Citation analysis on Google Scholar |
title_short |
Citation analysis on Google Scholar |
title_full |
Citation analysis on Google Scholar |
title_fullStr |
Citation analysis on Google Scholar |
title_full_unstemmed |
Citation analysis on Google Scholar |
title_sort |
citation analysis on google scholar |
publishDate |
2015 |
url |
http://hdl.handle.net/10356/62821 |
_version_ |
1759853598869553152 |