A simple more general boxplot method for identifying outliers
The boxplot method (Exploratory Data Analysis, Addison-Wesley, Reading, MA, 1977) is a graphically-based method of identifying outliers which is appealing not only in its simplicity but also because it does not use the extreme potential outliers in computing a measure of dispersion. The inner and ou...
Main Authors: | Schwertman, Neil C.; Owens, Margaret Ann; Adnan, Robiah |
---|---|
Format: | Article |
Published: | Elsevier Science, 2004 |
Subjects: | QA76 Computer software |
Online Access: | http://eprints.utm.my/id/eprint/12180/ http://dx.doi.org/10.1016/j.csda.2003.10.012 |
Institution: | Universiti Teknologi Malaysia |
id |
my.utm.12180 |
---|---|
record_format |
eprints |
spelling |
my.utm.12180 2017-10-08T04:54:53Z http://eprints.utm.my/id/eprint/12180/ Schwertman, Neil C. and Owens, Margaret Ann and Adnan, Robiah (2004) A simple more general boxplot method for identifying outliers. Computational Statistics and Data Analysis, 47. pp. 165-174. ISSN 01679473 http://dx.doi.org/10.1016/j.csda.2003.10.012 doi:10.1016/j.csda.2003.10.012 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
topic |
QA76 Computer software |
description |
The boxplot method (Exploratory Data Analysis, Addison-Wesley, Reading, MA, 1977) is a graphically-based method of identifying outliers which is appealing not only in its simplicity but also because it does not use the extreme potential outliers in computing a measure of dispersion. The inner and outer fences are defined in terms of the hinges (or fourths), and therefore are not distorted by a few extreme values. Such distortion could lead to failing to detect some outliers, a problem known as "masking". A method for determining the probability associated with any fence or observation is proposed based on the cumulative distribution function of the order statistics. This allows the statistician to easily assess, in a probability sense, the degree to which an observation is dissimilar to the majority of the observations. In addition, an adaptation for approximately normal but somewhat asymmetric distributions is suggested. |
format |
Article |
author |
Schwertman, Neil C. Owens, Margaret Ann Adnan, Robiah |
title |
A simple more general boxplot method for identifying outliers |
publisher |
Elsevier Science |
publishDate |
2004 |
url |
http://eprints.utm.my/id/eprint/12180/ http://dx.doi.org/10.1016/j.csda.2003.10.012 |
_version_ |
1643645880830525440 |
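
The description field above says that the inner and outer fences are built from the hinges and so are not distorted by a few extreme values. The sketch below illustrates only that classical construction, not the probability-based fences the article proposes, whose constants are not given in this record. The multipliers 1.5 and 3.0 are the conventional boxplot values and are an assumption here, as are the function names, the sample data, and the mean plus or minus three standard deviations rule used as a contrast for masking.

```python
from statistics import mean, median, stdev


def hinges(values):
    """Return the lower and upper hinges (fourths).

    Each hinge is the median of one half of the sorted data; when the
    sample size is odd, the overall median belongs to both halves.
    """
    xs = sorted(values)
    n = len(xs)
    half = (n + 1) // 2
    return median(xs[:half]), median(xs[n - half:])


def fences(values, inner_k=1.5, outer_k=3.0):
    """Inner and outer fences from the hinges (multipliers are assumed)."""
    lower, upper = hinges(values)
    spread = upper - lower  # distance between the hinges
    return {
        "inner": (lower - inner_k * spread, upper + inner_k * spread),
        "outer": (lower - outer_k * spread, upper + outer_k * spread),
    }


def fence_outliers(values, k=1.5):
    """Observations lying outside the fences built with multiplier k."""
    lower, upper = hinges(values)
    spread = upper - lower
    low, high = lower - k * spread, upper + k * spread
    return [x for x in values if x < low or x > high]


def three_sigma_outliers(values):
    """Contrast rule: mean +/- 3 sample standard deviations.

    The extreme values inflate the standard deviation, so this rule can
    miss them (the "masking" problem mentioned in the abstract).
    """
    m, s = mean(values), stdev(values)
    return [x for x in values if abs(x - m) > 3 * s]


# Hypothetical sample with two extreme observations.
data = [2.1, 2.4, 2.5, 2.7, 2.8, 3.0, 3.1, 3.3, 9.5, 12.0]

print(hinges(data))                # (2.5, 3.3)
print(fence_outliers(data))        # [9.5, 12.0]
print(three_sigma_outliers(data))  # []
```

In this sample the hinges are computed from the central observations, so both extreme values fall outside the inner fences, whereas the standard deviation rule flags neither of them.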