Replacing missing values using trustworthy data values from web data sources
In practice, collected data usually are incomplete and contains missing value. Existing approaches in managing missing values overlook the importance of trustworthy data values in replacing missing values. In view that trusted completed data is very important in data analysis, we proposed a fram...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Institute of Physics Publishing
2017
|
Online Access: | http://psasir.upm.edu.my/id/eprint/62958/1/Replacing%20missing%20values%20using%20trustworthy%20data%20values%20from%20web%20data%20sources.pdf http://psasir.upm.edu.my/id/eprint/62958/ http://iopscience.iop.org/article/10.1088/1742-6596/892/1/012009/pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Putra Malaysia |
Language: | English |
id |
my.upm.eprints.62958 |
---|---|
record_format |
eprints |
spelling |
my.upm.eprints.629582018-11-28T09:23:49Z http://psasir.upm.edu.my/id/eprint/62958/ Replacing missing values using trustworthy data values from web data sources Mohd Jaya, Mohd Izham Sidi, Fatimah Mat Yusof, Sharmila Affendey, Lilly Suriani Ishak, Iskandar A. Jabar, Marzanah In practice, collected data usually are incomplete and contains missing value. Existing approaches in managing missing values overlook the importance of trustworthy data values in replacing missing values. In view that trusted completed data is very important in data analysis, we proposed a framework of missing value replacement using trustworthy data values from web data sources. The proposed framework adopted ontology to map data values from web data sources to the incomplete dataset. As data from web is conflicting with each other, we proposed a trust score measurement based on data accuracy and data reliability. Trust score is then used to select trustworthy data values from web data sources for missing values replacement. We successfully implemented the proposed framework using financial dataset and presented the findings in this paper. From our experiment, we manage to show that replacing missing values with trustworthy data values is important especially in a case of conflicting data to solve missing values problem. Institute of Physics Publishing 2017 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/62958/1/Replacing%20missing%20values%20using%20trustworthy%20data%20values%20from%20web%20data%20sources.pdf Mohd Jaya, Mohd Izham and Sidi, Fatimah and Mat Yusof, Sharmila and Affendey, Lilly Suriani and Ishak, Iskandar and A. Jabar, Marzanah (2017) Replacing missing values using trustworthy data values from web data sources. Journal of Physics: Conference Series, 892 (1). pp. 1-11. ISSN 1742-6588; ESSN: 1742-6596 http://iopscience.iop.org/article/10.1088/1742-6596/892/1/012009/pdf 10.1088/1742-6596/892/1/012009 |
institution |
Universiti Putra Malaysia |
building |
UPM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Putra Malaysia |
content_source |
UPM Institutional Repository |
url_provider |
http://psasir.upm.edu.my/ |
language |
English |
description |
In practice, collected data usually are incomplete and contains missing value. Existing approaches in managing missing values overlook the importance of trustworthy data values in replacing missing values. In view that trusted completed data is very important in data analysis, we proposed a framework of missing value replacement using trustworthy data values from web data sources. The proposed framework adopted ontology to map data values from web data sources to the incomplete dataset. As data from web is conflicting with each other, we proposed a trust score measurement based on data accuracy and data reliability. Trust score is then used to select trustworthy data values from web data sources for missing values replacement. We successfully implemented the proposed framework using financial dataset and presented the findings in this paper. From our experiment, we manage to show that replacing missing values with trustworthy data values is important especially in a case of conflicting data to solve missing values problem. |
format |
Article |
author |
Mohd Jaya, Mohd Izham Sidi, Fatimah Mat Yusof, Sharmila Affendey, Lilly Suriani Ishak, Iskandar A. Jabar, Marzanah |
spellingShingle |
Mohd Jaya, Mohd Izham Sidi, Fatimah Mat Yusof, Sharmila Affendey, Lilly Suriani Ishak, Iskandar A. Jabar, Marzanah Replacing missing values using trustworthy data values from web data sources |
author_facet |
Mohd Jaya, Mohd Izham Sidi, Fatimah Mat Yusof, Sharmila Affendey, Lilly Suriani Ishak, Iskandar A. Jabar, Marzanah |
author_sort |
Mohd Jaya, Mohd Izham |
title |
Replacing missing values using trustworthy data values from web data sources |
title_short |
Replacing missing values using trustworthy data values from web data sources |
title_full |
Replacing missing values using trustworthy data values from web data sources |
title_fullStr |
Replacing missing values using trustworthy data values from web data sources |
title_full_unstemmed |
Replacing missing values using trustworthy data values from web data sources |
title_sort |
replacing missing values using trustworthy data values from web data sources |
publisher |
Institute of Physics Publishing |
publishDate |
2017 |
url |
http://psasir.upm.edu.my/id/eprint/62958/1/Replacing%20missing%20values%20using%20trustworthy%20data%20values%20from%20web%20data%20sources.pdf http://psasir.upm.edu.my/id/eprint/62958/ http://iopscience.iop.org/article/10.1088/1742-6596/892/1/012009/pdf |
_version_ |
1643837718668509184 |