Replacing missing values using trustworthy data values from web data sources

In practice, collected data usually are incomplete and contains missing value. Existing approaches in managing missing values overlook the importance of trustworthy data values in replacing missing values. In view that trusted completed data is very important in data analysis, we proposed a fram...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohd Jaya, Mohd Izham, Sidi, Fatimah, Mat Yusof, Sharmila, Affendey, Lilly Suriani, Ishak, Iskandar, A. Jabar, Marzanah
Format: Article
Language:English
Published: Institute of Physics Publishing 2017
Online Access:http://psasir.upm.edu.my/id/eprint/62958/1/Replacing%20missing%20values%20using%20trustworthy%20data%20values%20from%20web%20data%20sources.pdf
http://psasir.upm.edu.my/id/eprint/62958/
http://iopscience.iop.org/article/10.1088/1742-6596/892/1/012009/pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Putra Malaysia
Language: English
id my.upm.eprints.62958
record_format eprints
spelling my.upm.eprints.629582018-11-28T09:23:49Z http://psasir.upm.edu.my/id/eprint/62958/ Replacing missing values using trustworthy data values from web data sources Mohd Jaya, Mohd Izham Sidi, Fatimah Mat Yusof, Sharmila Affendey, Lilly Suriani Ishak, Iskandar A. Jabar, Marzanah In practice, collected data usually are incomplete and contains missing value. Existing approaches in managing missing values overlook the importance of trustworthy data values in replacing missing values. In view that trusted completed data is very important in data analysis, we proposed a framework of missing value replacement using trustworthy data values from web data sources. The proposed framework adopted ontology to map data values from web data sources to the incomplete dataset. As data from web is conflicting with each other, we proposed a trust score measurement based on data accuracy and data reliability. Trust score is then used to select trustworthy data values from web data sources for missing values replacement. We successfully implemented the proposed framework using financial dataset and presented the findings in this paper. From our experiment, we manage to show that replacing missing values with trustworthy data values is important especially in a case of conflicting data to solve missing values problem. Institute of Physics Publishing 2017 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/62958/1/Replacing%20missing%20values%20using%20trustworthy%20data%20values%20from%20web%20data%20sources.pdf Mohd Jaya, Mohd Izham and Sidi, Fatimah and Mat Yusof, Sharmila and Affendey, Lilly Suriani and Ishak, Iskandar and A. Jabar, Marzanah (2017) Replacing missing values using trustworthy data values from web data sources. Journal of Physics: Conference Series, 892 (1). pp. 1-11. ISSN 1742-6588; ESSN: 1742-6596 http://iopscience.iop.org/article/10.1088/1742-6596/892/1/012009/pdf 10.1088/1742-6596/892/1/012009
institution Universiti Putra Malaysia
building UPM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
url_provider http://psasir.upm.edu.my/
language English
description In practice, collected data usually are incomplete and contains missing value. Existing approaches in managing missing values overlook the importance of trustworthy data values in replacing missing values. In view that trusted completed data is very important in data analysis, we proposed a framework of missing value replacement using trustworthy data values from web data sources. The proposed framework adopted ontology to map data values from web data sources to the incomplete dataset. As data from web is conflicting with each other, we proposed a trust score measurement based on data accuracy and data reliability. Trust score is then used to select trustworthy data values from web data sources for missing values replacement. We successfully implemented the proposed framework using financial dataset and presented the findings in this paper. From our experiment, we manage to show that replacing missing values with trustworthy data values is important especially in a case of conflicting data to solve missing values problem.
format Article
author Mohd Jaya, Mohd Izham
Sidi, Fatimah
Mat Yusof, Sharmila
Affendey, Lilly Suriani
Ishak, Iskandar
A. Jabar, Marzanah
spellingShingle Mohd Jaya, Mohd Izham
Sidi, Fatimah
Mat Yusof, Sharmila
Affendey, Lilly Suriani
Ishak, Iskandar
A. Jabar, Marzanah
Replacing missing values using trustworthy data values from web data sources
author_facet Mohd Jaya, Mohd Izham
Sidi, Fatimah
Mat Yusof, Sharmila
Affendey, Lilly Suriani
Ishak, Iskandar
A. Jabar, Marzanah
author_sort Mohd Jaya, Mohd Izham
title Replacing missing values using trustworthy data values from web data sources
title_short Replacing missing values using trustworthy data values from web data sources
title_full Replacing missing values using trustworthy data values from web data sources
title_fullStr Replacing missing values using trustworthy data values from web data sources
title_full_unstemmed Replacing missing values using trustworthy data values from web data sources
title_sort replacing missing values using trustworthy data values from web data sources
publisher Institute of Physics Publishing
publishDate 2017
url http://psasir.upm.edu.my/id/eprint/62958/1/Replacing%20missing%20values%20using%20trustworthy%20data%20values%20from%20web%20data%20sources.pdf
http://psasir.upm.edu.my/id/eprint/62958/
http://iopscience.iop.org/article/10.1088/1742-6596/892/1/012009/pdf
_version_ 1643837718668509184