A synthetic data generation procedure for univariate circular data with various outliers scenarios using Python programming language

Synthetic data is artificial data that is created based on the statistical properties of the original data. The aim of this study is to generate a synthetic or simulated data for univariate circular data that follow von Mises (VM) distribution with various outliers scenario using Python programming...

Full description

Saved in:
Bibliographic Details
Main Authors: Nur Syahirah, Zulkipli, Siti Zanariah, Satari, Wan Nur Syahidah, Wan Yusoff
Format: Conference or Workshop Item
Language:English
Published: IOP Publishing Ltd 2021
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/35201/1/A%20synthetic%20data%20generation%20procedure%20for%20univariate%20circular%20data%20with%20various%20outliers%20scenarios.pdf
http://umpir.ump.edu.my/id/eprint/35201/
https://doi.org/10.1088/1742-6596/1988/1/012111
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Pahang
Language: English
id my.ump.umpir.35201
record_format eprints
spelling my.ump.umpir.352012022-10-17T05:11:18Z http://umpir.ump.edu.my/id/eprint/35201/ A synthetic data generation procedure for univariate circular data with various outliers scenarios using Python programming language Nur Syahirah, Zulkipli Siti Zanariah, Satari Wan Nur Syahidah, Wan Yusoff Q Science (General) QA Mathematics Synthetic data is artificial data that is created based on the statistical properties of the original data. The aim of this study is to generate a synthetic or simulated data for univariate circular data that follow von Mises (VM) distribution with various outliers scenario using Python programming language. The procedure of formulation a synthetic data generation is proposed in this study. The synthetic data is generated from various combinations of seven sample size, n and five concentration parameters, K. Moreover, a synthetic data will be generated by formulating a data generation procedure with different condition of outliers scenarios. Three outliers scenarios are proposed in this study to introduce the outliers in synthetic dataset by placing them away from inliers at a specific distance. The number of outliers planted in the dataset are fixed with three outliers. The synthetic data is randomly generated by using Python library and package which are 'numpy', 'random' and von Mises'. In conclusion, the synthetic data of univariate circular data from von Mises distribution is generated and the outliers are successfully introduced in the dataset with three outliers scenarios using Python. This study will be valuable for those who are interested to study univariate circular data with outliers and choose Python as an analysis tool. IOP Publishing Ltd 2021-08-17 Conference or Workshop Item PeerReviewed pdf en cc_by http://umpir.ump.edu.my/id/eprint/35201/1/A%20synthetic%20data%20generation%20procedure%20for%20univariate%20circular%20data%20with%20various%20outliers%20scenarios.pdf Nur Syahirah, Zulkipli and Siti Zanariah, Satari and Wan Nur Syahidah, Wan Yusoff (2021) A synthetic data generation procedure for univariate circular data with various outliers scenarios using Python programming language. In: 28th Simposium Kebangsaan Sains Matematik, SKSM 2021, 28-29 July 2021 , Kuantan, Pahang, Virtual. pp. 1-10., 1988 (012111). ISSN 1742-6588 https://doi.org/10.1088/1742-6596/1988/1/012111
institution Universiti Malaysia Pahang
building UMP Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Pahang
content_source UMP Institutional Repository
url_provider http://umpir.ump.edu.my/
language English
topic Q Science (General)
QA Mathematics
spellingShingle Q Science (General)
QA Mathematics
Nur Syahirah, Zulkipli
Siti Zanariah, Satari
Wan Nur Syahidah, Wan Yusoff
A synthetic data generation procedure for univariate circular data with various outliers scenarios using Python programming language
description Synthetic data is artificial data that is created based on the statistical properties of the original data. The aim of this study is to generate a synthetic or simulated data for univariate circular data that follow von Mises (VM) distribution with various outliers scenario using Python programming language. The procedure of formulation a synthetic data generation is proposed in this study. The synthetic data is generated from various combinations of seven sample size, n and five concentration parameters, K. Moreover, a synthetic data will be generated by formulating a data generation procedure with different condition of outliers scenarios. Three outliers scenarios are proposed in this study to introduce the outliers in synthetic dataset by placing them away from inliers at a specific distance. The number of outliers planted in the dataset are fixed with three outliers. The synthetic data is randomly generated by using Python library and package which are 'numpy', 'random' and von Mises'. In conclusion, the synthetic data of univariate circular data from von Mises distribution is generated and the outliers are successfully introduced in the dataset with three outliers scenarios using Python. This study will be valuable for those who are interested to study univariate circular data with outliers and choose Python as an analysis tool.
format Conference or Workshop Item
author Nur Syahirah, Zulkipli
Siti Zanariah, Satari
Wan Nur Syahidah, Wan Yusoff
author_facet Nur Syahirah, Zulkipli
Siti Zanariah, Satari
Wan Nur Syahidah, Wan Yusoff
author_sort Nur Syahirah, Zulkipli
title A synthetic data generation procedure for univariate circular data with various outliers scenarios using Python programming language
title_short A synthetic data generation procedure for univariate circular data with various outliers scenarios using Python programming language
title_full A synthetic data generation procedure for univariate circular data with various outliers scenarios using Python programming language
title_fullStr A synthetic data generation procedure for univariate circular data with various outliers scenarios using Python programming language
title_full_unstemmed A synthetic data generation procedure for univariate circular data with various outliers scenarios using Python programming language
title_sort synthetic data generation procedure for univariate circular data with various outliers scenarios using python programming language
publisher IOP Publishing Ltd
publishDate 2021
url http://umpir.ump.edu.my/id/eprint/35201/1/A%20synthetic%20data%20generation%20procedure%20for%20univariate%20circular%20data%20with%20various%20outliers%20scenarios.pdf
http://umpir.ump.edu.my/id/eprint/35201/
https://doi.org/10.1088/1742-6596/1988/1/012111
_version_ 1748180685195051008