Comparison Between Linear and Non-linear Variable Selection Methods with Applications to Spectroscopic (UV-Vis/NIR) Data
Variable selection aims to identify important parameters in relation to predicted responses. Selection outcomes of the important variables could be different depending on the methods used. In this research, the important variables identified using linear and non-linear variable selection methods bas...
Saved in:
Main Authors: | , , , , , |
---|---|
Language: | English |
Published: |
Science Faculty of Chiang Mai University
2020
|
Subjects: | |
Online Access: | http://epg.science.cmu.ac.th/ejournal/dl.php?journal_id=10583 http://cmuir.cmu.ac.th/jspui/handle/6653943832/67342 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Chiang Mai University |
Language: | English |
id |
th-cmuir.6653943832-67342 |
---|---|
record_format |
dspace |
spelling |
th-cmuir.6653943832-673422020-04-02T14:45:04Z Comparison Between Linear and Non-linear Variable Selection Methods with Applications to Spectroscopic (UV-Vis/NIR) Data Chanida Krongchai Sakunna Wongsaipun Sujitra Funsueb Parichat Theanjumpol Jaroon Jakmunee Sila Kittiwachana variable selection multivariate calibration partial least squares (PLS) elf organizing map (SOM) spectral data analysis Variable selection aims to identify important parameters in relation to predicted responses. Selection outcomes of the important variables could be different depending on the methods used. In this research, the important variables identified using linear and non-linear variable selection methods based on partial least squares-variable important in prediction (PLS-VIP) and self organizing mapdiscrimination index (SOM-DI) were compared. Two datasets, near-infrared (NIR) spectra of adulterated Thai Jasmine rice and ultraviolet-visible (UV-Vis) spectra of food colorant mixtures were used for the demonstration. The advantages and disadvantages for the use of the different algorithms were compared and discussed. For the NIR data, the calibration model using supervised self organizing map (SSOM) offered better prediction results and the SOM-DI variable selection method identified the spectral changes in NIR overtone regions as significance. On the other hand, PLS calibration model resulted in higher predictive errors while the PLS-VIP variable selection captured variation from the visible region between 664 nm and 884 nm. Using the UV-Vis data, PLS appeared to put attention on only the highest absorbance region of the peak maximum absorbance. In contrast, SSOM model highlighted the variation around the isosbestic spectral regions between the mixture components. The drawback for the use of a mixture design to construct the calibration models, leading to wrong interpretation of the important variables, was also discussed. 2020-04-02T14:45:04Z 2020-04-02T14:45:04Z 2020 Chiang Mai Journal of Science 47, 1 (January 2020), 160-174 0125-2526 http://epg.science.cmu.ac.th/ejournal/dl.php?journal_id=10583 http://cmuir.cmu.ac.th/jspui/handle/6653943832/67342 Eng Science Faculty of Chiang Mai University |
institution |
Chiang Mai University |
building |
Chiang Mai University Library |
country |
Thailand |
collection |
CMU Intellectual Repository |
language |
English |
topic |
variable selection multivariate calibration partial least squares (PLS) elf organizing map (SOM) spectral data analysis |
spellingShingle |
variable selection multivariate calibration partial least squares (PLS) elf organizing map (SOM) spectral data analysis Chanida Krongchai Sakunna Wongsaipun Sujitra Funsueb Parichat Theanjumpol Jaroon Jakmunee Sila Kittiwachana Comparison Between Linear and Non-linear Variable Selection Methods with Applications to Spectroscopic (UV-Vis/NIR) Data |
description |
Variable selection aims to identify important parameters in relation to predicted responses. Selection outcomes of the important variables could be different depending on the methods used. In this research, the important variables identified using linear and non-linear variable selection methods based on partial least squares-variable important in prediction (PLS-VIP) and self organizing mapdiscrimination index (SOM-DI) were compared. Two datasets, near-infrared (NIR) spectra of adulterated Thai Jasmine rice and ultraviolet-visible (UV-Vis) spectra of food colorant mixtures were used for the demonstration. The advantages and disadvantages for the use of the different algorithms were compared and discussed. For the NIR data, the calibration model using supervised self organizing map (SSOM) offered better prediction results and the SOM-DI variable selection method identified the spectral changes in NIR overtone regions as significance. On the other hand, PLS calibration model resulted in higher predictive errors while the PLS-VIP variable selection captured variation from the visible region between 664 nm and 884 nm. Using the UV-Vis data, PLS appeared to put attention on only the highest absorbance region of the peak maximum absorbance. In contrast, SSOM model highlighted the variation around the isosbestic spectral regions between the mixture components. The drawback for the use of a mixture design to construct the calibration models, leading to wrong interpretation of the important variables, was also discussed. |
author |
Chanida Krongchai Sakunna Wongsaipun Sujitra Funsueb Parichat Theanjumpol Jaroon Jakmunee Sila Kittiwachana |
author_facet |
Chanida Krongchai Sakunna Wongsaipun Sujitra Funsueb Parichat Theanjumpol Jaroon Jakmunee Sila Kittiwachana |
author_sort |
Chanida Krongchai |
title |
Comparison Between Linear and Non-linear Variable Selection Methods with Applications to Spectroscopic (UV-Vis/NIR) Data |
title_short |
Comparison Between Linear and Non-linear Variable Selection Methods with Applications to Spectroscopic (UV-Vis/NIR) Data |
title_full |
Comparison Between Linear and Non-linear Variable Selection Methods with Applications to Spectroscopic (UV-Vis/NIR) Data |
title_fullStr |
Comparison Between Linear and Non-linear Variable Selection Methods with Applications to Spectroscopic (UV-Vis/NIR) Data |
title_full_unstemmed |
Comparison Between Linear and Non-linear Variable Selection Methods with Applications to Spectroscopic (UV-Vis/NIR) Data |
title_sort |
comparison between linear and non-linear variable selection methods with applications to spectroscopic (uv-vis/nir) data |
publisher |
Science Faculty of Chiang Mai University |
publishDate |
2020 |
url |
http://epg.science.cmu.ac.th/ejournal/dl.php?journal_id=10583 http://cmuir.cmu.ac.th/jspui/handle/6653943832/67342 |
_version_ |
1681426617274990592 |