Comparison between linear and non-linear variable selection methods with applications to spectroscopic (UV-Vis/NIR) data

© 2020, Chiang Mai University. All rights reserved. Variable selection aims to identify important parameters in relation to predicted responses. Selection outcomes of the important variables could be different depending on the methods used. In this research, the important variables identified using...

Full description

Saved in:
Bibliographic Details
Main Authors: Chanida Krongchai, Sakunna Wongsaipun, Sujitra Funsueb, Parichat Theanjumpol, Jaroon Jakmunee, Sila Kittiwachana
Format: Journal
Published: 2020
Subjects:
Online Access:https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85078946174&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/68270
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Chiang Mai University
id th-cmuir.6653943832-68270
record_format dspace
spelling th-cmuir.6653943832-682702020-04-02T15:30:17Z Comparison between linear and non-linear variable selection methods with applications to spectroscopic (UV-Vis/NIR) data Chanida Krongchai Sakunna Wongsaipun Sujitra Funsueb Parichat Theanjumpol Jaroon Jakmunee Sila Kittiwachana Biochemistry, Genetics and Molecular Biology Chemistry Materials Science Mathematics Physics and Astronomy © 2020, Chiang Mai University. All rights reserved. Variable selection aims to identify important parameters in relation to predicted responses. Selection outcomes of the important variables could be different depending on the methods used. In this research, the important variables identified using linear and non-linear variable selection methods based on partial least squares-variable important in prediction (PLS-VIP) and self organizing map-discrimination index (SOM-DI) were compared. Two datasets, near-infrared (NIR) spectra of adulterated Thai Jasmine rice and ultraviolet-visible (UV-Vis) spectra of food colorant mixtures were used for the demonstration. The advantages and disadvantages for the use of the different algorithms were compared and discussed. For the NIR data, the calibration model using supervised self organizing map (SSOM) offered better prediction results and the SOM-DI variable selection method identified the spectral changes in NIR overtone regions as significance. On the other hand, PLS calibration model resulted in higher predictive errors while the PLS-VIP variable selection captured variation from the visible region between 664 nm and 884 nm. Using the UV-Vis data, PLS appeared to put attention on only the highest absorbance region of the peak maximum absorbance. In contrast, SSOM model highlighted the variation around the isosbestic spectral regions between the mixture components. The drawback for the use of a mixture design to construct the calibration models, leading to wrong interpretation of the important variables, was also discussed. 2020-04-02T15:24:03Z 2020-04-02T15:24:03Z 2020-01-01 Journal 01252526 2-s2.0-85078946174 https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85078946174&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/68270
institution Chiang Mai University
building Chiang Mai University Library
country Thailand
collection CMU Intellectual Repository
topic Biochemistry, Genetics and Molecular Biology
Chemistry
Materials Science
Mathematics
Physics and Astronomy
spellingShingle Biochemistry, Genetics and Molecular Biology
Chemistry
Materials Science
Mathematics
Physics and Astronomy
Chanida Krongchai
Sakunna Wongsaipun
Sujitra Funsueb
Parichat Theanjumpol
Jaroon Jakmunee
Sila Kittiwachana
Comparison between linear and non-linear variable selection methods with applications to spectroscopic (UV-Vis/NIR) data
description © 2020, Chiang Mai University. All rights reserved. Variable selection aims to identify important parameters in relation to predicted responses. Selection outcomes of the important variables could be different depending on the methods used. In this research, the important variables identified using linear and non-linear variable selection methods based on partial least squares-variable important in prediction (PLS-VIP) and self organizing map-discrimination index (SOM-DI) were compared. Two datasets, near-infrared (NIR) spectra of adulterated Thai Jasmine rice and ultraviolet-visible (UV-Vis) spectra of food colorant mixtures were used for the demonstration. The advantages and disadvantages for the use of the different algorithms were compared and discussed. For the NIR data, the calibration model using supervised self organizing map (SSOM) offered better prediction results and the SOM-DI variable selection method identified the spectral changes in NIR overtone regions as significance. On the other hand, PLS calibration model resulted in higher predictive errors while the PLS-VIP variable selection captured variation from the visible region between 664 nm and 884 nm. Using the UV-Vis data, PLS appeared to put attention on only the highest absorbance region of the peak maximum absorbance. In contrast, SSOM model highlighted the variation around the isosbestic spectral regions between the mixture components. The drawback for the use of a mixture design to construct the calibration models, leading to wrong interpretation of the important variables, was also discussed.
format Journal
author Chanida Krongchai
Sakunna Wongsaipun
Sujitra Funsueb
Parichat Theanjumpol
Jaroon Jakmunee
Sila Kittiwachana
author_facet Chanida Krongchai
Sakunna Wongsaipun
Sujitra Funsueb
Parichat Theanjumpol
Jaroon Jakmunee
Sila Kittiwachana
author_sort Chanida Krongchai
title Comparison between linear and non-linear variable selection methods with applications to spectroscopic (UV-Vis/NIR) data
title_short Comparison between linear and non-linear variable selection methods with applications to spectroscopic (UV-Vis/NIR) data
title_full Comparison between linear and non-linear variable selection methods with applications to spectroscopic (UV-Vis/NIR) data
title_fullStr Comparison between linear and non-linear variable selection methods with applications to spectroscopic (UV-Vis/NIR) data
title_full_unstemmed Comparison between linear and non-linear variable selection methods with applications to spectroscopic (UV-Vis/NIR) data
title_sort comparison between linear and non-linear variable selection methods with applications to spectroscopic (uv-vis/nir) data
publishDate 2020
url https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85078946174&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/68270
_version_ 1681426789396643840