Fast emulation of self-organizing maps for large datasets

© 2015 The Authors. Published by Elsevier B.V. The self-organizing map (SOM) methodology does vector quantization and clustering on the dataset, and then projects the obtained clusters to a lower dimensional space, such as a 2D map, by positioning similar clusters in locations that are spatially clo...

Full description

Saved in:
Bibliographic Details
Main Authors: Cordel, Macario O., Azcarraga, Arnulfo P.
Format: text
Published: Animo Repository 2015
Online Access:https://animorepository.dlsu.edu.ph/faculty_research/764
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
id oai:animorepository.dlsu.edu.ph:faculty_research-1763
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:faculty_research-17632022-11-16T02:54:40Z Fast emulation of self-organizing maps for large datasets Cordel, Macario O. Azcarraga, Arnulfo P. © 2015 The Authors. Published by Elsevier B.V. The self-organizing map (SOM) methodology does vector quantization and clustering on the dataset, and then projects the obtained clusters to a lower dimensional space, such as a 2D map, by positioning similar clusters in locations that are spatially closer in the lower dimension space. This makes the SOM methodology an effective tool for data visualization. However, in a world where mined information from big data have to be available immediately, SOM becomes an unattractive tool because of its time complexity. In this paper, we propose an alternative visualization methodology for large datasets that emulates SOM methodology without the speed constraints inherent to SOM. To demonstrate the efficiency and the potential of the proposed scheme as a fast visualization tool, the methodology is used to cluster and project the 3,823 image samples of handwritten digits of the Optical Recognition of Handwritten Digits dataset. Although the dataset is not, by any means large, it is sufficient to demonstrate the speed-up that can be achieved by using this proposed SOM emulation procedure. 2015-01-01T08:00:00Z text text/html https://animorepository.dlsu.edu.ph/faculty_research/764 Faculty Research Work Animo Repository
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
description © 2015 The Authors. Published by Elsevier B.V. The self-organizing map (SOM) methodology does vector quantization and clustering on the dataset, and then projects the obtained clusters to a lower dimensional space, such as a 2D map, by positioning similar clusters in locations that are spatially closer in the lower dimension space. This makes the SOM methodology an effective tool for data visualization. However, in a world where mined information from big data have to be available immediately, SOM becomes an unattractive tool because of its time complexity. In this paper, we propose an alternative visualization methodology for large datasets that emulates SOM methodology without the speed constraints inherent to SOM. To demonstrate the efficiency and the potential of the proposed scheme as a fast visualization tool, the methodology is used to cluster and project the 3,823 image samples of handwritten digits of the Optical Recognition of Handwritten Digits dataset. Although the dataset is not, by any means large, it is sufficient to demonstrate the speed-up that can be achieved by using this proposed SOM emulation procedure.
format text
author Cordel, Macario O.
Azcarraga, Arnulfo P.
spellingShingle Cordel, Macario O.
Azcarraga, Arnulfo P.
Fast emulation of self-organizing maps for large datasets
author_facet Cordel, Macario O.
Azcarraga, Arnulfo P.
author_sort Cordel, Macario O.
title Fast emulation of self-organizing maps for large datasets
title_short Fast emulation of self-organizing maps for large datasets
title_full Fast emulation of self-organizing maps for large datasets
title_fullStr Fast emulation of self-organizing maps for large datasets
title_full_unstemmed Fast emulation of self-organizing maps for large datasets
title_sort fast emulation of self-organizing maps for large datasets
publisher Animo Repository
publishDate 2015
url https://animorepository.dlsu.edu.ph/faculty_research/764
_version_ 1751550428821061632