The Emerging "Big Dimensionality"

The world continues to generate quintillion bytes of data daily, leading to the pressing needs for new efforts in dealing with the grand challenges brought by Big Data. Today, there is a growing consensus among the computational intelligence communities that data volume presents an immediate challen...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhai, Yiteng, Ong, Yew-Soon, Tsang, Ivor W.
Other Authors: School of Computer Engineering
Format: Article
Language:English
Published: 2016
Online Access:https://hdl.handle.net/10356/81714
http://hdl.handle.net/10220/39675
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-81714
record_format dspace
spelling sg-ntu-dr.10356-817142020-05-28T07:17:36Z The Emerging "Big Dimensionality" Zhai, Yiteng Ong, Yew-Soon Tsang, Ivor W. School of Computer Engineering The world continues to generate quintillion bytes of data daily, leading to the pressing needs for new efforts in dealing with the grand challenges brought by Big Data. Today, there is a growing consensus among the computational intelligence communities that data volume presents an immediate challenge pertaining to the scalability issue. However, when addressing volume in Big Data analytics, researchers in the data analytics community have largely taken a one-sided study of volume, which is the "Big Instance Size" factor of the data. The flip side of volume which is the dimensionality factor of Big Data, on the other hand, has received much lesser attention. This article thus represents an attempt to fill in this gap and places special focus on this relatively under-explored topic of "Big Dimensionality", wherein the explosion of features (variables) brings about new challenges to computational intelligence. We begin with an analysis on the origins of Big Dimensionality. The evolution of feature dimensionality in the last two decades is then studied using popular data repositories considered in the data analytics and computational intelligence research communities. Subsequently, the state-of-the-art feature selection schemes reported in the field of computational intelligence are reviewed to reveal the inadequacies of existing approaches in keeping pace with the emerging phenomenon of Big Dimensionality. Last but not least, the "curse and blessing of Big Dimensionality" are delineated and deliberated. ASTAR (Agency for Sci., Tech. and Research, S’pore) Accepted version 2016-01-12T07:37:45Z 2019-12-06T14:36:43Z 2016-01-12T07:37:45Z 2019-12-06T14:36:43Z 2014 Journal Article Zhai, Y., Ong, Y.-S., & Tsang, I. W. (2014). The Emerging "Big Dimensionality". IEEE Computational Intelligence Magazine, 9(3), 14-26. 1556-603X https://hdl.handle.net/10356/81714 http://hdl.handle.net/10220/39675 10.1109/MCI.2014.2326099 en IEEE Computational Intelligence Magazine © 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/MCI.2014.2326099]. 13 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
description The world continues to generate quintillion bytes of data daily, leading to the pressing needs for new efforts in dealing with the grand challenges brought by Big Data. Today, there is a growing consensus among the computational intelligence communities that data volume presents an immediate challenge pertaining to the scalability issue. However, when addressing volume in Big Data analytics, researchers in the data analytics community have largely taken a one-sided study of volume, which is the "Big Instance Size" factor of the data. The flip side of volume which is the dimensionality factor of Big Data, on the other hand, has received much lesser attention. This article thus represents an attempt to fill in this gap and places special focus on this relatively under-explored topic of "Big Dimensionality", wherein the explosion of features (variables) brings about new challenges to computational intelligence. We begin with an analysis on the origins of Big Dimensionality. The evolution of feature dimensionality in the last two decades is then studied using popular data repositories considered in the data analytics and computational intelligence research communities. Subsequently, the state-of-the-art feature selection schemes reported in the field of computational intelligence are reviewed to reveal the inadequacies of existing approaches in keeping pace with the emerging phenomenon of Big Dimensionality. Last but not least, the "curse and blessing of Big Dimensionality" are delineated and deliberated.
author2 School of Computer Engineering
author_facet School of Computer Engineering
Zhai, Yiteng
Ong, Yew-Soon
Tsang, Ivor W.
format Article
author Zhai, Yiteng
Ong, Yew-Soon
Tsang, Ivor W.
spellingShingle Zhai, Yiteng
Ong, Yew-Soon
Tsang, Ivor W.
The Emerging "Big Dimensionality"
author_sort Zhai, Yiteng
title The Emerging "Big Dimensionality"
title_short The Emerging "Big Dimensionality"
title_full The Emerging "Big Dimensionality"
title_fullStr The Emerging "Big Dimensionality"
title_full_unstemmed The Emerging "Big Dimensionality"
title_sort emerging "big dimensionality"
publishDate 2016
url https://hdl.handle.net/10356/81714
http://hdl.handle.net/10220/39675
_version_ 1681058887577370624