The Emerging "Big Dimensionality"
The world continues to generate quintillion bytes of data daily, leading to the pressing needs for new efforts in dealing with the grand challenges brought by Big Data. Today, there is a growing consensus among the computational intelligence communities that data volume presents an immediate challen...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2016
|
Online Access: | https://hdl.handle.net/10356/81714 http://hdl.handle.net/10220/39675 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-81714 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-817142020-05-28T07:17:36Z The Emerging "Big Dimensionality" Zhai, Yiteng Ong, Yew-Soon Tsang, Ivor W. School of Computer Engineering The world continues to generate quintillion bytes of data daily, leading to the pressing needs for new efforts in dealing with the grand challenges brought by Big Data. Today, there is a growing consensus among the computational intelligence communities that data volume presents an immediate challenge pertaining to the scalability issue. However, when addressing volume in Big Data analytics, researchers in the data analytics community have largely taken a one-sided study of volume, which is the "Big Instance Size" factor of the data. The flip side of volume which is the dimensionality factor of Big Data, on the other hand, has received much lesser attention. This article thus represents an attempt to fill in this gap and places special focus on this relatively under-explored topic of "Big Dimensionality", wherein the explosion of features (variables) brings about new challenges to computational intelligence. We begin with an analysis on the origins of Big Dimensionality. The evolution of feature dimensionality in the last two decades is then studied using popular data repositories considered in the data analytics and computational intelligence research communities. Subsequently, the state-of-the-art feature selection schemes reported in the field of computational intelligence are reviewed to reveal the inadequacies of existing approaches in keeping pace with the emerging phenomenon of Big Dimensionality. Last but not least, the "curse and blessing of Big Dimensionality" are delineated and deliberated. ASTAR (Agency for Sci., Tech. and Research, S’pore) Accepted version 2016-01-12T07:37:45Z 2019-12-06T14:36:43Z 2016-01-12T07:37:45Z 2019-12-06T14:36:43Z 2014 Journal Article Zhai, Y., Ong, Y.-S., & Tsang, I. W. (2014). The Emerging "Big Dimensionality". IEEE Computational Intelligence Magazine, 9(3), 14-26. 1556-603X https://hdl.handle.net/10356/81714 http://hdl.handle.net/10220/39675 10.1109/MCI.2014.2326099 en IEEE Computational Intelligence Magazine © 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/MCI.2014.2326099]. 13 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
description |
The world continues to generate quintillion bytes of data daily, leading to the pressing needs for new efforts in dealing with the grand challenges brought by Big Data. Today, there is a growing consensus among the computational intelligence communities that data volume presents an immediate challenge pertaining to the scalability issue. However, when addressing volume in Big Data analytics, researchers in the data analytics community have largely taken a one-sided study of volume, which is the "Big Instance Size" factor of the data. The flip side of volume which is the dimensionality factor of Big Data, on the other hand, has received much lesser attention. This article thus represents an attempt to fill in this gap and places special focus on this relatively under-explored topic of "Big Dimensionality", wherein the explosion of features (variables) brings about new challenges to computational intelligence. We begin with an analysis on the origins of Big Dimensionality. The evolution of feature dimensionality in the last two decades is then studied using popular data repositories considered in the data analytics and computational intelligence research communities. Subsequently, the state-of-the-art feature selection schemes reported in the field of computational intelligence are reviewed to reveal the inadequacies of existing approaches in keeping pace with the emerging phenomenon of Big Dimensionality. Last but not least, the "curse and blessing of Big Dimensionality" are delineated and deliberated. |
author2 |
School of Computer Engineering |
author_facet |
School of Computer Engineering Zhai, Yiteng Ong, Yew-Soon Tsang, Ivor W. |
format |
Article |
author |
Zhai, Yiteng Ong, Yew-Soon Tsang, Ivor W. |
spellingShingle |
Zhai, Yiteng Ong, Yew-Soon Tsang, Ivor W. The Emerging "Big Dimensionality" |
author_sort |
Zhai, Yiteng |
title |
The Emerging "Big Dimensionality" |
title_short |
The Emerging "Big Dimensionality" |
title_full |
The Emerging "Big Dimensionality" |
title_fullStr |
The Emerging "Big Dimensionality" |
title_full_unstemmed |
The Emerging "Big Dimensionality" |
title_sort |
emerging "big dimensionality" |
publishDate |
2016 |
url |
https://hdl.handle.net/10356/81714 http://hdl.handle.net/10220/39675 |
_version_ |
1681058887577370624 |