Unsupervised representation learning by predicting random distances

Deep neural networks have gained great success in a broad range of tasks due to its remarkable capability to learn semantically rich features from high-dimensional data. However, they often require large-scale labelled data to successfully learn such features, which significantly hinders their adapt...

Full description

Saved in:
Bibliographic Details
Main Authors: WANG, Hu, PANG, Guansong, SHEN, Chunhua, MA, Congbo
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2021
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7136
https://ink.library.smu.edu.sg/context/sis_research/article/8139/viewcontent/0408.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8139
record_format dspace
spelling sg-smu-ink.sis_research-81392022-04-22T04:23:35Z Unsupervised representation learning by predicting random distances WANG, Hu PANG, Guansong SHEN, Chunhua MA, Congbo Deep neural networks have gained great success in a broad range of tasks due to its remarkable capability to learn semantically rich features from high-dimensional data. However, they often require large-scale labelled data to successfully learn such features, which significantly hinders their adaption in unsupervised learning tasks, such as anomaly detection and clustering, and limits their applications to critical domains where obtaining massive labelled data is prohibitively expensive. To enable unsupervised learning on those domains, in this work we propose to learn features without using any labelled data by training neural networks to predict data distances in a randomly projected space. Random mapping is a theoretically proven approach to obtain approximately preserved distances. To well predict these distances, the representation learner is optimised to learn genuine class structures that are implicitly embedded in the randomly projected space. Empirical results on 19 real-world datasets show that our learned representations substantially outperform a few state-of-theart methods for both anomaly detection and clustering tasks. Code is available at: https://git.io/ RDP 2021-01-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7136 info:doi/10.24963/ijcai.2020/408 https://ink.library.smu.edu.sg/context/sis_research/article/8139/viewcontent/0408.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems OS and Networks
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
OS and Networks
spellingShingle Databases and Information Systems
OS and Networks
WANG, Hu
PANG, Guansong
SHEN, Chunhua
MA, Congbo
Unsupervised representation learning by predicting random distances
description Deep neural networks have gained great success in a broad range of tasks due to its remarkable capability to learn semantically rich features from high-dimensional data. However, they often require large-scale labelled data to successfully learn such features, which significantly hinders their adaption in unsupervised learning tasks, such as anomaly detection and clustering, and limits their applications to critical domains where obtaining massive labelled data is prohibitively expensive. To enable unsupervised learning on those domains, in this work we propose to learn features without using any labelled data by training neural networks to predict data distances in a randomly projected space. Random mapping is a theoretically proven approach to obtain approximately preserved distances. To well predict these distances, the representation learner is optimised to learn genuine class structures that are implicitly embedded in the randomly projected space. Empirical results on 19 real-world datasets show that our learned representations substantially outperform a few state-of-theart methods for both anomaly detection and clustering tasks. Code is available at: https://git.io/ RDP
format text
author WANG, Hu
PANG, Guansong
SHEN, Chunhua
MA, Congbo
author_facet WANG, Hu
PANG, Guansong
SHEN, Chunhua
MA, Congbo
author_sort WANG, Hu
title Unsupervised representation learning by predicting random distances
title_short Unsupervised representation learning by predicting random distances
title_full Unsupervised representation learning by predicting random distances
title_fullStr Unsupervised representation learning by predicting random distances
title_full_unstemmed Unsupervised representation learning by predicting random distances
title_sort unsupervised representation learning by predicting random distances
publisher Institutional Knowledge at Singapore Management University
publishDate 2021
url https://ink.library.smu.edu.sg/sis_research/7136
https://ink.library.smu.edu.sg/context/sis_research/article/8139/viewcontent/0408.pdf
_version_ 1770576229743722496