Factor modeling for clustering high-dimensional time series

We propose a new unsupervised learning method for clustering a large number of time series based on a latent factor structure. Each cluster is characterized by its own cluster-specific factors in addition to some common factors which impact on all the time series concerned. Our setting also offers t...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhang, Bo, Pan, Guangming, Yao, Qiwei, Zhou, Wang
Other Authors: School of Physical and Mathematical Sciences
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/172172
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-172172
record_format dspace
spelling sg-ntu-dr.10356-1721722023-12-04T15:35:00Z Factor modeling for clustering high-dimensional time series Zhang, Bo Pan, Guangming Yao, Qiwei Zhou, Wang School of Physical and Mathematical Sciences Science::Mathematics Eigenanalysis Idiosyncratic Components We propose a new unsupervised learning method for clustering a large number of time series based on a latent factor structure. Each cluster is characterized by its own cluster-specific factors in addition to some common factors which impact on all the time series concerned. Our setting also offers the flexibility that some time series may not belong to any clusters. The consistency with explicit convergence rates is established for the estimation of the common factors, the cluster-specific factors, and the latent clusters. Numerical illustration with both simulated data as well as a real data example is also reported. As a spin-off, the proposed new approach also advances significantly the statistical inference for the factor model of Lam and Yao. Supplementary materials for this article are available online. Ministry of Education (MOE) Published version Bo Zhang is partially supported by National Natural Science Funds of China No.12001517 & 72091212, National Key R&D Program of China - 2022YFA1008000, USTC Research Funds of the Double First-Class Initiative YD2040002005 and The Fundamental Research Funds for the Central Universities WK2040000026 & WK2040000027. Guangming Pan is partially supported by MOE Tier 2 grant 2018-T2-2-112 and MOE Tier 1 grant RG76/21 at the Nanyang Technological University, Singapore. Qiwei Yao is partially supported by EPSRC (UK) Research grant EP/V007556/1. Wang Zhou is partially supported by a grant A-8000440-00-00 at the National University of Singapore. 2023-11-28T04:55:32Z 2023-11-28T04:55:32Z 2023 Journal Article Zhang, B., Pan, G., Yao, Q. & Zhou, W. (2023). Factor modeling for clustering high-dimensional time series. Journal of the American Statistical Association, 1-12. https://dx.doi.org/10.1080/01621459.2023.2183132 0162-1459 https://hdl.handle.net/10356/172172 10.1080/01621459.2023.2183132 2-s2.0-85152429056 1 12 en 2018-T2-2-112 RG76/21 Journal of the American Statistical Association © 2023 The Author(s). Published with license by Taylor & Francis Group, LLC. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The terms on which this article has been published allow the posting of the Accepted Manuscript in a repository by the author(s) or with their consent. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Mathematics
Eigenanalysis
Idiosyncratic Components
spellingShingle Science::Mathematics
Eigenanalysis
Idiosyncratic Components
Zhang, Bo
Pan, Guangming
Yao, Qiwei
Zhou, Wang
Factor modeling for clustering high-dimensional time series
description We propose a new unsupervised learning method for clustering a large number of time series based on a latent factor structure. Each cluster is characterized by its own cluster-specific factors in addition to some common factors which impact on all the time series concerned. Our setting also offers the flexibility that some time series may not belong to any clusters. The consistency with explicit convergence rates is established for the estimation of the common factors, the cluster-specific factors, and the latent clusters. Numerical illustration with both simulated data as well as a real data example is also reported. As a spin-off, the proposed new approach also advances significantly the statistical inference for the factor model of Lam and Yao. Supplementary materials for this article are available online.
author2 School of Physical and Mathematical Sciences
author_facet School of Physical and Mathematical Sciences
Zhang, Bo
Pan, Guangming
Yao, Qiwei
Zhou, Wang
format Article
author Zhang, Bo
Pan, Guangming
Yao, Qiwei
Zhou, Wang
author_sort Zhang, Bo
title Factor modeling for clustering high-dimensional time series
title_short Factor modeling for clustering high-dimensional time series
title_full Factor modeling for clustering high-dimensional time series
title_fullStr Factor modeling for clustering high-dimensional time series
title_full_unstemmed Factor modeling for clustering high-dimensional time series
title_sort factor modeling for clustering high-dimensional time series
publishDate 2023
url https://hdl.handle.net/10356/172172
_version_ 1784855604772208640