Panel Data Analysis Via Variable Selection and Subject Clustering

A panel data set contains observations on multiple phenomena observed over multiple time periods for the same subjects (e.g., firms or individuals). Panel data sets appear frequently in marketing, economics, and many other social sciences. An important panel data analysis task is to analyze and predict a variable of interest...

Bibliographic Details
Main Authors:	LU, Haibing; HUANG, Shengsheng; LI, Yingjiu; YANG, Yanjiang
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2014
Subjects:	Computer Sciences; Numerical Analysis and Scientific Computing
Online Access:	https://ink.library.smu.edu.sg/sis_research/2561 http://dx.doi.org/10.1007/978-3-642-45252-9_5
Institution:	Singapore Management University

id	sg-smu-ink.sis_research-3561
record_format	dspace
spelling	sg-smu-ink.sis_research-3561 2016-01-21T07:32:57Z Panel Data Analysis Via Variable Selection and Subject Clustering LU, Haibing HUANG, Shengsheng LI, Yingjiu YANG, Yanjiang A panel data set contains observations on multiple phenomena observed over multiple time periods for the same subjects (e.g., firms or individuals). Panel data sets appear frequently in marketing, economics, and many other social sciences. An important panel data analysis task is to analyze and predict a variable of interest. Because, in the social sciences, the number of records collected for each subject is usually too small to support accurate and reliable analysis, a common solution is to pool all subjects together and run a linear regression in an attempt to discover the underlying relationship between the variable of interest and the other observed variables. However, this method suffers from two limitations. First, subjects might not be poolable due to their heterogeneous nature. Second, not all variables might have significant relationships to the variable of interest. A regression on many irrelevant regressors will lead to wrong predictions. 
To address these two issues, we propose a novel approach, called Selecting and Clustering, which derives the underlying linear models by first selecting variables highly correlated with the variable of interest and then clustering subjects into homogeneous groups that share the same linear model with respect to those variables. Furthermore, we build an optimization model to formulate this problem, the solution of which enables one to select variables and cluster subjects simultaneously. Due to the combinatorial nature of the problem, an effective and efficient algorithm is proposed. Studies on real data sets validate the effectiveness of our approach, which performs significantly better than existing approaches. 2014-01-01T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/2561 info:doi/10.1007/978-3-642-45252-9_5 http://dx.doi.org/10.1007/978-3-642-45252-9_5 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Computer Sciences Numerical Analysis and Scientific Computing
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Computer Sciences; Numerical Analysis and Scientific Computing
spellingShingle	Computer Sciences Numerical Analysis and Scientific Computing LU, Haibing HUANG, Shengsheng LI, Yingjiu YANG, Yanjiang Panel Data Analysis Via Variable Selection and Subject Clustering
description	A panel data set contains observations on multiple phenomena observed over multiple time periods for the same subjects (e.g., firms or individuals). Panel data sets appear frequently in marketing, economics, and many other social sciences. An important panel data analysis task is to analyze and predict a variable of interest. Because, in the social sciences, the number of records collected for each subject is usually too small to support accurate and reliable analysis, a common solution is to pool all subjects together and run a linear regression in an attempt to discover the underlying relationship between the variable of interest and the other observed variables. However, this method suffers from two limitations. First, subjects might not be poolable due to their heterogeneous nature. Second, not all variables might have significant relationships to the variable of interest. A regression on many irrelevant regressors will lead to wrong predictions. To address these two issues, we propose a novel approach, called Selecting and Clustering, which derives the underlying linear models by first selecting variables highly correlated with the variable of interest and then clustering subjects into homogeneous groups that share the same linear model with respect to those variables. 
Furthermore, we build an optimization model to formulate this problem, the solution of which enables one to select variables and cluster subjects simultaneously. Due to the combinatorial nature of the problem, an effective and efficient algorithm is proposed. Studies on real data sets validate the effectiveness of our approach, which performs significantly better than existing approaches.
format	text
author	LU, Haibing HUANG, Shengsheng LI, Yingjiu YANG, Yanjiang
author_facet	LU, Haibing HUANG, Shengsheng LI, Yingjiu YANG, Yanjiang
author_sort	LU, Haibing
title	Panel Data Analysis Via Variable Selection and Subject Clustering
title_short	Panel Data Analysis Via Variable Selection and Subject Clustering
title_full	Panel Data Analysis Via Variable Selection and Subject Clustering
title_fullStr	Panel Data Analysis Via Variable Selection and Subject Clustering
title_full_unstemmed	Panel Data Analysis Via Variable Selection and Subject Clustering
title_sort	panel data analysis via variable selection and subject clustering
publisher	Institutional Knowledge at Singapore Management University
publishDate	2014
url	https://ink.library.smu.edu.sg/sis_research/2561 http://dx.doi.org/10.1007/978-3-642-45252-9_5
_version_	1770572519351255040

Similar Items