Panel Data Analysis Via Variable Selection and Subject Clustering

This book investigates tradeoff between security and usability in designing leakage resilient password systems (LRP) and introduces two practical LRP systems named Cover Pad and ShadowKey. It demonstrates that existing LRP systems are subject to both brute force attacks and statistical attacks and t...

Full description

Saved in:
Bibliographic Details
Main Authors: LU, Haibing, HUANG, Shengsheng, LI, Yingjiu, YANG, Yanjiang
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2014
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/2561
http://dx.doi.org/10.1007/978-3-642-45252-9_5
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-3561
record_format dspace
spelling sg-smu-ink.sis_research-35612016-01-21T07:32:57Z Panel Data Analysis Via Variable Selection and Subject Clustering LU, Haibing HUANG, Shengsheng LI, Yingjiu YANG, Yanjiang This book investigates tradeoff between security and usability in designing leakage resilient password systems (LRP) and introduces two practical LRP systems named Cover Pad and ShadowKey. It demonstrates that existing LRP systems are subject to both brute force attacks and statistical attacks and that these attacks cannot be effectively mitigated without sacrificing the usability of LRP systems. Quantitative analysis proves that a secure LRP system in practical settings imposes a considerable amount of cognitive workload unless certain secure channels are involved. The book introduces a secure and practical LRP A panel data set contains observations on multiple phenomena observed over multiple time periods for the same subjects (e.g., firms or individuals). Panel data sets frequently appeared in the study of Marketing, Economics, and many other social sciences. An important panel data analysis task is to analyze and predict a variable of interest. As in social sciences, the number of collected data records for each subject is usually not large enough to support accurate and reliable data analysis, a common solution is to pool all subjects together and then run a linear regression method in attempt to discover the underlying relationship between the variable of interest and other observed variables. However, this method suffers from two limitations. First, subjects might not be poolable due to their heterogeneous nature. Second, not all variables might have significant relationships to the variable of interest. A regression on many irrelevant regressors will lead to wrong predictions. To address these two issues, we propose a novel approach, called Selecting and Clustering, which derives underlying linear models by first selecting variables highly correlated to the variable of interest and then clustering subjects into homogenous groups of the same linear models with respect to those variables. Furthermore, we build an optimization model to formulate this problem, the solution of which enables one to select variables and clustering subjects simultaneously. Due to the combinatorial nature of the problem, an effective and efficient algorithm is proposed. Studies on real data sets validate the effectiveness of our approach as our approach performs significantly better than other existing approaches. 2014-01-01T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/2561 info:doi/10.1007/978-3-642-45252-9_5 http://dx.doi.org/10.1007/978-3-642-45252-9_5 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Computer Sciences Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Computer Sciences
Numerical Analysis and Scientific Computing
spellingShingle Computer Sciences
Numerical Analysis and Scientific Computing
LU, Haibing
HUANG, Shengsheng
LI, Yingjiu
YANG, Yanjiang
Panel Data Analysis Via Variable Selection and Subject Clustering
description This book investigates tradeoff between security and usability in designing leakage resilient password systems (LRP) and introduces two practical LRP systems named Cover Pad and ShadowKey. It demonstrates that existing LRP systems are subject to both brute force attacks and statistical attacks and that these attacks cannot be effectively mitigated without sacrificing the usability of LRP systems. Quantitative analysis proves that a secure LRP system in practical settings imposes a considerable amount of cognitive workload unless certain secure channels are involved. The book introduces a secure and practical LRP A panel data set contains observations on multiple phenomena observed over multiple time periods for the same subjects (e.g., firms or individuals). Panel data sets frequently appeared in the study of Marketing, Economics, and many other social sciences. An important panel data analysis task is to analyze and predict a variable of interest. As in social sciences, the number of collected data records for each subject is usually not large enough to support accurate and reliable data analysis, a common solution is to pool all subjects together and then run a linear regression method in attempt to discover the underlying relationship between the variable of interest and other observed variables. However, this method suffers from two limitations. First, subjects might not be poolable due to their heterogeneous nature. Second, not all variables might have significant relationships to the variable of interest. A regression on many irrelevant regressors will lead to wrong predictions. To address these two issues, we propose a novel approach, called Selecting and Clustering, which derives underlying linear models by first selecting variables highly correlated to the variable of interest and then clustering subjects into homogenous groups of the same linear models with respect to those variables. Furthermore, we build an optimization model to formulate this problem, the solution of which enables one to select variables and clustering subjects simultaneously. Due to the combinatorial nature of the problem, an effective and efficient algorithm is proposed. Studies on real data sets validate the effectiveness of our approach as our approach performs significantly better than other existing approaches.
format text
author LU, Haibing
HUANG, Shengsheng
LI, Yingjiu
YANG, Yanjiang
author_facet LU, Haibing
HUANG, Shengsheng
LI, Yingjiu
YANG, Yanjiang
author_sort LU, Haibing
title Panel Data Analysis Via Variable Selection and Subject Clustering
title_short Panel Data Analysis Via Variable Selection and Subject Clustering
title_full Panel Data Analysis Via Variable Selection and Subject Clustering
title_fullStr Panel Data Analysis Via Variable Selection and Subject Clustering
title_full_unstemmed Panel Data Analysis Via Variable Selection and Subject Clustering
title_sort panel data analysis via variable selection and subject clustering
publisher Institutional Knowledge at Singapore Management University
publishDate 2014
url https://ink.library.smu.edu.sg/sis_research/2561
http://dx.doi.org/10.1007/978-3-642-45252-9_5
_version_ 1770572519351255040