Why copulas have been successful in many practical applications: A theoretical explanation based on computational efficiency

© Springer International Publishing Switzerland 2015. A natural way to represent a 1-D probability distribution is to store its cumulative distribution function (cdf) F(x) = Prob(X < x). When several random variables X1,. . ., Xn are independent, the corresponding cdfs F1(x1),. . ., Fn(xn) provid...

Full description

Saved in:
Bibliographic Details
Main Authors: Vladik Kreinovich, Hung T. Nguyen, Songsak Sriboonchitta, Olga Kosheleva
Format: Conference Proceeding
Published: 2018
Subjects:
Online Access:https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84951019499&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/54416
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Chiang Mai University
id th-cmuir.6653943832-54416
record_format dspace
spelling th-cmuir.6653943832-544162018-09-04T10:20:04Z Why copulas have been successful in many practical applications: A theoretical explanation based on computational efficiency Vladik Kreinovich Hung T. Nguyen Songsak Sriboonchitta Olga Kosheleva Computer Science Mathematics © Springer International Publishing Switzerland 2015. A natural way to represent a 1-D probability distribution is to store its cumulative distribution function (cdf) F(x) = Prob(X < x). When several random variables X1,. . ., Xn are independent, the corresponding cdfs F1(x1),. . ., Fn(xn) provide a complete description of their joint distribution. In practice, there is usually some dependence between the variables, so, in addition to the marginals Fi(xi), we also need to provide an additional information about the joint distribution of the given variables. It is possible to represent this joint distribution by a multi-D cdf F(x1,. . ., xn) = Prob(X1 < x1 &. . &Xn < xn), but this will lead to duplication - since marginals can be reconstructed from the joint cdf - and duplication is a waste of computer space. It is therefore desirable to come up with a duplication-free representation which would still allow us to easily reconstruct F(x1,. . ., xn). In this paper, we prove that among all duplication-free representations, the most computationally efficient one is a representation in which marginals are supplements by a copula. This result explains why copulas have been successfully used in many applications of statistics: since the copula representation is, in some reasonable sense, the most computationally efficient way of representing multi-D probability distributions. 2018-09-04T10:13:10Z 2018-09-04T10:13:10Z 2015-01-01 Conference Proceeding 03029743 2-s2.0-84951019499 10.1007/978-3-319-25135-6-12 https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84951019499&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/54416
institution Chiang Mai University
building Chiang Mai University Library
country Thailand
collection CMU Intellectual Repository
topic Computer Science
Mathematics
spellingShingle Computer Science
Mathematics
Vladik Kreinovich
Hung T. Nguyen
Songsak Sriboonchitta
Olga Kosheleva
Why copulas have been successful in many practical applications: A theoretical explanation based on computational efficiency
description © Springer International Publishing Switzerland 2015. A natural way to represent a 1-D probability distribution is to store its cumulative distribution function (cdf) F(x) = Prob(X < x). When several random variables X1,. . ., Xn are independent, the corresponding cdfs F1(x1),. . ., Fn(xn) provide a complete description of their joint distribution. In practice, there is usually some dependence between the variables, so, in addition to the marginals Fi(xi), we also need to provide an additional information about the joint distribution of the given variables. It is possible to represent this joint distribution by a multi-D cdf F(x1,. . ., xn) = Prob(X1 < x1 &. . &Xn < xn), but this will lead to duplication - since marginals can be reconstructed from the joint cdf - and duplication is a waste of computer space. It is therefore desirable to come up with a duplication-free representation which would still allow us to easily reconstruct F(x1,. . ., xn). In this paper, we prove that among all duplication-free representations, the most computationally efficient one is a representation in which marginals are supplements by a copula. This result explains why copulas have been successfully used in many applications of statistics: since the copula representation is, in some reasonable sense, the most computationally efficient way of representing multi-D probability distributions.
format Conference Proceeding
author Vladik Kreinovich
Hung T. Nguyen
Songsak Sriboonchitta
Olga Kosheleva
author_facet Vladik Kreinovich
Hung T. Nguyen
Songsak Sriboonchitta
Olga Kosheleva
author_sort Vladik Kreinovich
title Why copulas have been successful in many practical applications: A theoretical explanation based on computational efficiency
title_short Why copulas have been successful in many practical applications: A theoretical explanation based on computational efficiency
title_full Why copulas have been successful in many practical applications: A theoretical explanation based on computational efficiency
title_fullStr Why copulas have been successful in many practical applications: A theoretical explanation based on computational efficiency
title_full_unstemmed Why copulas have been successful in many practical applications: A theoretical explanation based on computational efficiency
title_sort why copulas have been successful in many practical applications: a theoretical explanation based on computational efficiency
publishDate 2018
url https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84951019499&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/54416
_version_ 1681424316513648640