Correlation inequalities

From Encyclopedia of Mathematics - Reading time: 8 min


Correlation between variables in multivariate statistics (cf. also Multi-dimensional statistical analysis) [a19] has motivated classes of inequalities on subsets of a base set S that are referred to as correlation inequalities. They are often stated deterministically but usually have probabilistic interpretations. For example, suppose S is the family of all subsets of a finite set {1n}, and let A and B be order ideals or down-sets in (S,): abA implies aA, abB implies aB, with A,BS. Then, [a13], [a16], 2n|AB||A||B|. When μ is the uniform probability measure (cf. also Probability measure) on 2S, so that μ(a)=μ({a})=2n for every aS, and μ(A)=|A|2n, one has μ(AB)μ(A)μ(B). Hence, with respect to μ, A and B are correlated whenever both are down-sets.

As general definitions, one says with respect to a probability measure μ on an algebra E of subsets of S that A,BE are positively correlated if μ(AB)>μ(A)μ(B) and non-negatively correlated if μ(AB)μ(A)μ(B). Inequality reversals define negative correlation and non-positive correlation, respectively. The instances of these inequalities and generalizations described below assume that S is a finite set partially ordered by so that is irreflexive (never aa) and transitive: ab and bc imply ac.

Inequalities for distributive lattices.[edit]

Let (S,) be a distributive lattice with join ab and meet ab defined by

ab=min{zS:a\clez,b\clez},

ab=max{zS:z\clea,z\cleb}.

Distributivity says that a(bc)=(ab)(ac) for all a,b,cS. A mapping f:SR is non-decreasing if ab implies f(a)f(b) for all a,bS, and f(A) is defined as aAf(a) for AS, with f()=0. The join and meet and can be extended to subsets A,BS by

AB={ab:aA,bB},

AB={ab:aA,bB},

with the understanding that AB=AB= if A or B is empty.

The pre-eminent correlation inequality is the Ahlswede–Daykin inequality [a1], also called the four-functions inequality [a2]. It says that if f1,f2,f3,f4:S[0,) satisfy

f1(a)f2(b)f3(ab)f4(ab) for all a,bS,

then

f1(A)f2(B)f3(AB)f4(AB) for all A,BS.

This is a very powerful result that allows efficient proofs of many other correlation inequalities [a2], [a7], [a10], [a23]. One unusual application [a8] concerns match sets of orders of a deck of 52 cards numbered 1 through 52. For each order, M={k: card k is the k th card at the top of the deck }. Let A={M: an even kM} and B={M: an odd kM}. When μ is the uniform distribution on the 52! orders, μ(AB)μ(A)μ(B). Hence, if the match set of a well-shuffled deck contains an odd number, this can only increase our confidence that it contains an even number.

The most important correlation inequality historically is the FKG inequality [a9] (also known as the Fortuin–Kasteleyn–Ginibre inequality), which says that if η:S[0,) is log supermodular, i.e.,

η(a)η(b)η(ab)η(ab) for all a,bS,

and if f:SR and g:SR are non-decreasing, then

[Sη(a)f(a)][Sη(a)g(a)]

[Sη(a)][Sη(a)f(a)g(a)].

This can be written in terms of the mathematical expectation operation Eη as Eη(f)Eη(g)Eη(fg), when S is a Boolean algebra and η is a log supermodular probability measure with = and =. Suitable definitions of f and g with {0,1} co-domains show that A,BS are non-negatively correlated when both are up-sets or order filters, given log supermodularity.

Another notable result is the Holley inequality [a12]. Suppose η1:S[0,) and η2:S[0,) satisfy Sη1(a)=Sη2(a) along with the following inequality that generalizes log supermodularity and is itself generalized by the four-function Ahlswede–Daykin hypothesis:

η1(a)η2(b)η1(ab)η2(ab) for all a,bS.

Then the Holley inequality says that for every non-decreasing f:SR,

Sf(a)η1(a)Sf(a)η2(a).

Simple proofs of the FKG and Holley inequalities follow from the Ahlswede–Daykin inequality by suitable choices of f1 through f4. Also, when fi=η for all i, log supermodularity on S implies log supermodularity on the subsets of S; when fi1 for all i, one obtains |A||B||AB||AB| for all A,BS, which is equivalent to distributivity [a5] for an ordered lattice (S,).

Systems of subsets.[edit]

Several inequalities for (S,) when S is the family of subsets of {1n} have already been given. Log supermodularity for a probability measure μ on 2S is defined as μ(a)μ(b)μ(ab)μ(ab) for all a,bS, and when this holds, the FKG inequality says that Eμ(f)Eμ(g)Eμ(fg) for non-decreasing f:SR and g:SR and implies that μ(A)μ(B)μ(AB)μ(AB) for all A,BS. In addition, if A and B are down-sets or up-sets, then μ(AB)μ(A)μ(B). When μ is uniform, in which case log supermodularity is automatic, 2n|AB||A||B| if A is an up-set and B is a down-set, so the two are non-positively correlated when the complementary orientations obtain.

Another inequality involves set differences [a14]. Let AB={ab:aA,bB} for A,BS. Then |AB||BA||A||B|, and, in particular, |AA||A|.

The match set of a permutation σ on {1n} is defined as M(σ)={i{1n}:σ(i)=i}. Assuming that all n! permutations are equally likely, let μ(a) for a{1n} be the probability that M(σ)=a. Note that μ(a)=0 when |a|=n1 and hence that μ is not log supermodular. However, there still is an FKG-like inequality [a8], because μ(AB)μ(A)μ(B) whenever A,BS are up-sets. This illustrates the non-necessity of log supermodularity in certain cases for a correlation inequality to hold.

Linear extensions.[edit]

Another class of correlation inequalities arises when S is the set of linear extensions of a finite partially ordered set (X,) and all members of S are equally likely. (X,<0) is a linear extension of (X,) if (X,<0) is a partially ordered set, ≺⊆<0 and, for all distinct x,yX, x<0y or y<0x. Let N be the number of linear extensions of (X,), so the probability law for S is μ(a)=1/N. By additivity, μ(A)=Aμ(a) for each subset A of linear extensions.

A premier correlation inequality in this setting is the Fishburn–Shepp inequality [a6], [a18], also known as the xyz inequality. One says that x,yX are incomparable if xy and neither xy nor yx, and denotes by (x1<0y1xm<0ym) the set of all linear extensions of (X,) that satisfy the xi<0yi relations. The Fishburn–Shepp inequality says that if x, y and z are mutually incomparable in (X,), then

μ(x<0y)μ(x<0z)<μ(x<0y,x<0z),

so events A=(x<0y) and B=(x<0z) are positively correlated. Rewritten as μ(AB)/μ(A)>μ(B), one sees that the probability of x<0z in a randomly chosen linear extension increases when it is true also that x<0y. Examples in [a10], [a17], [a18] show that other plausible positive or non-negative correlations need not be true.

Inequalities involving two-part partitions of X are featured in [a11], [a17]. Suppose {X1,X2} is a non-trivial partition of X. Let A=(a1<0b1aj<0bj) and B=(c1<0d1ck<0dk) for ai,ciX1 and bi,diX2. If the restriction of in (X,) to each Xi is a linear order (cf. also Order (on a set)), then A and B are non-negatively correlated, i.e. μ(AB)μ(A)μ(B). The same is true if ≺=12, where i is a partial order on Xi.

Other inequalities which use two ordering relations on a fixed YX that are associated with are in [a3], [a4], [a7], [a22], [a23]. One of these, [a3], shows that the plausible inequality

μ(x<0y<0z<0w)μ(x<0y<0z)μ(z<0w),

which is suggested by the non-strict rendering μ(x<0y<0z)μ(x<0y)μ(y<0z) of the Fishburn–Shepp inequality, can be contradicted whenever n5.

An inequality from percolation theory.[edit]

A final inequality comes from percolation theory [a20], [a21] and provides a nice contrast to the FKG inequality version |A||B|2n|AB| when A and B are up-sets in the family of subsets of {1n}. For convenience, the subsets of {1n} are written in characteristic vector form, so that S={0,1}n. The cylinder set determined by xS and K{1n} is defined as

c(x,K)={yS:yj=xj for all jK},

and the disjoint intersection of A,BS is defined by

AB={xS:c(x,K)A,

c(x,{1n}K)B for some K}.

The inequality, conjectured in [a20], [a21] and proved in [a15], is

2n|AB||A||B| for all A,BS.

Thus, whereas some A,BS are positively correlated by the usual definition based on AB, all A,BS are "non-positively correlated" with respect to the disjoint intersection AB.

References[edit]

[a1] R. Ahlswede, D.E. Daykin, "An inequality for the weights of two families, their unions and intersections" Z. Wahrscheinlichkeitsth. verw. Gebiete , 43 (1978) pp. 183–185
[a2] B. Bollobás, "Combinatorics" , Cambridge Univ. Press (1986)
[a3] G.R. Brightwell, "Universal correlations in finite posets" Order , 2 (1985) pp. 129–144
[a4] G.R. Brightwell, "Some correlation inequalities in finite posets" Order , 2 (1986) pp. 387–402
[a5] D.E. Daykin, "A lattice is distributive if and only if |A||B||AB||AB|" Nanta Math. , 10 (1977) pp. 58–60
[a6] P.C. Fishburn, "A correlational inequality for linear extensions of a poset" Order , 1 (1984) pp. 127–137
[a7] P.C. Fishburn, "Correlation in partially ordered sets" Discrete Appl. Math. , 39 (1992) pp. 173–191
[a8] P.C. Fishburn, P.G. Doyle, L.A. Shepp, "The match set of a random permutation has the FKG property" Ann. of Probab. , 16 (1988) pp. 1194–1214
[a9] C.M. Fortuin, P.N. Kasteleyn, J. Ginibre, "Correlation inequalities for some partially ordered sets" Comm. Math. Phys. , 22 (1971) pp. 89–103
[a10] R.L. Graham, "Applications of the FKG inequality and its relatives" , Proc. 12th Internat. Symp. Math. Programming , Springer (1983) pp. 115–131
[a11] R.L Graham, A.C. Yao, F.F. Yao, "Some monotonicity properties of partial orders" SIAM J. Alg. Discrete Methods , 1 (1980) pp. 251–258
[a12] R. Holley, "Remarks on the FKG inequalities" Comm. Math. Phys. , 36 (1974) pp. 227–231
[a13] D.J. Kleitman, "Families of non-disjoint sets" J. Combin. Th. , 1 (1966) pp. 153–155
[a14] J. Marica, J. Schönheim, "Differences of sets and a problem of Graham" Canad. Math. Bull. , 12 (1969) pp. 635–637
[a15] D. Reimer, "Proof of the van den Berg-Kesten conjecture". Comb. Probab. Comput. 9, No. 1, pp. 27-32 (2000). Zbl 0947.60093
[a16] P.D. Seymour, "On incomparable collections of sets" Mathematika , 20 (1973) pp. 208–209
[a17] L.A. Shepp, "The FKG property and some monotonicity properties of partial orders" SIAM J. Alg. Discrete Methods , 1 (1980) pp. 295–299
[a18] L.A. Shepp, "The XYZ conjecture and the FKG inequality" Ann. of Probab. , 10 (1982) pp. 824–827
[a19] R.F. Tate, "Correlation methods" W.H. Kruskal (ed.) J.M. Tanur (ed.) , Internat. Encycl. Stat. , 1 , Free Press (1978) pp. 615–628
[a20] J. van der Berg, U. Fiebig, "On a combinatorial conjecture concerning disjoint occurrences of events" Ann. of Probab. , 15 (1987) pp. 354–374
[a21] J. van der Berg, H. Kesten, "Inequalities with applications to percolation and reliability" J. Appl. Probab. , 22 (1985) pp. 556–569
[a22] P.M. Winkler, "Correlation among partial orders" SIAM J. Alg. Discrete Methods , 4 (1983) pp. 1–7
[a23] P M. Winkler, "Correlation and order" Contemp. Math. , 57 (1986) pp. 151–174

How to Cite This Entry: Correlation inequalities (Encyclopedia of Mathematics) | Licensed under CC BY-SA 3.0. Source: https://encyclopediaofmath.org/wiki/Correlation_inequalities
20 views |
↧ Download this article as ZWI file
Encyclosphere.org EncycloReader is supported by the EncyclosphereKSF