Research Article Volume 12 Issue 3
Location and scale under exchangeable errors
DR Jensen
Regret for the inconvenience: we are taking measures to prevent fraudulent form submissions by extractors and page crawlers. Please type the correct Captcha word to see email ID.
Department of Statistics, Virginia Tech, USA
Correspondence: DR Jensen, Department of Statistics, Virginia Tech, USA
Received: May 23, 2023 | Published: June 29, 2023
Citation: Jensen DR. Location and scale under exchangeable errors. Biom Biostat Int J. 2023;12(3):81-86. DOI: 10.15406/bbij.2023.12.00388
Download PDF
Abstract
Classical multivariate analysis rests on observations
having
mutually independent rows, with dispersion matrix as a direct product
, supported in turn by a rich literature. That independence may fail is modeled here on taking the rows of
to be exchangeably dependent such that
where exchangeability rests on the choice for
. Three choices are considered; each interjects additional parameters into the model; and it remains to ask which, if any, of findings widely known under independence, might apply also under exchangeable dependence. Conventional inferences for the location and scale parameters
are reconsidered. Excluding
these are found to carry over in large part to include the exchangeable errors of this study.
AMS subject classification: 62E15, 62H15, 62J20
Keywords: exchangeable matrix errors, recovered properties, location and scale parameters
Introduction
The model
asserts the n rows of
to be
–dimensional responses, having location parameters
where
and with
as an array of random errors. Conventional analysts take the
rows of
to be mutually independent and Gaussian, so that
. To the contrary, independence often fails; venues include multiple time series, econometrics, and empirical adjustments that induce dependencies among the adjusted responses, as in references1,2 for calibrated data. Accordingly, it is instructive to replace independence among rows by exchangeable dependence, on letting
where exchangeably rests on the choice for
.
In short, the basic foundations remain to be reworked, as in this study with regard to independence. Specifically, with
as Euclidean
–space and
the real matrices of order
then the distribution
for
is said to be exchangeable provided that
for every
the
permutation group, a concept due to Johnson.3 In this study exchangeable errors on
are identified; their use is seen to offer a rich class of alternatives to independence. A brief survey follows.
Selected classes of exchangeable errors on
are studied, as are moments for the model
. The focus here centers on
as the conventional location/scale parameters. But since additional parameters are injected into the model on requiring that it should be exchangeable, it is essential to identify those properties, if any, which do carry over to include exchangeable errors.
Preliminaries
Notation
Identify
and
as stated, with
as the symmetric, positive definite matrices of order
. Vectors and matrices are in bold type, with
as the transpose, inverse, trace, and determinant of
. The unit vector in
is
;
is the
identity;
and Diag
is block–diagonal. Take
to be the eigenvalues of
. The condition number of
is
. For
and
, their direct product is
of order
, and a
–inverse of
is
such that
.
Random arrays
Consider
to be random, with
as its law of distribution, its expected values in
, and its dispersion matrix in
under moments of first and second orders. Moreover, for displaying the elements of
, the matrix
of order
is taken row–wise through the mapping
of order
as in the following from Jensen DR, et al.4
Proposition 1 i. For
, then
is arrayed as
, often of the form
with elements
;
- Then for row
of
the element
is on the diagonal of
, and
is off the diagonal;
iii. For
and fixed
,then
.
Exchangeable arrays trace to Johnson3 as noted, and since have a rich history. Any mixture of independent, identically distributed
variables in
is exchangeable; a converse of Finetti B5 is that elements of
if exchangeable, are conditionally
given some
. Matrix arrays are considered next; refer also to Aldous.6
Definition 1
- The distribution of
is said to be left–exchangeable provided that
for every
;
-
is right–exchangeable provided that
for every
.
Essential properties may be listed as follow.
Lemma 1 Take
with
, and
with .
- Let
be exchangeable on
; then
for every
,i.e.
is invariant under
acting by congruence;
- Let
be left–exchangeable ; then
for every
;
- Let
be right–exchangeable; then
for every
.
Proof. Clearly
implies
for every
, to give conclusion (i). Conclusions (ii) and (iii) follow as in Definition 1(iii), namely,
; and applying Conclusion (i) in succession to
and
.
Classes of exchangeable errors
An early version having exchangeable rows on
is
, identified in7 as an Exchangeable General Linear Model. This is a block–partitioned version of an equicorrelation matrix, but differing from matrices of type
as considered here and listed in Table 1.
Remark 1 Given
, then
follows on taking
and
.
Essential properties may be summarized as follow.
Theorem 1 Consider the classes
of Table 1, together with conditions
for
to be positive definite. Then
- The classes are closed under congruence by
, i.e. for
, the matrices satisfy
, for each
;
- For each
, the conditions
that
be positive definite, are identical for all elements of the classes
- Consider
to be fixed as are
and
. Corresponding to
is an equivalent subclass, namely
, as given by
,
having identical values for
and
, consisting in number as
provided that elements of are distinct.
Proof (i) The conditions
of Table 1 are from Theorem 2 of Jensen DR8;
follows step–by–step on modifying that proof exclusive of
; and
is given in Halperin M.9 (ii) Closure properties for
and
follow with
since
and
with
, and similarly
reproduces itself. Conclusion (iii) holds for
since
and
are invariant under permutations of
, and the members of
clearly are generated from all
permutations of the elements of
if distinct.
Table 1 identifies additional parameters as required to achieve exchangeability. It is essential to examine the manner in which these may affect outcomes of the analysis, specifically, through the singular joint distribution of
as
functions of
.
Repeated use is made of
from Proposition 1(iii). In addition,
is the idempotent projection operator onto the error span of the model
.
Theorem 2 Given
, consider under the classes
of Table 1. Then
-
for each
;
- The joint dispersion matrices
of order
under the Table 1 classes are given respectively by
(1)
Proof Conclusion (i) follows from
, and
by parallel arguments. Next let
, so that
and
with
as in Table 1. Substituting these in succession into expression
gives the displayed matrices (1) for the classes
, respectively.
Class
|
|
|
Source
|
|
|
|
Jensen8
|
|
|
|
Baldessari10
|
|
|
|
Halperin9
|
Table 1 Classes of matrices
for
as factors of
having exchangeable rows, together with conditions
for
to be positive definite, where
is of order
,
and
In short, Theorem 2 catalogs the essentials of requiring that errors on
be exchangeable as in Table 1. Both
are affected in having properties discordant with those of the conventional
. Specifically, requiring that errors be exchangeable may serve to compromise the evidence contained in with regard to, to be examined subsequently. Details are collected in the following Table 2 as excerpted from Theorem 2.
Scale–invariance
This concept is central to establishing properties under independence as they may carry over to include exchangeable dependence. To these ends, associate with the classes
the values
from the final row of Table 2.
Lemma 2 Let
be scale invariant, i.e.
for
;and consider these as they may apply in the exchangeable classes
of Table 1.
- The scale parameters of
are respectively
for the classes of Table 1;
- Properties of
are identical to those for
, for each of the exchangeable classes
.
Proof. Conclusion (i) is from Table 2 as noted. The proof for (ii) hinges on scaling properties of Wishart matrices, namely, that
, so that if
as in Table 2, then
, the default state. Accordingly, infer that
behaves as if from
in the third row of Table 2, and
behaves as if from
. But
is scale–invariant, so that
, as if from
to complete a proof.
Tests for
A complement to estimation is hypothesis testing under exchangeable errors. First consider
.
For
, recall that
-
;
-
are mutually independent; and
- Hotelling’s11 test for
vs
utilizes the statistic
(2)
with distribution
of order
having
degrees of freedom and noncentrality parameter
. Under the error classes of Table 1, the principal negative finding of this study is the following.
Lemma 3 Consider
in the classes
, together with
for testing
vs
.
- That
are independent is met only in the class
;
- Replacing
in Equation (2) are reciprocals of
, and these typically are unknown;
- In short, the classical tests for
are unsupported in the exchangeable error classes
.
Proof. (i) The independence of
, namely
, is met only in the class
in Theorem 2, unless
for both
and
in Equation (1), in which case
. Conclusion (ii) follows from Theorem 2 and Table 2, and Conclusion (iii) follows in summary.
Item
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Table 2 Properties of
and,
where
; moreover, the distribution
is central Wishart of order
, having
degrees of freedom and scale parameters
Inferences for
Estimation
The dispersion matrix
within the rows of
, and the cross–covariances
between rows, all depend on
. In addition to properties of
as reported in Theorem 2 and Table 2, let
be the error mean squares for the classes
. Essential features are that
for
and
for the classes
. Thus
is unbiased for
, whereas
are biased by the factors
. Moreover, as measures of scatter, the generalized variances are related as
and
, whereas the condition numbers
are identical.
Hypothesis tests
Five tests, historically devised and subsequently used under
are listed in Table 3, to include statements of hypotheses, commonly used test statistics, and references.
As to exchangeable dependence, it remains to identify those of Table 3 that remain viable in the exchangeable classes of Table 1.
Theorem 3 Consider the tests for
as in Table 3 for the classes
of Table 1, in lieu of the conventional
.
(i) All statistics of Table 3 are scale–invariant;
(ii) For the classes
, properties of the tests of Table 3 are identical to those for
, independently of
.
Proof As before
, are the scale parameters for
in
. Conclusion (i) is apparent, where for
, we find on rescaling
that
and
, leaving
to be scale–invariant. Conclusion (ii) follows on applying conclusion (i) in order to verify the scale–invariance and applicability of Lemma 2.
Remark 2 These tests accordingly exhibit genuinely nonparametric features, in that each applies for structured distributions in the classes
beyond that of the conventional
.
Exact distributions of the Table 3 statistics
rarely are known, supported instead by approximations, namely,
, such that
, namely, approximately chi–squared having
degrees of freedom. Details are found in Sections 7.2.1 and 7.2.2 of Rencher12 and 7.3 of Morrison.13 These details are omitted here in the interests of brevity, but suffice to say, those approximations all apply in the exchangeable error classes of this study.
Item
|
|
Test statistic
|
Reference
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Table 3 Selected hypotheses regarding
; commonly used test statistics; references R to Rencher12 and M to Morrison13
Legend
;
and
consists of
linear contrasts.
Correlation analyses
Here sample entities depend on
, corresponding parameters are identical functions of
. To these ends take
of orders
, and partition
as
(3)
Then
are simple correlations; the singular values
are the canonical correlations
and the multiple correlations are defined at
Again note that these were derived historically and subsequently used under the independence model
. The question again arises as to whether exchangeable errors may have compromised correlative evidence in
regarding
.Results to the contrary are the substance of the following.
Theorem 4 Given
in the exchangeable classes
; consider effects on correlation analyses as prescribed under
.
(i) Then for all
, the entities
and their properties are identical to those for
;
(ii) In like manner, for all
, properties of multiple and canonical correlations are identical to those for
;
(iii) In short, conventional correlation analyses are preserved despite requiring that errors be exchangeable in
.
Proof. The claims again rest on the fact that sample correlations are scale–invariant functions of
and
. Conclusions (i), (ii) and (iii) now follow from Lemma 2.
Factor analyses
Within the scope of psychometric, sociometric, and humanistic endeavors, the
paradigm postulates that
such that elements of
comprise the factor loadings, and
the unique variances. In particular, the diagonal elements of
are
where
are the communalities. The analysis begins with
, typically utilizing maximum likelihood estimation as in Chapter 13 of Rencher.12 An initial solution
eventually is rotated so as to achieve further desirable properties, since the loadings
are non–unique.
For the case that
, the normal–theory likelihood ratio for testing
vs
is
(4)
and referred to upper critical values of the approximating distribution, namely,
with
as in expression (13.47) of Rencher.12 These were derived historically and used subsequently for the case that
.
The extent to which the foregoing algorithm may be applied more generally, to encompass exchangeable errors, is examined in the following.
Theorem 5 Consider the statistic (4) for testing the
model in the classes
, as developed and prescribed for
. Then
(i) For each distribution
, properties of tests using (4) are identical to those under
.
Proof. As the statistic (4) is scale–invariant, the conclusion again follows from Lemma 2.
Conclusion
In retrospect, taking the conventional
remains an enduring artefact of statistical practice. Exchangeable dependence, where
, is a radical departure, albeit on occasion as being itself fundamental to correct statistical practice. Foundations trace to Johnson3; extensions encompass matrices in
and stochastic sequences in various domains. Representations for two–way arrays include (i) functions of
scalars as in Aldous DJ6 and the related studies14,15; and (ii) as limits of finite exchangeable sequences as in Ivanoff BG.16 for rectangular arrays. Marshall & Olkin17 demonstrated that Schur–concave joint density functions on
are exchangeable; Shaked & Tong18 superimposed partial orderings on exchangeable arrays; and Seneta19 sought to approximate joint probabilities of equicorrelated vectors in
in terms of marginal probabilities and the correlation parameter
.Functional limit theorems for row and column arrays were studied in Ivanoff BG.16 Kallenberg20 examined ergodic properties of exchangeable arrays generated as multivariate samples from a stationary process. In reliability studies, an exchangeable array is considered in Spizzichino F, et al.21 as deriving from a hierarchical model having multivariate negative aging. In addition, a multivariate lognormal frailty model for exchangeable failure time data, having marginal Weibull lifetime distributions, is considered in Stefanescu C.22
Alternative to our studies is equation (1) of Arnold7 having the linear structure of our model
but differing in dispersion. Arnold’s approach differs in reducing his model to a canonical form. Nonetheless, Arnold’s assessment of
serves to confirm our findings in Lemma 3. On the other hand, our examination of, its sample version
, and other second–moment properties, find no parallel in Arnold’s studies. In continuation of those studies, Roy & Fonseca23 sought to extend equation (1), considered as a two–level array, to encompass three levels.
Antecedents to the present study include
in Table 1 from Baldessari10 in lieu of
in the Analysis of Variance; and characterized in24–26 as the class of all within-subject dispersion matrices preserving the validity of conventional
–tests in the analysis of repeated measurements. Moreover, structured matrices of an earlier vintage include the Euclidean distance matrices of Gower,27 namely
, with
diagonal, having applications to linear inference as found in Farebrother.28
In summary, our studies have sought to cover a diversity of topics in multivariate statistical inference from a further perspective, namely, that of exchangeable errors. But at the same time, to acknowledge and to pursue the prospects that requiring exchangeability may serve to compromised the meanings attributed to sample evidence. Specifically, references abound for the vast array of multivariate normal procedures described here as classical, including those amenable to selected exchangeable distributions as shown here. Of the many topics not covered, interested readers are encouraged to undertake further investigations using and adding to the analytical principles demonstrated here.
Acknowledgments
Conflicts of interest
The author declare that there is no conflicts of interest.
Funding
References
- Jensen DR, Ramirez DE. Anomalies in the analysis of calibrated data. J Statist Computat Simul. 2009;79:299–314.
- Jensen DR, Ramirez DE. Irregularities in X(Y) from Y(X) in linear regression. J Statist Computat Simul. 2012;83(10):1807–1828.
- Johnson W. Logic, Part III: The Logical Foundations of Science. Cambridge UK: Cambridge University Press; 1924.
- Jensen DR, Good IJ. Invariant distributions associated with matrix laws under structural symmetry. J Royal Statist Soc B. 1981;43(3):327–332.
- Finetti B. Forecasting: its logical laws, its subjective sources. Annals of the Henri Poincaré Institute. 1937;7(1):1–68.
- Aldous DJ. Representations for partially exchangeable arrays of random variables. J Multivariate Anal. 1981;11(4):581–598.
- Arnold S. Linear models with exchangeably distributed errors. J Amer Statist Assoc. 1979;74(365):194–199.
- Jensen DR. Structured dispersion and validity in linear inference. Linear Algebra Applic. 1996;249:189–196.
- Halperin M. Normal regression theory in the presence of intra-class correlation. Ann Math Statist. 1951;22:573–580.
- Baldessari B. Analysis of variance of dependent data. Statistica (Bologna). 1966;26:895–903.
- Hotelling H. The generalization of Student’s ratio. Ann Math Statist. 1931;2:360–378.
- Rencher A. Methods of Multivariate Analysis. 2nd ed. New York: Wiley; 2002.
- Morrison DF. Multivariate Statistical Methods. 3rd ed. New York: McGraw–Hill; 1990.
- Kallenberg O. On the representation theorem for exchangeable arrays. J Multivariate Anal. 1989;30:137–154.
- Lynch J. Canonical row–column–exchangeable arrays. J Multivariate Anal. 1984;15:135–140.
- Ivanoff BG, Weber NC. Functional limit theorems for row and column exchangeable arrays. J Multivariate Anal. 1995;55:133–148.
- Marshall AW, Olkin I. Majorization in multivariate distributions. Ann Statist. 1974;2:1189–1200.
- Shaked M, Tong YL. Some partial orderings of exchangeable random variables by positive dependence. J Multivariate Anal. 1985;17:333–349.
- Seneta E. Multivariate probability in terms of marginal probability and correlation coefficient. Biometrical J. 1987;29:375–380.
- Kallenberg O. Multivariate sampling and the estimation problem for exchangeable arrays. J Theor Probab. 1999;12:859–883.
- Spizzichino F, Torrisi G. Multivariate negative aging in an exchangeable model of heterogeneity. Statist Probab Letters. 2001;55:71–82.
- Stefanescu C, Turnbull B. Multivariate frailty models for exchangeable survival data with covariates. Technometrics. 2006;48:411–417.
- Roy A, Fonseca M. Linear models with doubly exchangeable distributed errors. Commun. Statist: Theory and Methods. 2012;41;2545–2569.
- Bock R. Multivariate analysis of variance of repeated measurements. In: Harris CW, editor. Problems of Measuring Change. Madison: Wisconsin: University of Wisconsin Press; 1963:85–103.
- Huynh H, Feldt L. Conditions under which mean square ratios in repeated measurements designs have exact F–distributions. J Amer Statist Assoc. 1970;65:1582–1589.
- Rouanet H, Lépine D. Comparison between treatments in a repeated–measurement design: ANOVA and multivariate methods, British J Math Statist Psychol. 1970;23:147–163.
- Gower J. Euclidean distance geometry. Math Sci. 1982;7:1–14.
- Farebrother R. Linear unbiased approximators of the disturbances in the standard linear model. Linear Algebra Applic. 1985;67:259–274.
©2023 Jensen. This is an open access article distributed under the terms of the,
which
permits unrestricted use, distribution, and build upon your work non-commercially.