Scatter matrix
From Wikipedia, the free encyclopedia
- For the notion in quantum mechanics, see scattering matrix.
In multivariate statistics and probability theory, the scatter matrix is a statistic that is used to make estimates of the covariance matrix, for instance of the multivariate normal distribution.
Given n samples of m-dimensional data, represented as the m-by-n matrix, , the sample mean is
where is the j-th column of .[1]
The scatter matrix is the m-by-m positive semi-definite matrix
where denotes matrix transpose,[2] and multiplication is with regards to the outer product. The scatter matrix may be expressed more succinctly as
where is the n-by-n centering matrix.