Exchangeability
In many applications, the order of observations does not matter. (For example, training instances can be permuted without changing the probability distribution; words in the bag-of-words model can be permuted)
Formally,
Definition Random variables are exchangeable if for all permutations ,
Definition Sequence is infinitely exchangeable if any finite subsequence is exchangeable.
De Finetti's Theorem
If are exchangeable, then the probability distribution can be described succinctly due to the following theorem.
Theorem (De Finetti) If is infinitely exchangeable where , then there exists some parameter space and density function such that for any observations,
In other words, we can always draw a graphical model:
If is finite, then has finite dimension, and we can parameterize. (For example, if is Bernoulli, then we only need 1 parameter ).
However, if is infinite, has infinite dimension and cannot be parameterized by a finite amount of parameters.
This is the motivation for using nonparametric methods which do not assume the (finite) number of parameters.