| ::
Understanding the Central Limit Theorem ::
Consider a fair game of coin tossing. If a Tail (T) comes out you loose
$1 otherwise with a Head (H) you win $1.
If we play twice then the list of possible outcomes are {TT,TH,HT,HH}.
However the TH and HT have the same payoff and thus the payoff list for
{TT,TH &HT,HH} is ${-2,0,2}. Obviously here all possible outcomes
are equally likely and so the possibility of coming out with nothing is
twice as likely than winning or loosing.
Consider a second game with three outcomes Red (R),Green (G) , Blue (B).
With R you loose $2; with B you win $2; otherwise with G you gain nothing.
If G is twice as likely to occur than the other 2 colors we immediately
see that this game is equivalent to playing our first game twice. It has
the same probabilities associated to the same possible payoffs.
Mathematically - the mapping of game 1 onto game 2 is done by an Auto
or Self convolution of the probability distribution function of the payoffs
of game 1.
AutoConvolution of {T,H}
![[Graphics:Images/CLT_gr_1.gif]](Images/CLT_gr_1.gif)
![[Graphics:Images/CLT_gr_2.gif]](Images/CLT_gr_2.gif)
![[Graphics:Images/CLT_gr_3.gif]](Images/CLT_gr_3.gif)
This is the same Probability distribution function for the payoff associated
to {R,G,B} in the second game.
![[Graphics:Images/CLT_gr_4.gif]](Images/CLT_gr_4.gif)
Playing Game 2 twice is equivalent to playing Game 1 four times etc. -
and performing the Autoconvolution we can find the list of outcomes together
with the probability distribution function.
![[Graphics:Images/CLT_gr_5.gif]](Images/CLT_gr_5.gif)
We observe that the Probability distribution function associated to
the possible payoffs approaches a Gaussian distribution as we play more
and more games.
This observation is part of what is known as the Central Limit Theorem.
We can play different fair H & T or RGB games with different associated
probability distribution functions and still in the limit the probability
of the payoff approaches a Gaussian.
![[Graphics:Images/CLT_gr_6.gif]](Images/CLT_gr_6.gif)
Further we can think of fair RGB games that cannot be constructed by
playing two different or same fair HT. For example the probabilities associated
to {R,G,B} being {0.4,0.2,0,4} is one such example. Nevertheless, playing
this game in repetition or as part of a series of different fair games
still produces a Gaussian distribution.
![[Graphics:Images/CLT_gr_7.gif]](Images/CLT_gr_7.gif)
Why? Why is it that when playing many fair games with different probability
distribution functions associated to the payoffs do we always approach
a Gaussian distribution?
The proof of this theorem requires graduate mathematics. However, we
shall focus on the key property that is responsible for this.
Lets focus on H & T games and , for the moment, forget the payoff
distribution function. Since the order of an outcome is not an issue here
the ordered outcome TTHH gives the same payoff as THTH - the distribution
of the outcomes is thus Binomial.
Therefore the distribution of the outcomes approaches a Gaussian over
many games. When we then compute the probability distribution for the
payoff by substituting in P(H) and P(T) we see that any peak in the previous
distribution will move towards the center driven by the increasing
combinatorial paths leading towards the center.
For an RGB game the distribution is not the usual Binomial but still approaches
a Gaussian since the order of individual outcomes does not matter. Indeed,
the Pascal triangle structure obtained in the HT outcome distribution
is also obtained in the RGB game with the difference that the former produces
a Pascal triangle whilst the latter produces a Pascal tetrahedron. We
just have to add the condition that RB produces the same payoff as
- projecting the tetrahedron structure of the outcomes to a triangle.
![[Graphics:Images/CLT_gr_34.gif]](Images/CLT_gr_34.gif)
Even when we play a series of many different fair H & T games with
different probabilities associated to a H and T outcome - because H followed
by T and T followed by H produce the same payoff - the outcome will always
approach a Gaussian.
Written by Raffaello Vardavas. Special thanks to Cameron
Connell.
Back for more 
|