Quantnotes.com
Help Contact
Fundamentals
Publications
Software & Data
Book Reviews
Job Listings
Event Listings
Forums
Edutainment
Useful Links
About Us

 

QuantFinanceJobs.com

:: Collaborations ::

:: QF Journal ::

 

:: Understanding the Central Limit Theorem ::

Consider a fair game of coin tossing. If a Tail (T) comes out you loose $1 otherwise with a Head (H) you win $1.
    
If we play twice then the list of possible outcomes are {TT,TH,HT,HH}. However the TH and HT have the same payoff and thus the payoff list for {TT,TH &HT,HH} is ${-2,0,2}. Obviously here all possible outcomes are equally likely and so the possibility of coming out with nothing is twice as likely than winning or loosing.


Consider a second game with three outcomes Red (R),Green (G) , Blue (B). With R you loose $2; with B you win $2; otherwise with G you gain nothing. If G is twice as likely to occur than the other 2 colors we  immediately see that this game is equivalent to playing our first game twice. It has the same probabilities associated to the same possible payoffs.


Mathematically - the mapping of game 1 onto game 2 is done by an Auto or Self convolution of the probability distribution function of the payoffs of game 1.


AutoConvolution of {T,H}

[Graphics:Images/CLT_gr_1.gif]

[Graphics:Images/CLT_gr_2.gif]

[Graphics:Images/CLT_gr_3.gif]


This is the same Probability distribution function for the payoff associated to {R,G,B} in the second game.

[Graphics:Images/CLT_gr_4.gif]


Playing Game 2 twice is equivalent to playing Game 1 four times etc. - and performing the Autoconvolution we can find the list of outcomes together with the probability distribution function.

[Graphics:Images/CLT_gr_5.gif]

We observe that the Probability distribution function associated to the possible payoffs approaches a Gaussian distribution as we play more and more games.

This observation is part of what is known as the Central Limit Theorem.

We can play different fair H & T or RGB games with different associated probability distribution functions and still in the limit the probability of the payoff approaches a Gaussian.

[Graphics:Images/CLT_gr_6.gif]

Further we can think of fair RGB games that cannot be constructed by playing two different or same fair HT. For example the probabilities  associated to {R,G,B} being {0.4,0.2,0,4} is one such example. Nevertheless, playing this game in repetition or as part of a series of different fair games still produces a Gaussian distribution.

[Graphics:Images/CLT_gr_7.gif]

Why? Why is it that when playing many fair games with different probability distribution functions associated to the payoffs do we always approach a Gaussian distribution?

The proof of this theorem requires graduate mathematics. However, we shall focus on the key property that is responsible for this.

Lets focus on H & T games and , for the moment, forget the payoff distribution function. Since the order of an outcome is not an issue here the ordered outcome TTHH gives the same payoff as THTH - the distribution of the outcomes is thus Binomial.

T H
[Graphics:Images/CLT_gr_8.gif] [Graphics:Images/CLT_gr_9.gif] [Graphics:Images/CLT_gr_10.gif]
[Graphics:Images/CLT_gr_11.gif] [Graphics:Images/CLT_gr_12.gif] [Graphics:Images/CLT_gr_13.gif] [Graphics:Images/CLT_gr_14.gif]
[Graphics:Images/CLT_gr_15.gif] [Graphics:Images/CLT_gr_16.gif] [Graphics:Images/CLT_gr_17.gif] [Graphics:Images/CLT_gr_18.gif] [Graphics:Images/CLT_gr_19.gif]
[Graphics:Images/CLT_gr_20.gif] [Graphics:Images/CLT_gr_21.gif] [Graphics:Images/CLT_gr_22.gif] [Graphics:Images/CLT_gr_23.gif] [Graphics:Images/CLT_gr_24.gif] [Graphics:Images/CLT_gr_25.gif]
[Graphics:Images/CLT_gr_26.gif] [Graphics:Images/CLT_gr_27.gif] [Graphics:Images/CLT_gr_28.gif] [Graphics:Images/CLT_gr_29.gif] [Graphics:Images/CLT_gr_30.gif] [Graphics:Images/CLT_gr_31.gif] [Graphics:Images/CLT_gr_32.gif]

Therefore the distribution of the outcomes approaches a Gaussian over many games. When we then compute the probability distribution for the payoff by substituting in P(H) and P(T) we see that any peak in the previous distribution will move towards the center  driven by the increasing combinatorial paths leading towards the center.


For an RGB game the distribution is not the usual Binomial but still approaches a Gaussian since the order of individual outcomes does not matter. Indeed, the Pascal triangle structure obtained in the HT outcome distribution is also obtained in the RGB game with the difference that the former produces a Pascal triangle whilst the latter produces a Pascal tetrahedron. We just have to add the condition that RB produces the same payoff as [Graphics:Images/CLT_gr_33.gif] - projecting the tetrahedron structure of the outcomes to a triangle.

[Graphics:Images/CLT_gr_34.gif]

Even when we play a series of many different fair H & T games with different probabilities associated to a H and T outcome - because H followed by T and T followed by H produce the same payoff - the outcome will always approach a Gaussian.

Written by Raffaello Vardavas. Special thanks to Cameron Connell.

 

Back for more

Copyright © 2001-04 Quantnotes.com. All rights reserved. Legal Notice | Privacy Notice