Subject classification: this is a mathematics resource. |
Subject classification: this is a statistics resource. |
Given any collection of real numbers, an average is a single number intended to give an estimate of the general magnitude of the numbers. Formally, it is a function from a set of n numbers to a single number with the following properties:
It might be supposed that the average should be translation invariant, so that if all numbers are increased by the same constant k their average will increase by the same number: AV(x+k,y+k) = AV(x,y)+k. However, it can be shown that there is only one average meeting this and the other requirements strictly: the arithmetic mean discussed below. If the less strict versions of the requirements are used, the median and other quartiles would meet all the requirements.
It might also be supposed that the average should be a continuous function of the numbers. Again, this would rule out quartiles.
The simplest average is the arithmetic mean, defined as the sum of the numbers divided by the number of numbers. Thus the average of {1,2,3,4,5) is (1+2+3+4+5)/5 = 3.
Exercise: Verify that this meets all the conditions above, including the stricter versions.
Another common average is the geometric mean, obtained by multiplying all the numbers together and, if there are n numbers, taking the nth root. Thus the geometric mean of {1,2,3,4,5) is (1x2x3x4x5)1/5 = 2.605 (approximately).
Note that the geometric mean should not be used if any of the numbers is negative (why?) and is zero if any of the numbers is zero, no matter how large the other numbers are (why?).
Exercise: Verify that the geometric mean meets all the conditions above, including the stricter versions.
Unless all the numbers are equal, the geometric mean is always less than the arithmetic mean.
This is easily proved for just two numbers and three numbers; the outlines of the proof are:
If a and b are two unequal non-negative numbers, then
This can be rearranged as
If a, b and c are three non-negative numbers, not all equal, then
So
This can be rearranged as
For four numbers, take the numbers in pairs and apply the already proven result for two numbers.
How can we assess whether the geometric mean is close to the arithmetic mean, or substantially less?
Given a set of n numbers {x1 ... xn}, let their arithmetic mean be m and
Then the sum of the yi is zero. (Why?) We assume that the xi do not diverge too much from their mean, so the yi are fairly small numbers and we can expand log (1+y) in a power series.
In other words, the greater the dispersion of the numbers about their arithmetic mean, the greater the difference between the two means.
Exercise: Some approximations have been made in deriving this result. Demonstrate by actual calculations that the result is true in general.
The RMS is the square root of the arithmetic mean of the squares of a collection of numbers, i.e.
This average should not be used for a mixture of positive and negative numbers (why?).
Exercise: Verify that this meets all the conditions above, including the stricter versions.
Unless all the numbers are equal, the RMS is always greater than the arithmetic mean. This is easily proved for just two numbers by considering the square of the RMS and of the arithmetic mean:
A similar but more complex proof will work for any number of numbers. With a slight extension of the proof, it may be shown that if m is the arithmetic mean and s2 is the variance of a set of numbers, then
The harmonic mean of a set of numbers is the reciprocal of the arithmetic mean of the reciprocals of those numbers. Thus for three numbers we have
This mean should not be used if any number is zero or negative (why?).
Exercise: Verify that this meets all the conditions above, including the stricter versions.
Unless all the numbers are equal, the harmonic is always less than the geometric mean. This follows because its reciprocal is the arithmetic mean of the reciprocals of the numbers, hence is greater than the geometric mean of the reciprocals which is the reciprocal of the geometric mean. Thus we have:
The rth power mean of a set of numbers, for any real number r, is
This average should only be used for positive numbers if r < 0 and non-negative numbers if r > 0 (why?).
Exercise: Verify that this meets all the conditions above, including the stricter versions.
This mean is undefined for r = 0, but the limit as r tends to 0 is the geometric mean. Thus all of the averages we have considered so far are special cases of this mean (r = -1, harmonic; r = 0, geometric; r = 1, arithmetic; r = 2, RMS).
It can be shown that for any collection of positive numbers (not all equal), this mean is a continuous, strictly monotonic increasing function of r; the inequalities above are special cases of this. As r tends to infinity, the mean tends to the maximum of the xi, and as r tends to minus infinity, the mean tends to the minimum of the xi.
The power plus 1 mean of a set of numbers, for any real number s, is
If s=0, this is the arithmetic mean; if s=-1 it is the harmonic mean.
Note in particular that if s=1, this is RMS2/(arithmetic mean). If m is the arithmetic mean and s2 is the variance of a set of numbers, than the s=1 mean is
It can be shown that this type of mean behaves much like the rth power mean. For any collection of positive numbers (not all equal), this mean is a continuous, strictly monotonic increasing function of s. As s tends to infinity, the mean tends to the maximum of the xi, and as s tends to minus infinity, the mean tends to the minimum of the xi.
Both of these types of mean can be regarded as a special case of the following:
Clearly, if s=0 this is the forth power mean; if r=1 this is the power plus 1 mean.
More types of average can be found by mixing different averages, provided that the formula is symmetric in the variables. For example, for any three numbers x, y, z, the following are all averages:
It may be argued that any quartile other than the median is "biased" hence unsatisfactory. However, if in a group of numbers a few of them are much larger or much smaller than the others ("outliers"), any average may seem biased. For example, consider {1,2,3,4,5,6,7,8,144}. The arithmetic mean is 20, far higher than the upper quartile of 7 and indeed far higher than all but one of the numbers.
The mode is not an average. Firstly, it is not always uniquely defined; for example, in {1,1,1,2,2,3,3,3} there are two modes, 1 and 3 (which are also the minimum and maximum). Secondly and more important, it does not satisfy the monotonicity rule. Consider the set of eight numbers
The mode is 2. Now suppose the seventh number increases from 2 to 3:
The joint modes are now 1 and 2. Now suppose the sixth number increases from 2 to 4:
The mode is now 1.
Let f(x) be any strictly monotonic function. Given a set of numbers {x1 ... xn}, define:
X is then the transformation mean of the xi with respect to f(x).
Examples: f(x) = xr gives the rth power mean; f(x) = log(x) gives the geometric mean.
Exercises