biology daily - the biology and biochemistry encyclopedia
biology daily articles and research Encyclopedia Dictionary Forums biology research links Weblinks Pictures Articles Blogs Newsletter

Chebyshev's inequality

This article is not about Chebyshev's sum inequality.

Chebyshev's inequality (or Tchebysheff's inequality or Chebyshev's theorem), named in honor of Pafnuty Chebyshev, is a result in probability theory that gives a lower bound for the probability that a value of a random variable of any distribution with finite variance lies within a certain distance from the variable's mean; equivalently, the theorem provides an upper bound for the probability that values lie outside the same distance from the mean.

Contents

The theorem

'Theorem.' Let X be a random variable with mean μ and finite variance σ2. Then for any real number k > 0,

P(\left|X-\mu\right|\geq k\sigma)\leq\frac{1}{k^2}.

Only the cases k > 1 provide useful information.

Another consequence of the theorem is that for any distribution with mean μ and finite standard deviation σ, at least half of the values lie in the interval (μ − √2 σ, μ + √2 σ).

Typically, the theorem will provide rather loose bounds. However, the bounds provided by Chebyshev's inequality cannot, in general (remaining sound for variables of arbitrary distribution), be improved upon. For example, for any k > 1, the following example (where σ = 1/k) meets the bounds exactly.

\begin{matrix}P(X=-1) & = & 1/2k^2 \\ P(X=0) & = & 1 - 1/k^2 \\ P(X=1) & = & 1/2k^2 \end{matrix}

The theorem can be useful despite loose bounds because it applies to random variables of any distribution, and because these bounds can be calculated knowing no more about the distribution than the mean and variance.

Chebyshev's inequality is used for proving the weak law of large numbers.

Variants

A one-tailed variant with k > 0, is

P(X-\mu \geq k\sigma)\leq\frac{1}{1+k^2}.

A stronger result applicable to unimodal probability distributions is the Vysochanskiï-Petunin inequality.

Example application

For illustration, assume Wikipedia articles are on average 1000 characters long with a standard deviation of 200 characters. From Chebyshev's inequality we can then deduce that at least 75% of Wikipedia articles have a length between 600 and 1400 characters (k = 2).

Proof

Markov's inequality is applied with a = k2, and X as (X - μ)2.

A direct proof shows why bounds we get are quite loose:

P[|X-\mu | \geq k\sigma] = \int_{\left \{\omega :\, \left | X(\omega)-\mu  \right | \geq k\sigma\right \} } 1 \, dP
\leq \int_{\left \{\omega :\, \left | X(\omega)-\mu  \right | \geq k\sigma\right \} } \left (\frac{X-\mu }{k\sigma} \right )^2\, dP
\leq \frac{1}{k^2\sigma^2}\int_R (X-\mu )^2 dP = \frac{1}{k^2}

First, we increase the 1 in integrand and then we increase the set over which we integrate to all reals, so any probability of not being at the mean or the boundary points loosens the inequality.

See also



06-01-2009 23:10:04
The contents of this article are licensed from Wikipedia.org under the GNU Free Documentation License. How to see transparent copy
BiologyDaily.com 2005. Legal info   Privacy