Estimator
From Exampleproblems
In statistics, given a parametric model, an estimator is a function of the known sample data that is used to estimate an unknown parameter; an estimate is the result from the actual application of the function to a particular set of data. Many different estimators are possible for any given parameter. Some criterion is used to choose between the estimators, although it is often the case that a criterion cannot be used to clearly pick one estimator over another.
Context and definitions
Context
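- $(\Omega, \mathcal{A}, P)$ is a probability space,
- $(\Sigma, \mathcal{F})$ is a measurable space (the state space),
- Θ is a parameter space of dimension $p$,
- (Γ,S) is a measurable space,
- $\gamma : \Theta \to \Gamma$ is a projection,
- $\mathcal{P}$ is the set of all possible distributions on Σ.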
For example,
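- $(\Omega, \mathcal{A}, P)$ is any probability space,
- $\Sigma = \mathbb{R}$,
- Θ is a set of pairs (x, y), for instance $\mathbb{R} \times \mathbb{R}^{+}$ with x a mean and y a variance,
- $\gamma : \Theta \to \mathbb{R}$ is defined by γ(x,y) = x.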
Estimator
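If, for each sample size n, $T_n : \Sigma^n \to \Gamma$ is a measurable function, then any $T_n$ is called an estimator, $(T_n)_{n \in \mathbb{N}}$ is an estimating sequence, and any value $T_n(x_1, \ldots, x_n)$ is called an estimate.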
Continuing the example of the previous paragraph, suppose now that $(X_n)_{n \in \mathbb{N}}$ is a sequence of random variables such that the $X_n$ are iid with distribution F. Then
$$T_n(X_1, \ldots, X_n) = \frac{1}{n} \sum_{i=1}^{n} X_i$$
defines an estimator. It is called the sample mean and is an estimating sequence for the expected value of any distribution for which the integral $\int_{\Omega} X \, dP$ is defined.
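As a small illustration of these definitions, the following Python sketch treats the sample mean as one function per sample size and evaluates the estimating sequence on the growing prefixes of a single observed sample. The names `sample_mean` and `estimating_sequence` and the data values are illustrative assumptions, not part of the definitions above.

```python
from fractions import Fraction

def sample_mean(xs):
    """T_n(x_1, ..., x_n) = (x_1 + ... + x_n) / n, for any n >= 1."""
    if not xs:
        raise ValueError("the sample mean needs at least one observation")
    return sum(xs) / len(xs)

def estimating_sequence(xs):
    """Return the estimates T_1(x_1), T_2(x_1, x_2), ..., T_n(x_1, ..., x_n)."""
    return [sample_mean(xs[:n]) for n in range(1, len(xs) + 1)]

# Hypothetical observed sample; exact fractions avoid rounding noise.
observations = [Fraction(2), Fraction(4), Fraction(3), Fraction(7)]
estimates = estimating_sequence(observations)
print(estimates)
```

Each entry of `estimates` is one estimate in the sense above: the value of the estimator $T_n$ on the first n observations.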
Desirable properties
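The definition of an estimator is not very restrictive. Indeed, instead of the sample mean, we could have chosen a function of the sample whose expected value is 0 whatever the distribution F is (the constant estimator $T_n = 0$ is the simplest such choice) to estimate the mean of F. This would be a very bad choice, because the expected value of $T_n$ is then 0 regardless of the true mean. We therefore need ways of assessing the quality of an estimator.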
For an estimator Tn of the parameter θ,
- the error is Tn − θ,
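- the bias is defined as the expected value of the error: $B(T_n) = E(T_n - \theta) = E(T_n) - \theta$,
- and the mean squared error is given by $\operatorname{MSE}(T_n) = E\left((T_n - \theta)^2\right)$.
An estimator whose bias is 0 is called unbiased. This is often regarded as the least we can ask from an estimator.
The following equality holds:
$$\operatorname{MSE}(T_n) = \operatorname{var}(T_n) + \left(B(T_n)\right)^2,$$
i.e. mean squared error = variance + square of bias, where var(X) is the variance of X and E(X) is the expected value of X.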
The standard deviation of an estimator of θ (the square root of its variance), or an estimate of that standard deviation, is called the standard error of the estimator.
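The identity "mean squared error = variance + square of bias" can be checked exactly on a small case. The sketch below assumes a fair coin coded as 0/1 (so the true mean is θ = 1/2), a sample of size n = 3, and a deliberately biased estimator that divides the sum by n + 1. These choices and the helper names `expectation` and `shrunk_mean` are illustrative assumptions.

```python
from fractions import Fraction
from itertools import product

def expectation(f, support, n):
    """Exact E[f(X_1, ..., X_n)] for iid draws that are uniform on `support`."""
    samples = list(product(support, repeat=n))
    return sum(f(s) for s in samples) / Fraction(len(samples))

def shrunk_mean(xs):
    """A biased estimator of the mean: divide the sum by n + 1 instead of n."""
    return Fraction(sum(xs), len(xs) + 1)

support = [0, 1]          # fair coin: Heads = 1, Tails = 0
n = 3
theta = Fraction(1, 2)    # true mean of the coin

mean_T = expectation(shrunk_mean, support, n)
bias = mean_T - theta
variance = expectation(lambda s: (shrunk_mean(s) - mean_T) ** 2, support, n)
mse = expectation(lambda s: (shrunk_mean(s) - theta) ** 2, support, n)

print(bias, variance, mse)
```

Because every quantity is an exact fraction, `mse == variance + bias ** 2` holds without any rounding tolerance.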
Unbiased estimators
The first quality we might expect from a useful estimator is for its expected value to be the quantity being estimated. Such an estimator is called an unbiased estimator.
$T_n$ is an unbiased estimator of θ iff its bias $B(T_n) = E(T_n) - \theta$ is 0 for all θ, or, equivalently, iff $E(T_n) = \theta$ for all θ.
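A standard illustration of unbiasedness is the estimation of a variance: dividing the sum of squared deviations by n gives a biased estimator, while dividing by n − 1 gives an unbiased one. The sketch below checks this exactly for a fair coin coded as 0/1 with n = 3. The model, the sample size and the function names are assumptions chosen to keep the enumeration tiny.

```python
from fractions import Fraction
from itertools import product

def expectation(f, support, n):
    """Exact E[f(X_1, ..., X_n)] for iid draws that are uniform on `support`."""
    samples = list(product(support, repeat=n))
    return sum(f(s) for s in samples) / Fraction(len(samples))

def sum_of_squares(xs):
    """Sum of squared deviations from the sample mean."""
    m = Fraction(sum(xs), len(xs))
    return sum((x - m) ** 2 for x in xs)

def var_n(xs):
    return sum_of_squares(xs) / len(xs)          # divide by n

def var_n_minus_1(xs):
    return sum_of_squares(xs) / (len(xs) - 1)    # divide by n - 1

support = [0, 1]                 # fair coin, so the true variance is 1/4
n = 3
true_variance = Fraction(1, 4)

bias_var_n = expectation(var_n, support, n) - true_variance
bias_var_n_minus_1 = expectation(var_n_minus_1, support, n) - true_variance
print(bias_var_n, bias_var_n_minus_1)
```

In this small case the divide-by-n estimator underestimates the variance on average, while the divide-by-(n − 1) estimator has bias exactly 0.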
Consistency
A consistent estimator is an estimator that converges in probability to the quantity being estimated as the sample size grows.
An estimator $t_n$ (where n is the sample size) is a consistent estimator for parameter θ if and only if, for all ε > 0, no matter how small, we have
$$\lim_{n \to \infty} P\left(|t_n - \theta| < \varepsilon\right) = 1.$$
It is called strongly consistent if it converges almost surely to the true value.
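To make the definition concrete, the probability that an estimator falls within ε of θ can be computed exactly in a simple model. The sketch below assumes iid Bernoulli(p) observations with p = 1/2, takes the sample mean as the estimator, and uses ε = 1/10. The function name `prob_within` and these parameter values are illustrative assumptions.

```python
from fractions import Fraction
from math import comb

def prob_within(n, p, eps):
    """Exact P(|T_n - p| < eps), where T_n is the mean of n iid Bernoulli(p)."""
    total = Fraction(0)
    for k in range(n + 1):                     # k = number of successes
        if abs(Fraction(k, n) - p) < eps:
            total += comb(n, k) * p**k * (1 - p)**(n - k)
    return total

p = Fraction(1, 2)
eps = Fraction(1, 10)
probabilities = {n: prob_within(n, p, eps) for n in (10, 100, 1000)}
for n, prob in probabilities.items():
    print(n, float(prob))
```

The printed probabilities increase towards 1 as n grows, which is the behaviour the definition requires. Three sample sizes illustrate the limit but do not prove it.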
Efficiency
The quality of an estimator is generally judged by its mean squared error.
However, occasionally one chooses the unbiased estimator with the lowest variance. Efficient estimators are those that have the lowest possible variance among all unbiased estimators. In some cases, a biased estimator may have a uniformly smaller mean squared error than does any unbiased estimator. For that and other reasons, it is sometimes preferable not to limit oneself to unbiased estimators; see bias (statistics). Concerning such "best unbiased estimators", see also Cramér-Rao inequality, Gauss-Markov theorem, Lehmann-Scheffé theorem, Rao-Blackwell theorem.
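The trade-off between bias and mean squared error can be seen in a small exact computation. The sketch below assumes n = 3 iid Bernoulli(p) observations and compares two estimators of the variance p(1 − p): the sum of squared deviations divided by n (biased) and divided by n − 1 (unbiased). It compares only these two estimators on a grid of p values, so it illustrates the phenomenon rather than establishing any general result. The function name is hypothetical.

```python
from fractions import Fraction
from itertools import product

def mse_of_variance_estimator(divisor, p, n):
    """Exact MSE of (sum of squared deviations) / divisor for n iid Bernoulli(p)."""
    true_variance = p * (1 - p)
    mse = Fraction(0)
    for sample in product((0, 1), repeat=n):
        k = sum(sample)
        weight = p**k * (1 - p)**(n - k)       # probability of this sample
        m = Fraction(k, n)                     # sample mean
        estimate = sum((x - m) ** 2 for x in sample) / divisor
        mse += weight * (estimate - true_variance) ** 2
    return mse

n = 3
grid = [Fraction(i, 10) for i in range(1, 10)]   # p = 0.1, 0.2, ..., 0.9
comparison = [
    (p, mse_of_variance_estimator(n, p, n), mse_of_variance_estimator(n - 1, p, n))
    for p in grid
]
biased_wins = all(biased < unbiased for _, biased, unbiased in comparison)
print(biased_wins)
```

On this grid the biased estimator has the smaller mean squared error at every value of p, even though only the other estimator is unbiased.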
Other properties
Often, estimators are subject to restrictions (restricted estimators).
Asymptotic value of an estimating sequence
The law of large numbers states that the sample mean converges to the mean of the sampled distribution, provided that mean exists. This is what we would expect: the estimate gets better as we have more values to consider. The limit of an estimating sequence as the sample size grows to infinity is called its asymptotic value.
More precisely, suppose that $(T_n)$ is an estimating sequence and $\lim_{n \to \infty} T_n = T$; then $T$ is called the asymptotic value of $(T_n)$.
Types of estimators
Several types of estimators exist, each corresponding to a different view of the problem.
Maximum likelihood estimators
Following the notation of the previous example and supposing the random variables are discrete, the likelihood of a sample is simply the probability of observing that particular sample: $P(X_1 = x_1, \ldots, X_n = x_n)$. We could also, having observed a particular sample, consider the likelihood as a function of a parameter of the model. For example, if we toss two coins and get (Heads, Tails), the likelihood is p(1 − p) (supposing the probability of getting Heads is p and the probability of getting Tails is 1 − p). This is clearly a function of p, which happens to be the parameter of the Bernoulli distribution we used.
More generally:
- For a discrete distribution p with parameter θ, the likelihood function is defined by: $L(\theta) = \prod_{i=1}^{n} p(x_i \mid \theta)$.
- For continuous distributions with parameter θ and density f(. | θ), the likelihood function is defined by: $L(\theta) = \prod_{i=1}^{n} f(x_i \mid \theta)$.
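For the coin example, the likelihood can be evaluated directly as a function of p. The sketch below is a minimal illustration, assuming outcomes are coded as the strings "H" and "T" and that tosses are independent. The function name `bernoulli_likelihood` is hypothetical.

```python
from fractions import Fraction

def bernoulli_likelihood(sample, p):
    """Product over the sample of p (for Heads) or 1 - p (for Tails)."""
    likelihood = Fraction(1)
    for outcome in sample:
        likelihood *= p if outcome == "H" else 1 - p
    return likelihood

sample = ["H", "T"]     # the (Heads, Tails) sample from the text
for p in (Fraction(1, 4), Fraction(1, 2), Fraction(3, 4)):
    print(p, bernoulli_likelihood(sample, p))
```

For this sample the function returns p(1 − p), so among the three values tried it is largest at p = 1/2.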
If we do not know the value of the parameter θ, we might want to find the one that is the most likely given a specific sample.
If $L(\theta)$ is the likelihood function in a given parametric model, then the maximum likelihood estimator of a parameter θ is defined by
$$\hat{\theta} = \arg\max_{\theta \in \Theta} L(\theta).$$
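As a worked instance of this definition, consider n independent tosses of a coin with unknown Heads probability p, of which k are Heads. The symbols n, k and $\hat{p}$ are introduced here for illustration only. A sketch of the usual calculation, assuming 0 < k < n so that the logarithms are finite:

```latex
\begin{align*}
L(p) &= p^{k}(1-p)^{n-k}, \\
\log L(p) &= k \log p + (n-k)\log(1-p), \\
\frac{d}{dp}\log L(p) &= \frac{k}{p} - \frac{n-k}{1-p} = 0
  \quad\Longrightarrow\quad \hat{p} = \frac{k}{n}, \\
\frac{d^{2}}{dp^{2}}\log L(p) &= -\frac{k}{p^{2}} - \frac{n-k}{(1-p)^{2}} < 0 .
\end{align*}
```

The negative second derivative confirms that the stationary point is a maximum. For the sample (Heads, Tails), n = 2 and k = 1, so the likelihood p(1 − p) is maximized at $\hat{p} = 1/2$.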
Bayes estimators
Minimax estimators
Interval estimators
References
See also
- Maximum likelihood
- Method of moments, generalized method of moments
- Cramér-Rao inequality
- Minimum mean squared error (MMSE)
- Maximum a posteriori (MAP)
- Minimum variance unbiased estimator (MVUE)
- Best linear unbiased estimator (BLUE)
- Unbiased estimators — see bias (statistics).
- Particle filter
- Markov chain Monte Carlo (MCMC)
- Kalman filter
- Wiener filter
Books and lecture notes
- G. Saporta, Probabilités, Analyse des Données et Statistiques, Editions TECHNIP, 1990.
- Prof. R. Viertl, Angewandte Statistik, lecture notes from the Technical University of Vienna, 2004.
- Prof. K. Felsenstein, Theorie statistischer Schätzung, lecture notes from the Technical University of Vienna, 2004.
