Given the data assume a set of distributions on .

Assume is a sample from , indipendent and identically distribuited random variables for the same .

Goal

We want to estimate the true value of the data comes from.

Because in real applications there is not a “true” value for , we are searching for value that more probably reprensents our data .

Definition

is a MLE for if

or, more precisely, .

.

Remark

  • The MLE it might not be unique.
  • The MLE may fail to exists.

Pros

  • Easy to compute
  • Intuitive interpretation
  • Asymptotic properties
    • Consistent (as it converges to the “true” value of )
    • Normal (central limit theorem)
    • Efficient (is the best possible estimate of )
  • Invariant under reparametrization ( is a MLE for )

Cons

  • Is a point estimate, so has no rapresentation of uncertainty
  • Can overfit
  • Existence and uniqueness are not guaranteed.

MLE for Gaussian mean and variance

Choose that maximizes the probability of observed data.

From here we obtain