Posted in : Machine Learning
Machine Learning - MLE
Given the data assume a set of distributions on .
Assume is a sample from , indipendent and identically distribuited random variables for the same .
Goal
We want to estimate the true value of the data comes from.
Because in real applications there is not a “true” value for , we are searching for value that more probably reprensents our data .
Definition
is a MLE for if
or, more precisely, .
.
Remark
- The MLE it might not be unique.
- The MLE may fail to exists.
Pros
- Easy to compute
- Intuitive interpretation
- Asymptotic properties
- Consistent (as it converges to the “true” value of )
- Normal (central limit theorem)
- Efficient (is the best possible estimate of )
- Invariant under reparametrization ( is a MLE for )
Cons
- Is a point estimate, so has no rapresentation of uncertainty
- Can overfit
- Existence and uniqueness are not guaranteed.
MLE for Gaussian mean and variance
Choose that maximizes the probability of observed data.
From here we obtain