Posted in : Machine Learning
Machine Learning - MAP
Given the data assume a joint distribution .
Assume is a sample from , indipendent and identically distribuited random variables for the same .
Goal
Choose a good value of for . Rather than estimating a single , we obtain a distribution over possible values of .
Definition
is a MAP for if
From the Bayes theorem:
- is the posterior distribution on given the data
- is the likelihood function
- is the prior distribution.
Pros
- Easy to compute and interpretable
- Avoid overfitting
- As tends to looks like the MLE (remember that the MLE has some nice asymptotic properties, that the MAP does not have)
Cons
- Being a point estimate has no representation of uncertainty of
- Not invariant under reparametrization (unlike MLE)
- You have to do assume the prior on .