Given the data assume a joint distribution .

Assume is a sample from , indipendent and identically distribuited random variables for the same .

Goal

Choose a good value of for . Rather than estimating a single , we obtain a distribution over possible values of .

Definition

is a MAP for if

From the Bayes theorem:

  • is the posterior distribution on given the data
  • is the likelihood function
  • is the prior distribution.

Pros

  • Easy to compute and interpretable
  • Avoid overfitting
  • As tends to looks like the MLE (remember that the MLE has some nice asymptotic properties, that the MAP does not have)

Cons

  • Being a point estimate has no representation of uncertainty of
  • Not invariant under reparametrization (unlike MLE)
  • You have to do assume the prior on .