Application of the likelihood function in phylogenetic analysis.

Huelsenbeck JP & Bollback JP

(2001) In "Handbook of Statistical Genetics", John Wiley and Sons CH. 15, 415-439.

The likelihood of a phylogenetic tree is proportional to the probability of observing the comparative data (such as aligned DNA sequences) conditional on the tree. The likelihood function is important because it is the vehicle that carried the observations. The likelihood function can be used in two ways to infer phylogeny. First, the tree that maximizes the likelihood can be chosen as the best estimate of phylogeny; this is the method of maximum likelihood. Second, a prior probability distribution on trees can be specified and inferences based upon the posterior probability distribution of trees; this is the approach taken by Bayesians. Although maximum likelihood and Bayesian inference are similar in that the same models of DNA substitution can be used to calculate the likelihood function, they differ in their interpretation of probability. Markov chain Monte Carlo (MCMC) can be used to approximate the posterior probability of trees. MCMC also makes it possible to perform comparative analyses that accommodate phylogenetic uncertainty.

View/Download PDF

 
Andrew Rambaut, 2007