Distributions of topological tree metrics between a species tree and a gene tree


DOI: 10.1007/s10463-016-0557-x

Cite this article as:
Xi, J., Xie, J. & Yoshida, R. Ann Inst Stat Math (2016). doi:10.1007/s10463-016-0557-x


In order to conduct a statistical analysis on a given set of phylogenetic gene trees, we often use a distance measure between two trees. In a statistical distance-based method to analyze discordance between gene trees, it is a key to decide “biologically meaningful” and “statistically well-distributed” distance between trees. Thus, in this paper, we study the distributions of the three tree distance metrics: the edge difference, the path difference, and the precise K interval cospeciation distance, between two trees: First, we focus on distributions of the three tree distances between two random unrooted trees with n leaves (\(n \ge 4\)); and then we focus on the distributions the three tree distances between a fixed rooted species tree with n leaves and a random gene tree with n leaves generated under the coalescent process with the given species tree. We show some theoretical results as well as simulation study on these distributions.


Coalescent Phylogenetics Tree metrics Tree topologies 

Copyright information

© The Institute of Statistical Mathematics, Tokyo 2016

Authors and Affiliations

  1. 1.Department of MathematicsNorth Carolina State UniversityRaleighUSA
  2. 2.Statistics DepartmentUniversity of KentuckyLexingtonUSA
  3. 3.Statistics DepartmentUniversity of KentuckyLexingtonUSA