Untangling Tanglegrams: Comparing Trees by Their Drawings

Balaji Venkatachalam, Jim Apple, Katherine St. John, Daniel Gusfield
2010 IEEE/ACM Transactions on Computational Biology & Bioinformatics  
A tanglegram is a pair of trees on the same set of leaves with matching leaves in the two trees joined by an edge. Tanglegrams are widely used in biology -to compare evolutionary histories of host and parasite species and to analyze genes of species in the same geographical area. We consider optimizations problems in tanglegram drawings. We show a linear time algorithm to decide if a tanglegram admits a planar embedding by a reduction to the planar graph drawing problem. This problem was
more » ... problem was considered by Fernau, Kauffman and Poths. (FSTTCS 2005). Our reduction method helps to solve a conjecture they posed, showing a fixed-parameter tractable algorithm for minimizing the number of crossings over all d-ary trees. For the case where one tree is fixed, we show an O(n log n) algorithm to determine the drawing of the second tree that minimizes the number of crossings. This improves the bound from earlier methods. We introduce a new optimization criterion using Spearman's footrule optimization and give an O(n 2 ) algorithm. We also show integer programming formulations to quickly obtain tanglegram drawings that minimize the two optimization measures discussed. We prove lower bounds on the maximum gap between the optimal solution and the heuristic of Dwyer and Schreiber (Austral. Symp. on Info. Vis. 2004) to minimize crossings.
doi:10.1109/tcbb.2010.57 pmid:20530818 fatcat:mfdvgonzgvdnngk2ha67cnqbgu