Phylogenetic targeting
References:
Arnold, C., Nunn, C. L. 2010. Phylogenetic Targeting of Research Effort in Evolutionary Biology. American Naturalist 176:601-612. [PDF]
Project Description:
Here, I will introduce one of my previous research projects,
Phylogenetic Targeting, which is more fully described in the official publication and in my
Master's thesis [pdf]. For more information and a web
implementation of the program, see http://phylotargeting.fas.harvard.edu. In what follows, we provide an abstract of the project.
Many questions in comparative biology require that
new data be collected, either to build a comparative database for the
first time or to augment existing data. Given resource limitations in
collecting data, which species should be studied to increase the size of
comparative data sets? By taking the hypotheses, other comparative data
relevant to the hypotheses, and an estimate of phylogeny, we show that a
method of phylogenetic targeting can systematically identify the
species that offer the greatest statistical power to test the
hypotheses. Phylogenetic targeting selects potential candidates for
future data collection based on a flexible scoring system that maximizes
the differences in pairwise comparisons. The method can control for
confounding variables, or it can maximize the power to test competing
hypotheses. We used simulations to assess the performance of
phylogenetic targeting, as compared to a less systematic approach of
randomly selecting species (as might occur when data have been collected
on species without regard to variation in the traits of interest). The
simulations revealed that selection of species using phylogenetic
targeting increases the statistical power to detect correlations and
that power increases with the number of species in the clade even when
the number of samples collected was not increased. We also developed a
web-based, freely available and publicly accessible computer program
PhyloTargeting that implements the approach. It provides a
user-friendly interface, a variety of options to analyze the data set,
and graphical visualizations of the results.