概率论系列报告(讨论班)—Maximum likelihood implementation of an isolation-with-migration model for three species(最大似然法框架下的多物种隔离移民模型研究)
主 题: 概率论系列报告(讨论班)—Maximum likelihood implementation of an isolation-with-migration model for three species(最大似然法框架下的多物种隔离移民模型研究)
报告人: 朱天琪 博士 (中科院数学与系统科公司)
时 间: 2017-05-15 15:00-16:00
地 点: 理科1号楼1303
Abstract: We develop a maximum likelihood (ML) method for estimating migration rates between species using genomic sequence data. A species tree is used to accommodate the phylogenetic relationships among three species, allowing for migration between the two sister species, while the third species is used as an out-group. A Markov chain characterization of the genealogical process of coalescence andmigrationis used tointegrate out themigration histories at eachlocus analytically, whereas Gaussian quadrature is used to integrate over the coalescent times on each genealogical tree numerically. Our implementation can accommodate tens of thousands of loci, making it feasible to analyze genome-scale data sets to test for gene flow. We calculate the posterior probabilities of gene trees at individual loci to identify genomic regions that are likely to have been transferred between species due to gene flow. We conduct a simulation study to examine the statistical properties of the likelihood ratio test for gene flow between the two in-group species and of the ML estimates of model parameters such as the migration rate. Inclusion of data from a third out-group species is found to increase dramatically the power of the test and the precision of parameter estimation. We compiled and analyzed several genomic data sets from the Drosophila fruit flies. We discuss the utility of the multispecies coalescent model for species tree estimation, accounting for incomplete lineage sorting and migration. [IM model, maximum likelihood, multispecies coalescent, migration, speciation.]