Testing Patterns of Nucleotide Substitution Rates at Multiple Genes

dc.contributor.advisorSpencer V. Muse, Committee Chairen_US
dc.contributor.authorTao, Wenlien_US
dc.date.accessioned2010-04-02T19:18:48Z
dc.date.available2010-04-02T19:18:48Z
dc.date.issued2002-11-17en_US
dc.degree.disciplineBiomathematicsen_US
dc.degree.leveldissertationen_US
dc.degree.namePhDen_US
dc.description.abstractStudying patterns of nucleotide substitution rates at multiple loci can help provide clues to the evolution and function of genes. The computational drawback of the maximum likelihood version of relative ratio tests becomes a concern when a large number of pairwise comparisons are performed among multiple genes. We propose a new version of relative ratio test, including four procedures, based on the use of pairwise sequence distances. The first is based on ANOVA two-way model and allows covariances between branch lengths. The second method applies generalized estimation equations (GEEs) to Poisson regression in a log-linear model. The third one is a nonparametric approach based on bootstrap percentile confidence intervals. The fourth method is based on weighted least squares estimation with covariances. Simulation studies have been conducted to compare Type I errors and powers between the likelihood version of test and the first three proposed methods. The formula have been derived for the last method as well as the numerical steps. The ANOVA-based method is the least computationally expensive and it has desirable Type I errors in most cases as well as good powers. The bootstrap-based method is the slowest among the four methods, but with smallest Type I errors and powers similar to the ANOVA-based method. The Likelihood-based method is the second slowest and has more desirable Type I errors than those of the ANOVA-based method, but has less powers than the ANOVA-based method. The GEE-based method is suitable only for very long genes, but has good statistical properties. The ANOVA-based method is applied to mtDNA sequences from a broad range of animal mitochondrial genomes. The results indicate that it is uncommon that branch lengths are conserved well among animal mitochondrial genes.en_US
dc.identifier.otheretd-07182002-170307en_US
dc.identifier.urihttp://www.lib.ncsu.edu/resolver/1840.16/5725
dc.rightsI hereby certify that, if appropriate, I have obtained and attached hereto a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dissertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to NC State University or its agents the non-exclusive license to archive and make accessible, under the conditions specified below, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report.en_US
dc.subjectmitochondrial genomeen_US
dc.subjectGEEsen_US
dc.subjectweighted least squares with covariancesen_US
dc.subjectbootstrapen_US
dc.subjectPoisson regression in a log-linear modelen_US
dc.subjectANOVA with covariancesen_US
dc.titleTesting Patterns of Nucleotide Substitution Rates at Multiple Genesen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
etd.pdf
Size:
778.12 KB
Format:
Adobe Portable Document Format

Collections