Statistical Topics in Disease Gene Mapping

dc.contributor.advisorBruce S. Weir, Committee Chairen_US
dc.contributor.advisorMargaret G. Ehm, Committee Co-Chairen_US
dc.contributor.advisorZhao-Bang Zeng, Committee Memberen_US
dc.contributor.advisorRuss Wolfinger, Committee Memberen_US
dc.contributor.advisorGreg Gibson, Committee Memberen_US
dc.contributor.advisorJonathan Allen, Committee Memberen_US
dc.contributor.authorMeng, Zhaolingen_US
dc.date.accessioned2010-04-02T18:29:39Z
dc.date.available2010-04-02T18:29:39Z
dc.date.issued2003-04-14en_US
dc.degree.disciplineBioinformaticsen_US
dc.degree.leveldissertationen_US
dc.degree.namePhDen_US
dc.description.abstractEfforts in disease gene mapping have achieved a great deal of success in mendelain diseases, but made slower progress in common disease studies because of their complexity. The rapid development of genetics and molecular technologies provides an immense amount of DNA data; developing powerful and efficient statistical methodologies is under high demand. This dissertation explored some aspects of the problem. The power of two genome-wide disease gene mapping strategies is investigated. One applies linkage analysis and then linkage disequilibrium (LD) tests to markers within linked regions. The other looks for LD with disease using all markers. The results showed that the genome-wide association based tests are much more likely to identify genes. Genotyping closely spaced Single Nucleotide Polymorphisms (SNPs) frequently yields highly correlated data due to extensive LD, and gives association studies unnecessary and unaffordable burden when these markers don't yield significantly different information. Two procedures are developed to select an optimum subset of SNPs that could be efficiently genotyped on larger numbers of samples while retaining most of the information based on genotypes of a large initial set of SNPs on a small number of samples. One utilizes a spectral decomposition method based on matrices of pair-wise LD, and the other extends David Clayton's htSNP selection method. Properties of the procedures are studied; minimum sample sizes needed for achieving consistent results are recommended; the procedures are evaluated using experimental data. Studying gene-treatment interaction is a long desired problem. When the genetic variation that is being tested is not specific functional sites but randomly selected polymorphisms, a source of randomness is introduced. A mixed effect model is developed to fit fixed treatment effects, random haplotypic effects, and random gene-treatment interactions in this scenario; likelihood ratio tests are applied for testing the random effects. Our simulation results showed that the mixed effect model is valid and generally behaves better than the fixed haplotypic effects model in the exploratory phase of a study.en_US
dc.identifier.otheretd-03282003-010933en_US
dc.identifier.urihttp://www.lib.ncsu.edu/resolver/1840.16/3391
dc.rightsI hereby certify that, if appropriate, I have obtained and attached hereto a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dissertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to NC State University or its agents the non-exclusive license to archive and make accessible, under the conditions specified below, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report.en_US
dc.subjectAssociation disease gene mapping linkage disequilien_US
dc.titleStatistical Topics in Disease Gene Mappingen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
etd.pdf
Size:
533.96 KB
Format:
Adobe Portable Document Format

Collections