Domain Enhanced Analysis of Microarray Data Using GO Annotations

dc.contributor.advisorGeoff Benson, Committee Memberen_US
dc.contributor.advisorJack Alan Menius, Committee Memberen_US
dc.contributor.advisorJacqueline M. Hughes-Oliver, Committee Chairen_US
dc.contributor.advisorJason Osborne, Committee Co-Chairen_US
dc.contributor.advisorSidney Stanley Young, Committee Memberen_US
dc.contributor.authorLiu, Jiajunen_US
dc.date.accessioned2010-04-02T19:13:14Z
dc.date.available2010-04-02T19:13:14Z
dc.date.issued2008-08-17en_US
dc.degree.disciplineStatisticsen_US
dc.degree.leveldissertationen_US
dc.degree.namePhDen_US
dc.description.abstractNew biological systems technologies give scientists the ability to measure thousands of bio-molecules including genes, proteins, lipids and metabolites. We use domain knowledge, e.g., the Gene Ontology, to guide analysis of such data. By focusing on domain-aggregated results at, say the molecular function level, increased interpretability is available to biological scientists beyond what is possible if results are presented at the gene level. We use a 'top-down' approach to perform domain aggregation by first combining gene expressions before testing for differentially expressed patterns. This is in contrast to the more standard 'bottom-up' approach where genes are first tested individually then aggregated by domain knowledge. The benefits are greater sensitivity for detecting signals. In DEA procedure, the first scores from the PLS procedure are used to test for differentially expressed patterns using the t test. We find the general t test inadequate for adjusting for the number of genes within each GO term. New tests are proposed by finding the true null distribution of each PLS score adjusted for the size of the GO term. Our method is assessed using a series of simulation studies. Furthermore, we also discuss the impact of our testing procedure with different coding of our classification response variable, namely 0⁄1 or -1⁄1 for data with two classes.en_US
dc.identifier.otheretd-07272007-102916en_US
dc.identifier.urihttp://www.lib.ncsu.edu/resolver/1840.16/5402
dc.rightsI hereby certify that, if appropriate, I have obtained and attached hereto a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dis sertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to NC State University or its agents the non-exclusive license to archive and make accessible, under the conditions specified below, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report.en_US
dc.subjectGene Ontologyen_US
dc.subjectMicroarrayen_US
dc.titleDomain Enhanced Analysis of Microarray Data Using GO Annotationsen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
etd.pdf
Size:
1.22 MB
Format:
Adobe Portable Document Format

Collections