Domain Enhanced Analysis of Microarray Data Using GO Annotations
| dc.contributor.advisor | Geoff Benson, Committee Member | en_US |
| dc.contributor.advisor | Jack Alan Menius, Committee Member | en_US |
| dc.contributor.advisor | Jacqueline M. Hughes-Oliver, Committee Chair | en_US |
| dc.contributor.advisor | Jason Osborne, Committee Co-Chair | en_US |
| dc.contributor.advisor | Sidney Stanley Young, Committee Member | en_US |
| dc.contributor.author | Liu, Jiajun | en_US |
| dc.date.accessioned | 2010-04-02T19:13:14Z | |
| dc.date.available | 2010-04-02T19:13:14Z | |
| dc.date.issued | 2008-08-17 | en_US |
| dc.degree.discipline | Statistics | en_US |
| dc.degree.level | dissertation | en_US |
| dc.degree.name | PhD | en_US |
| dc.description.abstract | New biological systems technologies give scientists the ability to measure thousands of bio-molecules including genes, proteins, lipids and metabolites. We use domain knowledge, e.g., the Gene Ontology, to guide analysis of such data. By focusing on domain-aggregated results at, say the molecular function level, increased interpretability is available to biological scientists beyond what is possible if results are presented at the gene level. We use a 'top-down' approach to perform domain aggregation by first combining gene expressions before testing for differentially expressed patterns. This is in contrast to the more standard 'bottom-up' approach where genes are first tested individually then aggregated by domain knowledge. The benefits are greater sensitivity for detecting signals. In DEA procedure, the first scores from the PLS procedure are used to test for differentially expressed patterns using the t test. We find the general t test inadequate for adjusting for the number of genes within each GO term. New tests are proposed by finding the true null distribution of each PLS score adjusted for the size of the GO term. Our method is assessed using a series of simulation studies. Furthermore, we also discuss the impact of our testing procedure with different coding of our classification response variable, namely 0⁄1 or -1⁄1 for data with two classes. | en_US |
| dc.identifier.other | etd-07272007-102916 | en_US |
| dc.identifier.uri | http://www.lib.ncsu.edu/resolver/1840.16/5402 | |
| dc.rights | I hereby certify that, if appropriate, I have obtained and attached hereto a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dis sertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to NC State University or its agents the non-exclusive license to archive and make accessible, under the conditions specified below, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report. | en_US |
| dc.subject | Gene Ontology | en_US |
| dc.subject | Microarray | en_US |
| dc.title | Domain Enhanced Analysis of Microarray Data Using GO Annotations | en_US |
Files
Original bundle
1 - 1 of 1
