Spectral Analysis of Protein Sequences

dc.contributor.advisorJeffrey Thorne, Committee Memberen_US
dc.contributor.advisorWilliam Atchley, Committee Co-Chairen_US
dc.contributor.advisorCharles Smith, Committee Chairen_US
dc.contributor.advisorBruce Weir, Committee Memberen_US
dc.contributor.authorWang, Zhien_US
dc.date.accessioned2010-04-02T19:07:59Z
dc.date.available2010-04-02T19:07:59Z
dc.date.issued2005-10-25en_US
dc.degree.disciplineBioinformaticsen_US
dc.degree.leveldissertationen_US
dc.degree.namePhDen_US
dc.description.abstractThe purpose of this research is to elucidate how to apply spectral analysis methods to understand the structure, function and evolution of protein sequences. In the first part of this research, spectral analyses have been applied to the basic- helix-loop-helix (bHLH) family of transcription factors. It is shown that the periodicity of the bHLH variability pattern (entropy profile) conforms to the classical alpha-helix periodicity of 3.6 amino acids per turn. Further, the underlying physiochemical attributes profiles (factor score profiles) are examined and their periodicities also have significant implications of the alpha-helix secondary structure. It is suggested that the entropy profile can be well explained by the five factor score variance components that reflect the polarity/hydrophobicity, secondary structure information, molecular volume, codon composition and electrostatic charge attributes of amino acids. In the second part of this research, complex demodulation (CDM) method is introduced in an attempt to quantify the amplitude of periodic components in protein sequences. Proteins are often considered to be 'multiple domain entities' because they are composed of a number of functionally and structurally distinct domains with potentially independent origins. The analyses of bZIP and bHLH-PAS protein domains found that complex demodulation procedures can provide important insight about functional and structural attributes. It is found that the local amplitude minimums or maximums are associated with the boundary between two structural or functional components. In the third part of this research, the periodicity evaluation of a leucine zipper protein domain with a well-known structure is used to rank 494 published indices summarized in a database (http://www.genome.jp/dbget/aaindex.html). This application allows us to select those amino acid indices that are strongly associated with the protein structure and hereby to promote the protein structure prediction. This procedure can be used to reduce some redundancy of the amino acid indices.en_US
dc.identifier.otheretd-07072005-153157en_US
dc.identifier.urihttp://www.lib.ncsu.edu/resolver/1840.16/5111
dc.rightsI hereby certify that, if appropriate, I have obtained and attached hereto a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dissertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to NC State University or its agents the non-exclusive license to archive and make accessible, under the conditions specified below, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report.en_US
dc.subjectentropyen_US
dc.subjectperiodicityen_US
dc.subjectfactor scoreen_US
dc.subjectcomplex demodulationen_US
dc.subjectfunctional regionen_US
dc.subjectspectral analysisen_US
dc.titleSpectral Analysis of Protein Sequencesen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
etd.pdf
Size:
1.37 MB
Format:
Adobe Portable Document Format

Collections