P-Coffee: A New Divide-and-conquer Method for Multiple Sequence Alignment

dc.contributor.advisorDr. Subhashis Ghosal, Committee Memberen_US
dc.contributor.advisorDr. Dennis R. Bahler, Committee Chairen_US
dc.contributor.advisorDr. Jon Doyle, Committee Memberen_US
dc.contributor.authorChoi, Kwangbomen_US
dc.date.accessioned2010-04-02T17:57:45Z
dc.date.available2010-04-02T17:57:45Z
dc.date.issued2005-01-19en_US
dc.degree.disciplineComputer Scienceen_US
dc.degree.levelthesisen_US
dc.degree.nameMSen_US
dc.description.abstractWe describe a new divide-and-conquer method, P-Coffee, for alignment of multiple sequences. P-Coffee first identifies candidate alignment columns using a position-specific substitution matrix (the T-Coffee extended library), tests those columns, and accepts only qualified ones. Accepted columns do not only constitute a final alignment solution, but also divide a given sequence set into partitions. The same procedure is recursively applied to each partition until all the alignment columns are collected. In P-Coffee, we minimized the source of bias by aligning all the sequences simultaneously without requiring any heuristic function to optmize, phylogenetic tree, nor gap cost scheme. In this research, we show the performance of our approach by comparing our results with that of T-Coffee using the 144 test sets provided in BAliBASE v1.0. P-Coffee outperformed T-Coffee in accuracy especially for more complicated test sets.en_US
dc.identifier.otheretd-01182005-060947en_US
dc.identifier.urihttp://www.lib.ncsu.edu/resolver/1840.16/692
dc.rightsI hereby certify that, if appropriate, I have obtained and attached hereto a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dissertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to NC State University or its agents the non-exclusive license to archive and make accessible, under the conditions specified below, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report.en_US
dc.subjectmultiple sequence alignmenten_US
dc.subjectpartition wallen_US
dc.subjectwall identificationen_US
dc.subjectwall selectionen_US
dc.titleP-Coffee: A New Divide-and-conquer Method for Multiple Sequence Alignmenten_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
etd.pdf
Size:
523.61 KB
Format:
Adobe Portable Document Format

Collections