Development of Digital Signal Processing and Statistical Classification Methods for Distinguishing Nasal Consonants

dc.contributor.advisorZhang,Hao, Committee Memberen_US
dc.contributor.advisorDonald Bitzer, Committee Memberen_US
dc.contributor.advisorDavid Mcallister, Committee Chairen_US
dc.contributor.authorWang, Mengen_US
dc.date.accessioned2010-04-02T18:15:46Z
dc.date.available2010-04-02T18:15:46Z
dc.date.issued2003-10-06en_US
dc.degree.disciplineOperations Researchen_US
dc.degree.levelthesisen_US
dc.degree.nameMSen_US
dc.description.abstractFor almost half a century, people have been looking for efficient classifiers to distinguish two nasal sounds, / / from / /, uttered by a single speaker. From the middle of the last decade, there has been little progress in research on this topic. In recent years, we, researchers of the Voice I/O Group in Department of Computer Science at North Carolina State University, have conducted some new trials on this classical problem. In this thesis, those trials are briefly summarized. Instead of simply using the Fourier transform to produce the spectra as people usually did in the past, the author uses other kinds of transforms to extract more feature differences between / / and / /. The new transforms can be the alternatives of frequencies, such as singular values or eigenvalues, or even other transforms such as wavelets, which can deal with non-stationary systems quite well. We combine together the old and new features to get a larger feature vector, which will bring more classification information. We collect multiple voice samples of a single speaker and calculate the above feature representations, then use them as input of some popular statistical classification techniques, such as Principle Component Analysis (PCA), Discriminant Analysis (DA), and Support Vector Machine (SVM). By way of one training process, one testing process, and one heuristic scheme, we can identify the nasals with low error rates.en_US
dc.identifier.otheretd-09252003-161752en_US
dc.identifier.urihttp://www.lib.ncsu.edu/resolver/1840.16/2600
dc.rightsI hereby certify that, if appropriate, I have obtained and attached hereto a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dissertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to NC State University or its agents the non-exclusive license to archive and make accessible, under the conditions specified below, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report.en_US
dc.subjectDigital signal processingen_US
dc.subjectstatisticsen_US
dc.subjectclassificationen_US
dc.subjectnasal consonantsen_US
dc.titleDevelopment of Digital Signal Processing and Statistical Classification Methods for Distinguishing Nasal Consonantsen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
etd.pdf
Size:
12.11 MB
Format:
Adobe Portable Document Format

Collections