Analyzing Longitudinal Data with Non-ignorable Missing

dc.contributor.advisorSujit Ghosh, Committee Co-Chairen_US
dc.contributor.advisorSubhashis Ghosal, Committee Co-Chairen_US
dc.contributor.authorZhu, Lianshengen_US
dc.date.accessioned2010-04-02T19:09:18Z
dc.date.available2010-04-02T19:09:18Z
dc.date.issued2006-12-28en_US
dc.degree.disciplineStatisticsen_US
dc.degree.leveldissertationen_US
dc.degree.namePhDen_US
dc.description.abstractIn longitudinal studies, data are often missing despite every attempt made to collect complete data. When the missingness is informative and hence not ignorable, it is generally difficult to analyze non-ignorable missing (NIM) data since the distributional assumptions about missing data are not easily verifiable using traditional goodness of fit tests or otherwise. Selection models and pattern-mixture models are two common approaches to analyze NIM data. Each approach has its advantages and disadvantages. Methods proposed in this thesis fall into the category of pattern-mixture models. Traditionally, patterns are determined by time to occurrence of missing. This definition often results into the problem of not all parameters being identifiable. Moreover, marginalization is commonly required and can be very tricky when outcomes are discrete. It is recognized that patterns can and need to be defined by covariates, surrogate variables and⁄or time to missing. We propose two approaches to model NIM data: (i) pseudo-imputation (PI) approach, in which we first obtain predictive means within each pattern, get transformed predictive means by using a suitable link function and then fit with covariates to obtain marginal estimates; (ii) joint-modeling (JM) approach, in which patterns considered as random effects are marginalized within a generalized linear mixed model framework. The JM approach is shown to be able to capture the dependence of missing indicators on missing outcomes in some degree as is the case with NIM data. Some of the main advantages of these proposed approaches include (i) the capability to handle both continuous and discrete responses, (ii) avoidance of the problem of under-identifiability, (iii) availability of marginal estimates, and (iv) computational efficiency. When the missingness does depend on the patterns, results based on simulated data suggest that both approaches yield accurate estimates if the underlying number of patterns is specified correctly. Otherwise the PI method leads to biased results whereas the JM approach still provides reasonably accurate estimates. Finally, we extend our approaches to a generalized additive model (GAM) replacing the GLM framework. When the underlying relationship is highly non-linear, our extended approaches with a GAM framework provide flexibility and more accurate estimates. The JM approach along with generalized additive models can provide more flexibility than the PI approach since it uses a more robust model for the missing indicatoren_US
dc.identifier.otheretd-07302006-123757en_US
dc.identifier.urihttp://www.lib.ncsu.edu/resolver/1840.16/5194
dc.rightsI hereby certify that, if appropriate, I have obtained and attached hereto a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dissertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to NC State University or its agents the non-exclusive license to archive and make accessible, under the conditions specified below, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report.en_US
dc.subjectLongitudinalen_US
dc.subjectnon-ignorable missingen_US
dc.subjectpattern-mixtureen_US
dc.subjectjointing modelingen_US
dc.subjectpseudo-imputationen_US
dc.titleAnalyzing Longitudinal Data with Non-ignorable Missingen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
etd.pdf
Size:
407.97 KB
Format:
Adobe Portable Document Format

Collections