Accounting for heterogeneity in the dependence mechanism of longitudinal data

Loading...
Thumbnail Image
Date
2022-01-03
Authors
Hoque, Md. Erfanul
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract

Longitudinal data occur frequently in practice where measurements are collected from subjects over time with an aim to understand the dependence mechanisms among these measurements. A major challenge in longitudinal data analysis is the presence of a complex dependence structure due to both between and within individual heterogeneity. This thesis develops new statistical methodologies that incorporate potential heterogeneity in the dependence structure in various longitudinal data problems.

In the first part, we introduce a D-vine copula-based heterogeneous dependence model which provides a flexible representation of time-heterogeneous dependence in univariate longitudinal data with a continuous outcome. The proposed model allows for time adjustment in the dependence structure of unequally spaced and potentially unbalanced longitudinal data. We show that the proposed approach offers flexibility over its time-homogeneous counterparts as well as allows for parsimonious model specifications at the tree or vine level for a given D-vine structure. The performances of the time-heterogeneous D-vine copula models are evaluated through simulation studies and by real data from the Manitoba Follow-up Study.

In the second part, we propose an approach to incorporate potential heterogeneity in the random effects covariance matrix in longitudinal data with missing responses and mismeasured covariates. The proposed approach uses a modified Cholesky decomposition and allows the random effects covariance matrix to depend on covariates. This decomposition provides an unconstrained and statistically meaningful reparameterization of the random effect covariance matrix which can be modeled without the concern of positive definiteness of the resulting estimators. The performance of the proposed approach is evaluated through simulation studies and is demonstrated using longitudinal data from Framingham Heart Study.

In the last part, we review two major statistical models for longitudinal functional data that are spatially correlated and propose a computationally efficient modeling approach by incorporating a spatio-temporal dependence structure in the error process. Numerical experiments are conducted to compare these models and to investigate the impact of ignoring spatial correlation on prediction performance. We discuss the limitations of these models and outline future directions to develop flexible models that can incorporate potential heterogeneity in the dependence structure of spatial longitudinal data.

Description
Keywords
Longitudinal data, Heterogeneity, Random effects, Cholesky decomposition, Spatial longitudinal data, D-vine, Copula, Measurement error, Missing data, Functional principal component analysis, Spatio-temporal modeling
Citation