Completeness of rheumatoid arthritis prevalence estimates from administrative health data: comparison of capture-recapture models

Nie, Yao

dc.contributor.author	Nie, Yao
dc.contributor.examiningcommittee	Jiang, Depeng (Community Health Sciences); Muhajarine, Nazeem (University of Saskatchewan); Shiff, Natalie (University of Saskatchewan)	en_US
dc.contributor.supervisor	Lix, Lisa (Community Health Sciences)	en_US
dc.date.accessioned	2014-07-03T14:37:31Z
dc.date.available	2014-07-03T14:37:31Z
dc.date.issued	2014-07-03
dc.degree.discipline	Community Health Sciences	en_US
dc.degree.level	Master of Science (M.Sc.)	en_US
dc.description.abstract	Rheumatoid arthritis (RA) is a chronic disease characterized by an overactive immune system and joint inflammation. Population-based administrative health data (AHD) are widely used for RA outcomes research and surveillance. However, AHD may not capture all cases of RA in the population. Capture-recapture (CR) methods have been proposed to describe the completeness of AHD for estimating disease population size, but AHD may not conform to the assumptions that underlie CR models. A Monte Carlo simulation study was used to investigate the effects of violating two assumptions of two-source CR methods: independence between data sources and homogeneity of capture probabilities. We compared the Chapman estimator and an estimator based on the multinomial logistic regression model (MLRM) in terms of relative bias (RB), coverage probability (CP) of 95% confidence intervals, width of 95% confidence intervals (WCI), and root mean square error (RMSE) of prevalence estimates. The effects of misspecification of the MLRM were also investigated. In addition, the Chapman and MLRM estimators were used to estimate RA prevalence using AHD from Saskatchewan, Canada. CR methods consistently underestimated population size when the assumptions were violated. Population size estimates from the two estimators did not differ substantially, except in their RMSE values. Parameter estimates became biased when the MLRM was misspecified, but there was little impact on population size estimates. In conclusion, CR methods are recommended to reduce bias in prevalence estimates based on AHD. Because these methods may be sensitive to assumption violations, researchers should consider potential dependence between data sources. In addition, sufficient overlap in the cases captured by each data source (e.g., 50% of the cases are captured by both data sources) or balanced capture probability in each data source is needed to implement these methods effectively. Researchers who estimate population size using CR methods in AHD should favour the MLRM estimator over the Chapman estimator.	en_US
dc.description.note	October 2014	en_US
dc.identifier.uri	http://hdl.handle.net/1993/23679
dc.language.iso	eng	en_US
dc.rights	open access	en_US
dc.subject	Capture-Recapture Models	en_US
dc.subject	Monte Carlo Simulation	en_US
dc.subject	Prevalence	en_US
dc.subject	Rheumatoid Arthritis	en_US
dc.title	Completeness of rheumatoid arthritis prevalence estimates from administrative health data: comparison of capture-recapture models	en_US
dc.type	master thesis	en_US
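
For readers unfamiliar with the two-source Chapman estimator named in the abstract, the following is a minimal sketch of its standard textbook form, not the thesis's code. The function name, the example counts, and the choice of a normal-approximation (Wald) 95% interval are assumptions made for illustration; this record does not state how the thesis constructed its intervals.

```python
import math


def chapman_estimate(n1, n2, m, z=1.96):
    """Two-source Chapman estimate of population size.

    n1, n2 : number of cases captured by source 1 and source 2
    m      : number of cases captured by both sources
    z      : normal quantile for the interval (1.96 for roughly 95%)

    Returns (n_hat, lower, upper). The Wald interval is an illustrative
    assumption, not necessarily the interval used in the thesis.
    """
    if m < 0 or m > min(n1, n2):
        raise ValueError("m must lie between 0 and min(n1, n2)")

    # Chapman's bias-corrected form of the Lincoln-Petersen estimator.
    n_hat = (n1 + 1) * (n2 + 1) / (m + 1) - 1

    # Commonly cited variance estimate for the Chapman estimator.
    var = ((n1 + 1) * (n2 + 1) * (n1 - m) * (n2 - m)
           / ((m + 1) ** 2 * (m + 2)))

    half_width = z * math.sqrt(var)
    return n_hat, n_hat - half_width, n_hat + half_width


if __name__ == "__main__":
    # Hypothetical counts, chosen only to exercise the function.
    n1, n2, m = 900, 700, 450
    n_hat, lower, upper = chapman_estimate(n1, n2, m)
    observed = n1 + n2 - m  # distinct cases seen in at least one source
    print(f"estimated population size: {n_hat:.1f} ({lower:.1f}, {upper:.1f})")
    print(f"estimated completeness of the combined sources: {observed / n_hat:.3f}")
```

The estimator assumes the two sources are independent and that every case has the same capture probability within a source, which are the two assumptions whose violation the abstract describes.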

Files

Original bundle

Name: Yao_Nie.pdf
Size: 997.2 KB
Format: Adobe Portable Document Format

Collections

FGS - Electronic Theses and Practica