Mixture model analysis with rank-based samples

dc.contributor.authorHatefi, Armin
dc.contributor.examiningcommitteeLeblanc, Alexandre (Statistics) Wang, Liqun (Statistics) Pawlak, Miroslaw (Electrical & Computer Engineering) Nagaraja, Haikady N. (Biostatistics, Ohio State University)en_US
dc.contributor.supervisorJafari Jozani, Mohammad (Statistics)en_US
dc.date.accessioned2014-08-22T00:45:53Z
dc.date.available2014-08-22T00:45:53Z
dc.date.issued2014en_US
dc.date.issued2013en_US
dc.degree.disciplineStatisticsen_US
dc.degree.levelDoctor of Philosophy (Ph.D.)en_US
dc.description.abstractSimple random sampling (SRS) is the most commonly used sampling design in data collection. In many applications (e.g., in fisheries and medical research) quantification of the variable of interest is either time-consuming or expensive but ranking a number of sampling units, without actual measurement on them, can be done relatively easy and at low cost. In these situations, one may use rank-based sampling (RBS) designs to obtain more representative samples from the underlying population and improve the efficiency of the statistical inference. In this thesis, we study the theory and application of the finite mixture models (FMMs) under RBS designs. In Chapter 2, we study the problems of Maximum Likelihood (ML) estimation and classification in a general class of FMMs under different ranked set sampling (RSS) designs. In Chapter 3, deriving Fisher information (FI) content of different RSS data structures including complete and incomplete RSS data, we show that the FI contained in each variation of the RSS data about different features of FMMs is larger than the FI contained in their SRS counterparts. There are situations where it is difficult to rank all the sampling units in a set with high confidence. Forcing rankers to assign unique ranks to the units (as RSS) can lead to substantial ranking error and consequently to poor statistical inference. We hence focus on the partially rank-ordered set (PROS) sampling design, which is aimed at reducing the ranking error and the burden on rankers by allowing them to declare ties (partially ordered subsets) among the sampling units. Studying the information and uncertainty structures of the PROS data in a general class of distributions, in Chapter 4, we show the superiority of the PROS design in data analysis over RSS and SRS schemes. In Chapter 5, we also investigate the ML estimation and classification problems of FMMs under the PROS design. Finally, we apply our results to estimate the age structure of a short-lived fish species based on the length frequency data, using SRS, RSS and PROS designs.en_US
dc.description.noteOctober 2014en_US
dc.identifier.citationHatefi, A., Jafari Jozani, M. and Ziou, D. (2014). Estimation and classification for finite mixture models under ranked set sampling. Statistica Sinica, 24, 675--698.en_US
dc.identifier.citationHatefi , A. and Jafari Jozani, M. (2013). Fisher Information in different types of perfect and imperfect ranked set samples from finite mixture models. Journal of Multivariate Analysis, 119, 16--31.en_US
dc.identifier.urihttp://hdl.handle.net/1993/23849
dc.language.isoengen_US
dc.publisherStatistica Sinicaen_US
dc.publisherJournal of Multivariate Analysisen_US
dc.rightsopen accessen_US
dc.subjectFinite mixture modelsen_US
dc.subjectRanked set samplingen_US
dc.subjectPartial rankingen_US
dc.subjectLatent variablesen_US
dc.subjectExpectation-Maximization algorithmen_US
dc.subjectClassificationen_US
dc.subjectFisher informationen_US
dc.subjectEntropyen_US
dc.subjectAge structures of Spot fishen_US
dc.titleMixture model analysis with rank-based samplesen_US
dc.typedoctoral thesisen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Hatefi_Armin.pdf
Size:
1.35 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.25 KB
Format:
Item-specific license agreed to upon submission
Description: