Please use this identifier to cite or link to this item:
http://hdl.handle.net/11375/27364
Title: | On Clustering Comparisons Using Data From a Seroprevalence Study |
Authors: | Grewal, Chandra |
Advisor: | McNicholas, Paul |
Department: | Mathematics and Statistics |
Publication Date: | 2021 |
Abstract: | Various longitudinal clustering approaches are discussed and compared on an application to a seroprevalence study. The data contains information about the behaviours of individuals throughout the course of the COVID-19 pandemic. First, a review of the various longitudinal clustering methods compared throughout this thesis is discussed. Longitudinal k-means, growth mixture models, latent class growth analysis and a two-step approach involving growth curve models and k-means are reviewed. Longitudinal model-based clustering based on a modified Cholesky decomposition of a Gaussian mixture and Gaussian linear means are also reviewed. The BIC is used as the primary criterion to determine the number of components, and the ARI is used to determine cluster similarity between models. The various clustering approaches are then compared as they attempt to identify gathering patterns within the population of the seroprevalence dataset. |
URI: | http://hdl.handle.net/11375/27364 |
Appears in Collections: | Open Access Dissertations and Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Grewal_Chandra_2021Dec_Msc.pdf | 2.1 MB | Adobe PDF | View/Open |
Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.