Welcome to the upgraded MacSphere! We're putting the finishing touches on it; if you notice anything amiss, email macsphere@mcmaster.ca

On Clustering: Mixture Model Averaging with the Generalized Hyperbolic Distribution

dc.contributor.advisorMcNicholas, Paul
dc.contributor.authorRicciuti, Sarah
dc.contributor.departmentStatisticsen_US
dc.date.accessioned2017-10-12T11:52:49Z
dc.date.available2017-10-12T11:52:49Z
dc.date.issued2017-11
dc.description.abstractCluster analysis is commonly described as the classification of unlabeled observations into groups such that they are more similar to one another than to observations in other groups. Model-based clustering assumes that the data arise from a statistical (mixture) model and typically a group of many models are fit to the data, from which the `best' model is selected by a model selection criterion (often the BIC in mixture model applications). This chosen model is then the only model that is used for making inferences on the data. Although this is common practice, proceeding in this way ignores a large component of model selection uncertainty, especially for situations where the difference between the model selection criterion for two competing models is relatively insignificant. For this reason, recent interest has been placed on selecting a subset of models that are close to the selected best model and using a weighted averaging approach to incorporate information from multiple models in this set. Model averaging is not a novel approach, yet its presence in a clustering framework is minimal. Here, we use Occam's window to select a subset of models eligible for two types of averaging techniques: averaging a posteriori probabilities, and direct averaging of model parameters. The efficacy of these model-based averaging approaches is demonstrated for a family of generalized hyperbolic mixture models using real and simulated data.en_US
dc.description.degreeMaster of Science (MSc)en_US
dc.description.degreetypeThesisen_US
dc.identifier.urihttp://hdl.handle.net/11375/22147
dc.language.isoenen_US
dc.subjectclusteringen_US
dc.subjectfinite mixture modelen_US
dc.subjectmodel averagingen_US
dc.subjectgeneralized hyperbolic distributionen_US
dc.subjectOccam's windowen_US
dc.subjectBayesian model averagingen_US
dc.subjectStatisticsen_US
dc.titleOn Clustering: Mixture Model Averaging with the Generalized Hyperbolic Distributionen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ricciuti_Sarah_K_2017Sept_MSc.pdf
Size:
3.46 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: