Welcome to the upgraded MacSphere! We're putting the finishing touches on it; if you notice anything amiss, email macsphere@mcmaster.ca

Unsupervised Classification for Skewed and Mixed-Type Data

dc.contributor.advisorMcNicholas, Paul D
dc.contributor.authorAlamer, Eman Mohammed S
dc.contributor.departmentMathematics and Statisticsen_US
dc.date.accessioned2023-01-03T14:36:14Z
dc.date.available2023-01-03T14:36:14Z
dc.date.issued2022
dc.description.abstractClustering, also known as unsupervised classification, is a foundational machine learning technique and is used to find underlying group structures in data. There are many well-established model-based techniques to analyze either categorical or continuous data in the clustering paradigm. However, there is a relative paucity of work for mixed-type data, especially mixed data where the continuous variables exhibit skewness and heavy tails. In this thesis, different methodologies and models are presented for analyzing asymmetric and mixed-typed data. The first method is a mixture model for analyzing asymmetric mixed-type data. The second is modelling contaminated mixed-type data and identifying potential outliers. Lastly, model averaging techniques are developed for skewed-data based on Occam’s window and parsimonious mixture models. The expectation-maximization algorithm is used here to estimate the model parameters. Both real and simulated data are used for illustration.en_US
dc.description.degreeDoctor of Science (PhD)en_US
dc.description.degreetypeThesisen_US
dc.identifier.urihttp://hdl.handle.net/11375/28188
dc.language.isoen_USen_US
dc.titleUnsupervised Classification for Skewed and Mixed-Type Dataen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Alamer_Eman_M.S._202212_PhD.pdf
Size:
6.99 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: