Multivariate longitudinal data clustering with a copula kernel mixture model
| dc.contributor.advisor | McNicholas, Paul | |
| dc.contributor.advisor | Murphy, Orla | |
| dc.contributor.author | Zhang, Xi | |
| dc.contributor.department | Mathematics and Statistics | en_US |
| dc.date.accessioned | 2024-10-11T19:40:12Z | |
| dc.date.available | 2024-10-11T19:40:12Z | |
| dc.date.issued | 2024 | |
| dc.description.abstract | Many common clustering methods cannot be used for clustering multivariate longitudinal data when the covariance of random variables is a function of the time points. For this reason, a copula kernel mixture model (CKMM) is proposed for clustering such data. The CKMM is a finite mixture model that decomposes each mixture component’s joint density function into a copula and marginal distribution functions, where a Gaussian copula is used for its mathematical traceability. This thesis considers three scenarios: first, the CKMM is developed for balanced multivariate longitudinal data with known eigenfunctions; second, the CKMM is used to fit unbalanced data where trajectories are aligned on the time axis, and eigenfunctions are unknown; and lastly, a dynamic CKMM (DCKMM) is applied to unbalanced data where trajectories are misaligned, and eigenfunctions are unknown. Expectation-maximization type algorithms are used for parameter estimation. The performance of CKMM is demonstrated on both simulated and real data. | en_US |
| dc.description.degree | Candidate in Philosophy | en_US |
| dc.description.degreetype | Thesis | en_US |
| dc.identifier.uri | http://hdl.handle.net/11375/30410 | |
| dc.language.iso | en | en_US |
| dc.subject | model-based clustering | en_US |
| dc.subject | longitudinal data clustering | en_US |
| dc.title | Multivariate longitudinal data clustering with a copula kernel mixture model | en_US |
| dc.type | Thesis | en_US |