Please use this identifier to cite or link to this item:
http://hdl.handle.net/11375/26203
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Canty, Angelo | - |
dc.contributor.author | Yacas, Clifford | - |
dc.date.accessioned | 2021-02-12T14:36:08Z | - |
dc.date.available | 2021-02-12T14:36:08Z | - |
dc.date.issued | 2021 | - |
dc.identifier.uri | http://hdl.handle.net/11375/26203 | - |
dc.description.abstract | DNA methylation plays a key role in disease analysis, especially for studies that compare known large scale differences in CpG sites, such as cancer/normal studies or between-tissues studies. However, before any analysis can be done, data normalization and preprocessing of methylation data are required. A useful data preprocessing pipeline for large scale comparisons is Functional Normalization (FunNorm), (Fortin et al., 2014) implemented in the minfi package in R. In FunNorm, the univariate quantiles of the methylated and unmethylated signal values in the raw data are used to preprocess the data. However, although FunNorm has been shown to outperform other preprocessing and data normalization processes for these types of studies, it does not account for the correlation between the methylated and unmethylated signals into account; the focus of this paper is to improve upon FunNorm by taking this correlation into account. The concept of a bivariate quantile is used in this study as an attempt to take the correlation between the methylated and unmethylated signals into consideration. From the bivariate quantiles found, the partial least squares method is then used on these quantiles in this preprocessing. The raw datasets used for this research were collected from the European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI) website. The results from this preprocessing algorithm were then compared and contrasted to the results from FunNorm. Drawbacks, limitations and future research are then discussed. | en_US |
dc.language.iso | en | en_US |
dc.subject | methylation | en_US |
dc.subject | methylation data | en_US |
dc.subject | partial least squares | en_US |
dc.subject | bivariate | en_US |
dc.subject | bivariate quantile | en_US |
dc.subject | applied statistics | en_US |
dc.subject | preprocessing | en_US |
dc.subject | normalization | en_US |
dc.subject | machine learning | en_US |
dc.title | Bivariate Functional Normalization of Methylation Array Data | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Mathematics and Statistics | en_US |
dc.description.degreetype | Thesis | en_US |
dc.description.degree | Master of Science (MSc) | en_US |
Appears in Collections: | Open Access Dissertations and Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Thesis Methodology.txt | 34.24 kB | Text | View/Open | |
Yacas_Clifford_Thesis.pdf | 1.99 MB | Adobe PDF | View/Open |
Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.