Skip navigation
  • Home
  • Browse
    • Communities
      & Collections
    • Browse Items by:
    • Publication Date
    • Author
    • Title
    • Subject
    • Department
  • Sign on to:
    • My MacSphere
    • Receive email
      updates
    • Edit Profile


McMaster University Home Page
  1. MacSphere
  2. Open Access Dissertations and Theses Community
  3. Open Access Dissertations and Theses
Please use this identifier to cite or link to this item: http://hdl.handle.net/11375/26203
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorCanty, Angelo-
dc.contributor.authorYacas, Clifford-
dc.date.accessioned2021-02-12T14:36:08Z-
dc.date.available2021-02-12T14:36:08Z-
dc.date.issued2021-
dc.identifier.urihttp://hdl.handle.net/11375/26203-
dc.description.abstractDNA methylation plays a key role in disease analysis, especially for studies that compare known large scale differences in CpG sites, such as cancer/normal studies or between-tissues studies. However, before any analysis can be done, data normalization and preprocessing of methylation data are required. A useful data preprocessing pipeline for large scale comparisons is Functional Normalization (FunNorm), (Fortin et al., 2014) implemented in the minfi package in R. In FunNorm, the univariate quantiles of the methylated and unmethylated signal values in the raw data are used to preprocess the data. However, although FunNorm has been shown to outperform other preprocessing and data normalization processes for these types of studies, it does not account for the correlation between the methylated and unmethylated signals into account; the focus of this paper is to improve upon FunNorm by taking this correlation into account. The concept of a bivariate quantile is used in this study as an attempt to take the correlation between the methylated and unmethylated signals into consideration. From the bivariate quantiles found, the partial least squares method is then used on these quantiles in this preprocessing. The raw datasets used for this research were collected from the European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI) website. The results from this preprocessing algorithm were then compared and contrasted to the results from FunNorm. Drawbacks, limitations and future research are then discussed.en_US
dc.language.isoenen_US
dc.subjectmethylationen_US
dc.subjectmethylation dataen_US
dc.subjectpartial least squaresen_US
dc.subjectbivariateen_US
dc.subjectbivariate quantileen_US
dc.subjectapplied statisticsen_US
dc.subjectpreprocessingen_US
dc.subjectnormalizationen_US
dc.subjectmachine learningen_US
dc.titleBivariate Functional Normalization of Methylation Array Dataen_US
dc.typeThesisen_US
dc.contributor.departmentMathematics and Statisticsen_US
dc.description.degreetypeThesisen_US
dc.description.degreeMaster of Science (MSc)en_US
Appears in Collections:Open Access Dissertations and Theses

Files in This Item:
File Description SizeFormat 
Thesis Methodology.txt
Open Access
34.24 kBTextView/Open
Yacas_Clifford_Thesis.pdf
Open Access
1.99 MBAdobe PDFView/Open
Show simple item record Statistics


Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.

Sherman Centre for Digital Scholarship     McMaster University Libraries
©2022 McMaster University, 1280 Main Street West, Hamilton, Ontario L8S 4L8 | 905-525-9140 | Contact Us | Terms of Use & Privacy Policy | Feedback

Report Accessibility Issue