Skip navigation
  • Home
  • Browse
    • Communities
      & Collections
    • Browse Items by:
    • Publication Date
    • Author
    • Title
    • Subject
    • Department
  • Sign on to:
    • My MacSphere
    • Receive email
      updates
    • Edit Profile


McMaster University Home Page
  1. MacSphere
  2. Open Access Dissertations and Theses Community
  3. Digitized Open Access Dissertations and Theses
Please use this identifier to cite or link to this item: http://hdl.handle.net/11375/22779
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorEsterby, Sylvia-
dc.contributor.authorLi, Fang-
dc.date.accessioned2018-05-01T20:10:22Z-
dc.date.available2018-05-01T20:10:22Z-
dc.date.issued2005-09-
dc.identifier.urihttp://hdl.handle.net/11375/22779-
dc.description.abstractDNA microarray technology makes it possible to analyze the expression levels of many thousands of genes simultaneously. One of the goals of microarray data analysis is to understand the multiple biological roles of genes and their interactions in complex biological processes. Genes with similar expression patterns are likely to share similar functions or biological processes. Therefore, analysis of changes in gene expression of a certain biological processes over time is of particular interest. Unsupervised clustering methods provide an efficient way of finding overall patterns and tendencies by clustering microarray gene expression data. The genes in the same cluster are regulated in a similar manner based on the assumption above. But traditional unsupervised clustering methods usually end up with clusters of genes with similar expression patterns but without interpretations describing the clusters in terms of gene functions or processes involved. In this project, some statistical techniques are applied to analyze the data set from microarray experiments of sporulation in yeast. These techniques include LOWESS data normalization, which is intended to remove the systematic variations from the data; a partitional clustering method, K-means, is used with initial centroids obtained from hierarchical clustering method of DIANA; the "gap statistic" technique is implemented to estimate the "optimal" number of clusters in the data set; and finally multiple hypothesis testing is used to determine whether biologically related genes are statistically over-represented in the gene clusters using the web query tool FatiGO. These methods are combined with graphical representation of cluster profile shape and colour maps of up and down regulation via heat maps. Application of these methods to a yeast sporulation time-course data set [Chu 𝘦𝘵 𝘢𝘭. 1998] demonstrates the utility of cluster analysis to such data sets and provides an automated method for including biological information about gene function and characteristics.en_US
dc.language.isoenen_US
dc.subjectgeneen_US
dc.subjecttime-courseen_US
dc.subjectdataen_US
dc.titleGene Expression Analysis for Time-Course Microarray Dataen_US
dc.typeThesisen_US
dc.contributor.departmentStatisticsen_US
dc.description.degreetypeThesisen_US
dc.description.degreeMaster of Science (MSc)en_US
Appears in Collections:Digitized Open Access Dissertations and Theses

Files in This Item:
File Description SizeFormat 
li_fang_2005Sept_masters.pdf
Open Access
6.34 MBAdobe PDFView/Open
Show simple item record Statistics


Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.

Sherman Centre for Digital Scholarship     McMaster University Libraries
©2022 McMaster University, 1280 Main Street West, Hamilton, Ontario L8S 4L8 | 905-525-9140 | Contact Us | Terms of Use & Privacy Policy | Feedback

Report Accessibility Issue