Please use this identifier to cite or link to this item:
http://hdl.handle.net/11375/27069
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Canty, Angelo | - |
dc.contributor.advisor | Davies, Katherine | - |
dc.contributor.author | Lai, Winfield | - |
dc.date.accessioned | 2021-10-18T17:25:00Z | - |
dc.date.available | 2021-10-18T17:25:00Z | - |
dc.date.issued | 2021 | - |
dc.identifier.uri | http://hdl.handle.net/11375/27069 | - |
dc.description.abstract | Single nucleotide polymorphisms (SNPs) have been involved in describing the risk a person is at for developing diseases. Simulating a collection of d correlated autosomal biallelic SNPs is useful to acquire empirical results for statistical tests in settings such as having a low sample size. A collection of d correlated autosomal biallelic SNPs can be modeled as a random vector X = (X1,...,Xd) where Xi ∼ binomial(2, pi) and pi is the minor allele frequency for the ith SNP. The pairwise correlations between components of X can be specified by a d ×d symmetric positive definite correlation matrix having all diagonal entries equal to one. Two versions of a novel method to simulate X are developed in this thesis; one version is based on generating correlated binomials directly and the other is based on generating correlated Bernoulli random vectors and summing them component wise. Two existing methods to simulate X are also discussed and implemented. In particular, a method involving the multivariate normal by Madsen and Birkes (2013) is compared to our novel methods for d ≥ 3. Our novel binomial method has a different variance for the Fisher transformed sample correlation than the other two methods. Overall, if the target pairwise correlations are smaller than the lowest upper bound possible and the number of SNPs is low, then our novel Bernoulli method works the best since it is faster than the Madsen and Birkes method and has comparable variability and bias for sample correlation. | en_US |
dc.language.iso | en | en_US |
dc.subject | Statistics | en_US |
dc.subject | Binomial | en_US |
dc.subject | Correlation | en_US |
dc.subject | Simulation | en_US |
dc.subject | Multivariate | en_US |
dc.title | Methods to Simulate Correlated Binomial Random Variables | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Mathematics and Statistics | en_US |
dc.description.degreetype | Thesis | en_US |
dc.description.degree | Master of Science (MSc) | en_US |
Appears in Collections: | Open Access Dissertations and Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Lai_Winfield_2021September_MScStatistics.pdf | Thesis file | 689.74 kB | Adobe PDF | View/Open |
Lai_Winfield_2021September_MScStatistics_RCode.txt | R code found in the appendix of the thesis | 36.43 kB | Text | View/Open |
Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.