
Methods to Simulate Correlated Binomial Random Variables

dc.contributor.advisor: Canty, Angelo
dc.contributor.advisor: Davies, Katherine
dc.contributor.author: Lai, Winfield
dc.contributor.department: Mathematics and Statistics [en_US]
dc.date.accessioned: 2021-10-18T17:25:00Z
dc.date.available: 2021-10-18T17:25:00Z
dc.date.issued: 2021
dc.description.abstract: Single nucleotide polymorphisms (SNPs) have been used to describe a person's risk of developing diseases. Simulating a collection of d correlated autosomal biallelic SNPs is useful for obtaining empirical results for statistical tests in settings such as small sample sizes. Such a collection can be modeled as a random vector X = (X1, ..., Xd), where Xi ∼ binomial(2, pi) and pi is the minor allele frequency of the ith SNP. The pairwise correlations between components of X can be specified by a d × d symmetric positive definite correlation matrix with all diagonal entries equal to one. Two versions of a novel method to simulate X are developed in this thesis: one generates correlated binomials directly, and the other generates correlated Bernoulli random vectors and sums them componentwise. Two existing methods to simulate X are also discussed and implemented. In particular, a method based on the multivariate normal distribution by Madsen and Birkes (2013) is compared with our novel methods for d ≥ 3. Our novel binomial method has a different variance for the Fisher-transformed sample correlation than the other two methods. Overall, if the target pairwise correlations are smaller than the lowest upper bound possible and the number of SNPs is low, then our novel Bernoulli method works best, since it is faster than the Madsen and Birkes method and has comparable variability and bias in the sample correlation. [en_US]
dc.description.degree: Master of Science (MSc) [en_US]
dc.description.degreetype: Thesis [en_US]
dc.identifier.uri: http://hdl.handle.net/11375/27069
dc.language.iso: en [en_US]
dc.subject: Statistics [en_US]
dc.subject: Binomial [en_US]
dc.subject: Correlation [en_US]
dc.subject: Simulation [en_US]
dc.subject: Multivariate [en_US]
dc.title: Methods to Simulate Correlated Binomial Random Variables [en_US]
dc.type: Thesis [en_US]
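
The abstract describes building each binomial(2, pi) component by summing correlated Bernoulli vectors componentwise. The following is a minimal Python sketch of that general idea for the bivariate case (d = 2) only. It is not the thesis's algorithm or its R code; the function names and the construction of the joint Bernoulli distribution from a target correlation are assumptions made here for illustration. It relies on two standard facts: a pair of Bernoulli variables with margins p1, p2 and correlation rho has P(B1 = 1, B2 = 1) = p1 p2 + rho sqrt(p1 (1 - p1) p2 (1 - p2)), and summing two independent copies of the pair leaves the correlation unchanged.

```python
import math
import random


def bernoulli_pair_pmf(p1, p2, rho):
    """Joint pmf of (B1, B2) with Bernoulli(p1), Bernoulli(p2) margins and correlation rho.

    Hypothetical helper for illustration; raises ValueError when rho is not
    attainable for the given margins (the Frechet bounds on P(B1=1, B2=1)).
    """
    p11 = p1 * p2 + rho * math.sqrt(p1 * (1 - p1) * p2 * (1 - p2))
    lower, upper = max(0.0, p1 + p2 - 1.0), min(p1, p2)
    if not (lower <= p11 <= upper):
        raise ValueError("rho is outside the feasible range for these margins")
    return {
        (1, 1): p11,
        (1, 0): p1 - p11,
        (0, 1): p2 - p11,
        (0, 0): 1.0 - p1 - p2 + p11,
    }


def simulate_binomial2_pair(p1, p2, rho, n, rng):
    """Draw n pairs (X1, X2) with Xi ~ binomial(2, pi) and correlation rho.

    Each pair is the componentwise sum of two independent correlated
    Bernoulli pairs.
    """
    pmf = bernoulli_pair_pmf(p1, p2, rho)
    outcomes = list(pmf)
    weights = [pmf[o] for o in outcomes]
    draws = rng.choices(outcomes, weights=weights, k=2 * n)
    return [
        (draws[2 * i][0] + draws[2 * i + 1][0],
         draws[2 * i][1] + draws[2 * i + 1][1])
        for i in range(n)
    ]


def sample_correlation(pairs):
    """Pearson sample correlation of a list of (x, y) pairs."""
    n = len(pairs)
    mx = sum(x for x, _ in pairs) / n
    my = sum(y for _, y in pairs) / n
    sxy = sum((x - mx) * (y - my) for x, y in pairs)
    sxx = sum((x - mx) ** 2 for x, _ in pairs)
    syy = sum((y - my) ** 2 for _, y in pairs)
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    rng = random.Random(2021)  # arbitrary seed, for reproducibility only
    sample = simulate_binomial2_pair(0.3, 0.4, 0.5, 10_000, rng)
    print("sample correlation:", sample_correlation(sample))
```

The feasibility check reflects the abstract's remark that target correlations are limited by an upper bound: for fixed margins, only some values of rho yield a valid joint distribution. Extending this sketch to d ≥ 3 requires a full joint distribution over d Bernoulli components, which this example does not attempt.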

Files

Original bundle

Name: Lai_Winfield_2021September_MScStatistics.pdf
Size: 689.74 KB
Format: Adobe Portable Document Format
Description: Thesis file
Name: Lai_Winfield_2021September_MScStatistics_RCode.txt
Size: 36.43 KB
Format: Plain Text
Description: R code found in the appendix of the thesis

License bundle

Name: license.txt
Size: 1.68 KB
Format: Item-specific license agreed upon to submission
Description: