Welcome to the upgraded MacSphere! We're putting the finishing touches on it; if you notice anything amiss, email macsphere@mcmaster.ca

MonsterLM: A method to estimate the variance explained by genome-wide interactions with environmental factors

dc.contributor.advisorPare, Guillaume
dc.contributor.authorKhan, Mohammad
dc.contributor.departmentStatisticsen_US
dc.date.accessioned2020-10-09T17:55:44Z
dc.date.available2020-10-09T17:55:44Z
dc.date.issued2020
dc.description.abstractEstimations of heritability and variance explained due to environmental exposures and interaction effects help in understanding complex diseases. Current methods to detect such interactions rely on variance component methods. These methods have been neces- sary due to the m » n problem, where the number of predictors (m) vastly outnumbers the number of observations (n). These methods are all computationally intensive, which is further exacerbated when considering gene-environment interactions, as the number of predictors increases from m to 2m+1 in the case of a single environmental exposure. Novel methods are thus needed to enable fast and unbiased calculations of the variance explained (R2) for gene-environment interactions in very large samples on multiple traits. Taking advantage of the large number of participants in contemporary genetic studies, we herein propose a novel method for continuous trait R2 estimates that are up to 20 times faster than current methods. We have devised a novel method, monsterlm, that enables multiple linear regression on large regions encompassing tens of thousands of variants in hundreds of thousands of participants. We tested monsterlm with simulations using real genotypes from the UK Biobank. During simulations we verified the properties of monsterlm to estimate the variance explained by interaction terms. Our preliminary results showcase potential interactions between blood biochemistry biomarkers such as HbA1c, Triglycerides and ApoB with an environmental factor relating to obesity-related lifestyle factor: Waist-hip Ratio (WHR). We further investigate these results to reveal that more than 50% of the interaction variance calculated can be attributed to ∼5% of the single-nucleotide polymorphisms (SNPs) interacting with the environmental trait. Lastly, we showcase the impact of interactions on improving polygenic risk scores.en_US
dc.description.degreeMaster of Science (MSc)en_US
dc.description.degreetypeThesisen_US
dc.identifier.urihttp://hdl.handle.net/11375/25888
dc.language.isoenen_US
dc.subjectStatistical Genetics, Linear Model, SNPs, Gene-Environment Interactionsen_US
dc.titleMonsterLM: A method to estimate the variance explained by genome-wide interactions with environmental factorsen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Khan_Mohammad_A_2020September_MScStatistics.pdf
Size:
2.25 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: