Skip navigation
  • Home
  • Browse
    • Communities
      & Collections
    • Browse Items by:
    • Publication Date
    • Author
    • Title
    • Subject
    • Department
  • Sign on to:
    • My MacSphere
    • Receive email
      updates
    • Edit Profile


McMaster University Home Page
  1. MacSphere
  2. Open Access Dissertations and Theses Community
  3. Open Access Dissertations and Theses
Please use this identifier to cite or link to this item: http://hdl.handle.net/11375/12779
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorGolding, G. B.en_US
dc.contributor.authorLou, Melanieen_US
dc.date.accessioned2014-06-18T17:00:43Z-
dc.date.available2014-06-18T17:00:43Z-
dc.date.created2012-12-19en_US
dc.date.issued2013-04en_US
dc.identifier.otheropendissertations/7637en_US
dc.identifier.other8698en_US
dc.identifier.other3551744en_US
dc.identifier.urihttp://hdl.handle.net/11375/12779-
dc.description.abstract<p>This work investigates the assignment of unknown sequences to their species of origin. In particular, I examine four questions: Is existing (GenBank) data reliable for accurate species identification? Does a segregating sites algorithm make accurate species identifications and how does it compare to another Bayesian method? Does broad sampling of reference species improve the information content of reference data? And does an extended model (of the theory of segregating sites) describe the genetic variation in a set of sequences (of a species or population) better? Though we did not find unusually similar between-species sequences in GenBank, there was evidence of unusually divergent within-species sequences, suggesting that caution and a firm understanding of GenBank species should be exercised before utilizing GenBank data. To address challenging identifications resulting from an overlap between within- and between species variation, we introduced a Bayesian treeless statistical assignment method that makes use of segregating sites. Assignments with simulated and <em>Drosophila</em> (fruit fly) sequences show that this method can provide fast, high probability assignments for recently diverged species. To address reference sequences with low information content, the addition of even one broadly sampled reference sequence can increase the number of correct assignments. Finally, an extended theory of segregating sites generates more realistic probability estimates of the genetic variability of a set of sequences. Species are dynamic entities and this work will highlight ideas and methods to address dynamic genetic patterns in species.</p>en_US
dc.subjectTheory of segregating sitesen_US
dc.subjectBayesian theoryen_US
dc.subjectSpecies identificationen_US
dc.subjectMitochondrial DNAen_US
dc.subjectDNA barcodingen_US
dc.subjectTreeless statistical assignment methoden_US
dc.subjectComputational Biologyen_US
dc.subjectEcology and Evolutionary Biologyen_US
dc.subjectEvolutionen_US
dc.subjectGenetics and Genomicsen_US
dc.subjectPopulation Biologyen_US
dc.subjectComputational Biologyen_US
dc.titleImproving specimen identification: Informative DNA using a statistical Bayesian methoden_US
dc.typethesisen_US
dc.contributor.departmentBiologyen_US
dc.description.degreeDoctor of Philosophy (PhD)en_US
Appears in Collections:Open Access Dissertations and Theses

Files in This Item:
File SizeFormat 
fulltext.pdf
Open Access
569.31 kBAdobe PDFView/Open
Show simple item record Statistics


Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.

Sherman Centre for Digital Scholarship     McMaster University Libraries
©2022 McMaster University, 1280 Main Street West, Hamilton, Ontario L8S 4L8 | 905-525-9140 | Contact Us | Terms of Use & Privacy Policy | Feedback

Report Accessibility Issue