Identification of Environmental Alphaproteobacteria with Conserved Signature Proteins in Metagenomic Datasets

dc.contributor.advisorSchellhorn, Herb E.en_US
dc.contributor.advisorGupta, Radhey S.en_US
dc.contributor.advisorIgdoura, Suleiman A.en_US
dc.contributor.authorYao, Quanen_US
dc.contributor.departmentBiologyen_US
dc.date.accessioned2014-06-18T21:13:39Z
dc.date.created2013-12-21en_US
dc.date.embargo2014-12-21
dc.date.embargoset2014-12-21en_US
dc.date.issued2014-04en_US
dc.description.abstract<p>Microbial metagenomics is the exploration of the taxonomic diversity of microbial communities in environmental habitats using large, exhaustive DNA sequence datasets. However, due to inherent limitations of sequencing technology and the complexity of environmental genomes, current analytical approaches do not reveal the existence of all microbes that may be present. In this study, a new classification approach is proposed, based on unique proteins that are specific for different clades of Alphaproteobacteria, to predict the presence or absence of species from these groups of bacteria in published metagenomic datasets. In this work, 264 previously identified, published conserved signature proteins (CSPs) characteristic of individual taxonomic clades of Alphaproteobacteria are used as probes to detect the presence of bacteria in metagenomic datasets. Although public genome sequence information has increased manifold since these CSPs were initially identified 6 years ago, results indicate that nearly all of these CSPs (259 of 265) remain specific for their previously characterized clades. Furthermore, they are confirmed to be present in the newly identified and sequenced members of these clades. In view of their specificity and predictive ability in different monophyletic clades of Alphaproteobacteria, the sequences of these CSPs provide reliable probes to determine the presence or absence of these Alphaproteobacteria in metagenomic datasets. In this work, CSPs are used to assess the diversity of Alphaproteobacteria in 10 published metagenomic datasets (bioreactor, compost, wastewater, activated sludge, groundwater, freshwater sediment, microbial mat, marine, hydrothermal vent and whale fall metagenomes), which cover diverse environments and ecosystems.
The results indicate that BLAST searches with these CSPs can be used to efficiently identify Alphaproteobacteria species in these metagenomic datasets, and that substantial differences can be detected in the distribution and relative abundance of different Alphaproteobacteria species across the tested datasets. Thus the CSPs, which are specific for different microbial taxa, provide a novel and powerful means for identifying microbes and for their taxonomic profiling in metagenomic datasets.</p>en_US
dc.description.degreeMaster of Science (MSc)en_US
dc.identifier.otheropendissertations/8659en_US
dc.identifier.other9738en_US
dc.identifier.other4942657en_US
dc.identifier.urihttp://hdl.handle.net/11375/15324
dc.subjectmetagenomicen_US
dc.subjectAlphaproteobacteriaen_US
dc.subjectmolecular markeren_US
dc.subjectbacterial diagnosisen_US
dc.subjectBioinformaticsen_US
dc.titleIdentification of Environmental Alphaproteobacteria with Conserved Signature Proteins in Metagenomic Datasetsen_US
dc.typethesisen_US

Files

Original bundle

Name: fulltext.pdf
Size: 1.68 MB
Format: Adobe Portable Document Format