41 |
Bayesian Biclustering on Discrete Data: Variable Selection MethodsGuo, Lei 18 October 2013 (has links)
Biclustering is a technique for clustering rows and columns of a data matrix simultaneously. Over the past few years, we have seen its applications in biology-related fields, as well as in many data mining projects. As opposed to classical clustering methods, biclustering groups objects that are similar only on a subset of variables. Many biclustering algorithms on continuous data have emerged over the last decade. In this dissertation, we will focus on two Bayesian biclustering algorithms we developed for discrete data, more specifically categorical data and ordinal data. / Statistics
|
42 |
Historical and contemporary processes shaping population genetic structure in an anadromous fish (Osmerus mordax)Coulson, Mark 12 February 2014 (has links)
The spatial scale at which populations are genetically structured is of immense interest for the understanding of a species’ ecology and evolutionary biology. This can have important implications for management of resources as well as predicting responses to future change. Rainbow smelt (Osmerus mordax) is an anadromous species with a relatively short freshwater residence time compared to other species with similar life-history strategies. Therefore, while they offer the opportunity to sample distinct spawning aggregations, they also offer an insight into the relative roles of contemporary and historical factors shaping connectivity among marine populations, an area of great interest, and for which further understanding is required. With the use of both mitochondrial DNA and nuclear microsatellite markers, I explored the historical and contemporary factors influencing population structure in smelt. While previous phylogeographic work on this species has resolved two mtDNA lineages dating back to previous glacial episodes, I document the discovery of a zone of contact between these lineages in Newfoundland. This is in addition to the established contact zone in the St. Lawrence estuary, and results in a longitudinal distribution of the races with one race predominating on opposite ends of the species distribution, while the other race is geographically intermediate. Patterns of nuclear genetic variation largely mirror the phylogeographic signals in Newfoundland and suggest a more recent colonization of the Avalon Peninsula as well as implicating a remnant historical signal of colonization of the west coast of Newfoundland from the mainland. In addition, contrasting patterns of genetic diversity and levels of differentiation were apparent between the mainland and Newfoundland and suggest differing scales of dispersal within this species. While the population structure within Newfoundland is most consistent with dispersal restricted to within bays, larger scale biogeographic regions were identified in the mainland range, suggesting dispersal is more common and widespread. In addition, sampling of different run times (i.e. ‘early’ vs. ‘late’) demonstrated the potential for isolation by time when spawning events are separated by a break in activity. Overall, these results shed light into the possible roles of both historical and contemporary factors shaping the dynamics and connectivity among populations.
|
43 |
Global and Fine Scale Molecular Studies of Polyploid Evolution in Crataegus L. (Rosaceae)Lo, Eugenia Yuk Ying 19 January 2009 (has links)
As many as 70% of angiosperm species are known to contain polyploids, but many aspects of polyploid evolution are unclear in woody plants. Crataegus is a woody genus of Rosaceae comprising 140-200 species that are widely distributed in the Northern Hemisphere. Several species, particularly those in North America, are shown to contain polyploids. The overall goal of the thesis is to provide a better understanding of polyploid evolution by resolving problems from intergeneric to intraspecific levels in Crataegus using phylogenetic and population genetic approaches. Three major aspects were investigated: (1) Phylogeography of the Old and New World Crataegus; (2) Reproductive system and distribution of cytotypes of the black-fruited series Douglasianae in Pacific Northwest and; (3) Origins, population structure, and genetic diversity of diploid and polyploid species.
Phylogenetic analyses of molecular data provide evidences of historical events such as trans-Beringian migrations and North Atlantic vicariance that contributed to modern distribution of Crataegus. Poor resolution and short internal branches in eastern North American species suggest genetic bottlenecks and/or rapid divergence following glaciations. In the Pacific Northwest, polyploids of series Douglasianae show a wider distribution and ecological amplitude than diploids. Parsimony tree and network analyses indicate that autotriploids and allotriploids occur in C. suksdorfii, while tetraploid C. suksdorfii are formed via the triploid bridge followed by introgression of sympatric C. douglasii. At the regional level, microsatellite data indicate a separation of the Pacific coastal diploids and triploids from the Columbia Plateau and Rocky Mountain triploids and tetraploids. High genetic differentiation among C. suksdorfii populations suggests that gene flow is limited by ploidy level differences as well as geographical distance. Within-population multilocus genotypic variation is greatest in sexual diploids, and least in apomictic triploids. Frequent gene flow via seed dispersal contributes to an appreciable level of intrapopulation diversity in apomictic tetraploids, and counterbalances the effects of apomixis and/or self-fertilization, which diminish genetic variation within and between seed families. These findings collectively clarify taxonomy and historical biogeography, provide an explicit reticulation model for polyploid formation, and shed light on evolution of natural populations in woody plants that show heterogeneous ploidy levels and reproductive systems.
|
44 |
Global and Fine Scale Molecular Studies of Polyploid Evolution in Crataegus L. (Rosaceae)Lo, Eugenia Yuk Ying 19 January 2009 (has links)
As many as 70% of angiosperm species are known to contain polyploids, but many aspects of polyploid evolution are unclear in woody plants. Crataegus is a woody genus of Rosaceae comprising 140-200 species that are widely distributed in the Northern Hemisphere. Several species, particularly those in North America, are shown to contain polyploids. The overall goal of the thesis is to provide a better understanding of polyploid evolution by resolving problems from intergeneric to intraspecific levels in Crataegus using phylogenetic and population genetic approaches. Three major aspects were investigated: (1) Phylogeography of the Old and New World Crataegus; (2) Reproductive system and distribution of cytotypes of the black-fruited series Douglasianae in Pacific Northwest and; (3) Origins, population structure, and genetic diversity of diploid and polyploid species.
Phylogenetic analyses of molecular data provide evidences of historical events such as trans-Beringian migrations and North Atlantic vicariance that contributed to modern distribution of Crataegus. Poor resolution and short internal branches in eastern North American species suggest genetic bottlenecks and/or rapid divergence following glaciations. In the Pacific Northwest, polyploids of series Douglasianae show a wider distribution and ecological amplitude than diploids. Parsimony tree and network analyses indicate that autotriploids and allotriploids occur in C. suksdorfii, while tetraploid C. suksdorfii are formed via the triploid bridge followed by introgression of sympatric C. douglasii. At the regional level, microsatellite data indicate a separation of the Pacific coastal diploids and triploids from the Columbia Plateau and Rocky Mountain triploids and tetraploids. High genetic differentiation among C. suksdorfii populations suggests that gene flow is limited by ploidy level differences as well as geographical distance. Within-population multilocus genotypic variation is greatest in sexual diploids, and least in apomictic triploids. Frequent gene flow via seed dispersal contributes to an appreciable level of intrapopulation diversity in apomictic tetraploids, and counterbalances the effects of apomixis and/or self-fertilization, which diminish genetic variation within and between seed families. These findings collectively clarify taxonomy and historical biogeography, provide an explicit reticulation model for polyploid formation, and shed light on evolution of natural populations in woody plants that show heterogeneous ploidy levels and reproductive systems.
|
45 |
Examination of the population structure of darkbarbel catfish (Pelteobagrus vachelli) in the Upper Yangtze River, China, using novel microsatellite markersPOWELL, ADRIENNE 05 July 2012 (has links)
The darkbarbel catfish (Peltoebagrus vachelli) is a small bodied benthic fish that inhabits the Yangtze River in China and is commercially valued as food. The objectives of this project were to develop the first set of microsatellite primers specific to P. vachelli and use them to examine the levels of genetic variability and population structure of three populations collected at three sites (Yibin, Luzhou, and Song Ji) along an unfragmented stretch of the upper Yangtze River. Microsatellite primers were designed from an enriched library of genomic DNA and a total of eight primer pairs were optimized to produce reliable polymorphic PCR amplicons on a LI-COR 4200 IR2 platform. A high level of variability was detected amongst the isolated loci with the number of alleles per locus ranging from 14 – 29 and mean observed and expected heterozygosity values of 0.84 and 0.90 respectively across the entire sample set. Overall levels of genetic diversity were high within the three populations with mean observed and expected heterozygosities ranging from 0.823 – 0.869 and 0.864 – 0.921 respectively. Little evidence of genetic structure was detected (global FST = 0.0065, p < 0.05) within the sampled region and pairwise tests of differentiation were not significant (all FST p > 0.05). These results imply historically large populations of P. vachelli in the upper portion of the Yangtze River that, in the absence of artificial barriers, are part of a single panmictic unit. As plans have been put forth for construction of hydroelectric installations within the sampled region, this study provides a baseline estimate of the levels of genetic variation present in P. vachelli within the remaining undammed stretch of the Upper Yangtze River which will serve as a foundation for future analyses on the effects of river fragmentation within this region. Additionally the isolation and characterization of the microsatellites will provide useful molecular markers for a variety of other applications in this and closely related species such as parentage analysis, determination of stocks, and maintenance of genetic variation in stocking practices. / Thesis (Master, Biology) -- Queen's University, 2012-06-29 18:00:25.509
|
46 |
Genetic population structure of walleye (Sander vitreus) in northern Alberta and application to species managementBurke, Lindsey Alison Unknown Date
No description available.
|
47 |
Population Structure, Association Mapping of Economic Traits and Landscape Genomics of East Texas Loblolly Pine ( Pinus taeda L.)Chhatre, Vikram E. 03 October 2013 (has links)
Loblolly pine (Pinus taeda L.) is an ecologically and economically important southern pine, distributed across the southeastern United States. Its genetic improvement for breeding and deployment is a major goal of the Western Gulf Forest Tree Improvement Program (WGFTIP) hosted by the Texas A&M Forest Service. Rapid advances in genomics and molecular marker technology have created potential for application of Marker Assisted Selection (MAS) and Genomic Selection (GS) for accelerated breeding in forest trees. First-generation selection (FGS) and second- generation selection (SGS) breeding populations of loblolly pine from east Texas were studied to estimate the genetic diversity, population structure, linkage disequilibrium (LD), signatures of selection and association of breeding traits with genetic markers using a genome-wide panel of 4264 single nucleotide polymorphisms (SNPs). Under- standing the genetic basis of local adaptation is crucial to disentangle the dynamics of gene flow, drift and selection and to address climate change. Bayesian mixed linear models and logistic regression were used to associate SNP variation with geography, climate, aridity and growth season length and markers with strong correlations were investigated for biological functions.
Relatively high levels of observed (Ho = 0.178–0.198) and expected (He = 0.180-0.198) heterozygosities were found in all populations. The amount of inbreeding was very low, and many populations exhibited a slight excess of heterozygotes. The population substructure was weak, but FST indicated more pronounced differentiation in the SGS populations. As expected for outcrossing natural populations, the genome-wide LD was low, but marker density was insufficient to deduce the decay rate. Numerous associations were found between various phenotypes and SNPs, but few remained significant after false positive correction. Signatures of diversifying and balancing selection were found in markers representing important biological functions. Strong correlations supported by Bayes factors were found between various environmental variables and several SNPs. Logistic regression found hundreds of significant marker-environment associations, but none remained significant after false-positive correction, which was likely too stringent and will require further investigation. Annotations of significant markers implicated them in crucial biological functions.
These results present the first step in the application of MAS to the WGFTIP for loblolly pine genetic improvement and will contribute to the knowledgebase necessary for genomic selection technology. Results from environmental association study provide important information for designing breeding strategies to address climate change and for genetic conservation purposes.
|
48 |
Genetic population structure of walleye (Sander vitreus) in northern Alberta and application to species managementBurke, Lindsey Alison 11 1900 (has links)
Walleye (Sander vitreus) is an economically valuable freshwater fish throughout North America. In Alberta, pressure from sport fishing and commercial fishing make effective management and protection of this species crucial to its sustainability. Walleye from 12 Alberta lakes were genetically characterized using 15 microsatellite markers. Each lake contained a genetically distinct walleye subpopulation within a larger population of the river basin in which the lake was situated. Differentiation between subpopulations varied (ST=0.05 to 0.29). Patterns of genetic divergence aligned closely with the current hydro-geographical landscape, except where stocking events have occurred. Vicariance and natal philopatry are likely mechanisms maintaining the current genetic structure. The markers detected sufficient genetic variation between most subpopulations to assign an individual fish to a subpopulation of origin. The utility of genetic assignment was illustrated for stocking assessment and forensic enforcement. These genetic data will help to inform management decisions, monitor population status and enforce harvest restrictions for Alberta walleye. / Systematics and Evolution
|
49 |
Testing hypotheses in molecular ecology: genetic exchange and hybrid performanceHolleley, Clare Ellen, Biological, Earth & Environmental Sciences, Faculty of Science, UNSW January 2009 (has links)
Population structure, gene flow and dispersal are some of the most commonly estimated population parameters in population genetics, evolutionary biology and conservation genetics. The primary aim of thesis is to test the precision and accuracy of genetic estimates of population structure, gene flow and dispersal. The controlled replicated Drosophila melanogaster experiments of known effective population size (Ne = 14.3) and dispersal rate (m = 0.0025 - 0.04) all adhered to Wright??s demographic island model. Three statistical approaches were empirically tested: 1) the conversion of population structure to gene flow using FST, RST, SHUA and PhiST ; 2) the private alleles method to estimate gene flow; 3) a Bayesian assignment method to estimate dispersal (BAYESASS 1.2). Even in the best-case scenario, almost all current methods except SHUA significantly underestimate population structure, and consequently overestimate gene flow and dispersal when applied to real populations. It was crucial to ensure that the manipulated rate of gene flow was correctly defined. This led to three supporting investigations of hybrid performance, inversion polymorphisms and effective population size. The hybrid performance investigation demonstrated that the manipulated rate of gene flow had not been unexpectedly inflated by hybrid vigour or reduced by breakdown. These experiments also demonstrated that close inbreeding is not a necessary precondition for hybrid vigour or breakdown, which is important for conservation strategies involving induced dispersal. The investigation of inversion polymorphisms ensured that the manipulated rate of gene flow was not affected by selection on inverted regions. The effective population size investigation used a temporal estimation method to confirm that the Ne was accurately predicted by an N:Ne ratio of 0.286. Additionally this experiment showed that the single-sample estimation methods implemented by ONeSAMP or LDNE resulted in downwardly biased estimates of Ne in structured populations. In conclusion these results call into question the confidence that biologists may have in some of the most widely used molecular tools in conservation biology.
|
50 |
The Population Genetic Structure of Portunus Pelagicus in Australian WatersEsezmis@murdoch.edu.au, Ertug Sezmis January 2004 (has links)
This thesis describes the results of an investigation into the population genetic structure
of the blue swimmer crab, Portunus pelagicus, in Australian waters. P. pelagicus is an
Indo-West Pacific species, with adults and juveniles that inhabit sheltered benthic
coastal environments and a planktonic phase (of modest duration) in its life cycle.
The investigation was done by examining the patterns of variation at six microsatellite
loci and in a 342 bp portion of the cytochrome oxidase subunit I (COI) gene in the
mitochondrial DNA in samples of Portunus pelagicus from a total of 16 different
assemblages/waterbodies. Overall, the samples were collected from throughout the
geographical range of this species in Australian waters, i.e. from the western seaboard,
from the eastern seaboard, from Darwin on the north coast and from South Australia on
the south coast. The samples sizes ranged from 4 to 57 individuals, depending on the
sample and the genetic assay. The population genetic structure of P. pelagicus was
analysed from both a traditional population structure perspective and from a
phylogeographical and historical demography perspective.
The traditional assessment of the population genetic structure of Portunus pelagicus
indicates that this species exhibits a significant amount of genetic heterogeneity in
Australian waters (e.g. FST for microsatellite data = 0.098; ¥èST for COI data = 0.375 and
¥ÕST for COI data = 0.492). This assessment also indicates that P. pelagicus exhibits
varying degrees of genetic heterogeneity within and between geographical regions in
Australian waters, as follows. (1) The genetic compositions of the samples from the
different coastlines (i.e. north, south, east and west) invariably showed statistically
significant differences for at least two microsatellite loci, although the differences
between the samples from the eastern seaboard, Darwin and those from the western
seaboard to the north of Port Denison were not as great as those within the western
seaboard samples or within South Australian samples. (2) The genetic compositions of
the samples from the assemblages on the eastern seaboard of Australia, which ranged
from Mackay (21¨¬06¡ÇS) to Port Stephens (32¡Æ40¡ÇS), were essentially homogeneous. (3)
The samples from the assemblages on the western seaboard of Australia, which ranged
from Broome (17¡Æ58¡ÇS) to Geographe Bay (33¡Æ35¡ÇS), exhibited significant levels of
genetic heterogeneity. Furthermore, those from south of Port Denison formed a highly
distinctive (but not invariant) group compared to those from elsewhere. (4) The samples
from South Australia were also highly genetically distinctive compared to those from
elsewhere, although they also showed significant heterogeneity amongst themselves.
The above findings were more or less suggested by both the microsatellite and COI
markers, although the former generally provided a higher resolution picture of the
population structure of P. pelagicus than did the latter.
The main findings of the investigation into the phylogeography and recent demographic
history of Portunus pelagicus in Australian waters were as follows. (1) A phylogeny
constructed from COI sequence variation was shallow, with the lineages showing varied
geographical distributions. (2) The results of a nested clade analysis of this variation
indicate that range expansion has been a predominant influence on the historical
demography of P. pelagicus in Australian waters. (3) The samples from the
assemblages on the western seaboard to the south of Port Denison contained low levels
of genetic diversity, a sub-set of the diversity present in the samples from lower latitude
sites on the western seaboard, and microsatellite-based evidence of having coming from
assemblages that have undergone a bottleneck (or founder effect) followed by an
expansion in size. (4) The samples from the assemblages in South Australia contained
low levels of genetic diversity, phylogenetic affinities with samples from the eastern
seaboard, and microsatellite-based evidence of having coming from assemblages that
have undergone a bottleneck (or founder effect) followed by an expansion in size.
The two major interpretations to stem from the results of this investigation are as
follows. (1) Overall, Portunus pelagicus has undergone a recent (in an evolutionary
sense) range expansion, from a single source, within Australian waters. At a finer-scale,
this species appears to have colonised south-western Australia from a lower latitude
site(s) on the western seaboard and probably colonised South Australia from the
southern margins of its range on the eastern seaboard. Regardless, there has been
limited penetrance of genetic variation into temperate waters on the western seaboard
and into South Australia, presumably due one or more of the barriers to gene flow listed
below. (2) P. pelagicus experiences significant restrictions to gene flow within its
present-day geographical range in Australian waters due to (i) geographic distance per
se; (ii) discontinuities in the distribution of the sheltered coastal environments; (iii)
hydrological barriers to dispersal and (iv) possibly low temperatures in the temperature
margins of the range.
|
Page generated in 0.1765 seconds