Spelling suggestions: "subject:"dequence databases"" "subject:"1sequence databases""
1 |
A Hash Trie Filter Approach to Approximate String Match for Genomic DatabasesHsu, Min-tze 28 June 2005 (has links)
Genomic sequence databases, like GenBank, EMBL, are widely used by molecular biologists for homology searching. Because of the long length of each genomic sequence and the increase of the size of genomic sequence databases, the importance of efficient searching methods for fast queries grows. The DNA sequences are composed of four kinds of nucleotides, and these genomic sequences can be regarded as the text strings. However, there is no concept of words in a genomic sequence, which makes the search of the genomic sequence in the genomic database much difficult. Approximate String Matching (ASM) with k errors is considered for genomic sequences, where k errors would be caused by insertion, deletion, and replacement operations. Filtration of the DNA sequence is a widely adopted technique to reduce the number of the text areas (i.e., candidates) for further verification. In most of the filter methods, they first split the database sequence into q-grams. A sequence of grams (subpatterns) which match some part of the text will be passed as a candidate. The match problem of grams with the part of the text could be speed up by using the index structure for the exact match. Candidates will then be examined by dynamic programming to get the final result. However, in the previous methods for ASM, most of them considered the local order within each gram. Only the (k + s) h-samples filter considers the global order of the sequence of matched grams. Although the (k + s) h-samples filter keeps the global order of the sequence of the grams, it still has some disadvantages. First, to be a candidate in the (k + s) h-samples filter, the number of the ordered matched grams, s, is always fixed to 2 which results in low precision. Second, the (k + s) h-samples filter uses the query time to build the index for query patterns. In this thesis, we propose a new approximate string matching method, the hash trie filter, for efficiently searching in genomic databases. We build a hash trie in the pre-computing time for the genomic sequence stored in database. Although the size q of each split grams is also decided by the same formula used in the (k + s) h-samples filter, we have proposed a different way to find the ordered subpatterns in text T. Moreover, we reduce the number of candidates by pruning some unreasonable matched positions. Furthermore, unlike the (k + s) h-samples filter which always uses s = 2 to decide whether s matched subpatterns could be a candidate or not, our method will dynamically decide s, resulting in the increase of precision. The simulation results show that our hash trie filter outperforms the (k +s) h-samples filter in terms of the response time, the number of verified candidates, and the precision under different length of the query patterns and different error levels.
|
2 |
Bacterial diversity in Buruli ulcer skin lesions: Challenges in the clinical microbiome analysis of a skin diseaseVan Leuvenhaege, C., Vandelannoote, K., Affolabi, D., Portaels, F., Sopoh, G., de Jong, B.C., Eddyani, M., Meehan, Conor J. 05 November 2019 (has links)
Yes / Background
Buruli ulcer (BU) is an infectious disease caused by Mycobacterium ulcerans and considered the third most prevalent mycobacterial disease in humans. Secondary bacterial infections in open BU lesions are the main cause of pain, delayed healing and systemic illness, resulting in prolonged hospital stay. Thus, understanding the diversity of bacteria, termed the microbiome, in these open lesions is important for proper treatment. However, adequately studying the human microbiome in a clinical setting can prove difficult when investigating a neglected tropical skin disease due to its rarity and the setting.
Methodology/Principal findings
Using 16S rRNA sequencing, we determined the microbial composition of 5 BU lesions, 3 non-BU lesions and 3 healthy skin samples. Although no significant differences in diversity were found between BU and non-BU lesions, the former were characterized by an increase of Bacteroidetes compared to the non-BU wounds and the BU lesions also contained significantly more obligate anaerobes. With this molecular-based study, we were also able to detect bacteria that were missed by culture-based methods in previous BU studies.
Conclusions/Significance
Our study suggests that BU may lead to changes in the skin bacterial community within the lesions. However, in order to determine if such changes hold true across all BU cases and are either a cause or consequence of a specific wound environment, further microbiome studies are necessary. Such skin microbiome analysis requires large sample sizes and lesions from the same body site in many patients, both of which can be difficult for a rare disease. Our study proposes a pipeline for such studies and highlights several drawbacks that must be considered if microbiome analysis is to be utilized for neglected tropical diseases.
|
3 |
Identification and Expression Analysis of Zebrafish Glypicans during Embryonic DevelopmentBrand, Michael, Gupta, Mansi 02 December 2015 (has links) (PDF)
Heparan sulfate Proteoglycans (HSPG) are ubiquitous molecules with indispensable functions in various biological processes. Glypicans are a family of HSPG’s, characterized by a Gpi-anchor which directs them to the cell surface and/or extracellular matrix where they regulate growth factor signaling during development and disease. We report the identification and expression pattern of glypican genes from zebrafish. The zebrafish genome contains 10 glypican homologs, as opposed to six in mammals, which are highly conserved and are phylogenetically related to the mammalian genes. Some of the fish glypicans like Gpc1a, Gpc3, Gpc4, Gpc6a and Gpc6b show conserved synteny with their mammalian cognate genes. Many glypicans are expressed during the gastrulation stage, but their expression becomes more tissue specific and defined during somitogenesis stages, particularly in the developing central nervous system. Existence of multiple glypican orthologs in fish with diverse expression pattern suggests highly specialized and/or redundant function of these genes during embryonic development.
|
4 |
Phylogenetic relationships and arbuscular mycorrhizal diversity of Tolpis Adans. (Asteraceae), with special reference to island endemism and biogeographyGruenstaeudl, Michael 29 January 2014 (has links)
The plant genus Tolpis (Asteraceae) is a predominantly insular plant lineage. It inhabits four of the five archipelagoes that comprise the Atlantic region of Macaronesia and also occurs in Mediterranean Europe and North Africa. Twelve species are currently recognized in Tolpis, of which ten are insular and two continental. The majority of the insular species inhabit the five western Canarian islands, where they constitute endemics to specific ecological habitats. A comprehensive molecular phylogeny of Tolpis is generated via DNA sequences of one nuclear ribosomal and two low-copy nuclear DNA markers. Considerable phylogenetic uncertainty among inferred tree topologies is detected, and incongruence between these topologies is resolved via statistical hypotheses testing. The extant diversity of the genus is identified to be the result of two independent colonization pathways and adaptive radiations on several islands. Moreover, potential hybridization is detected between species that inhabit different islands and archipelagoes, indicating a more widespread historical distribution of the genus. Details of the biogeographic history of Tolpis are inferred via ancestral area reconstructions under parsimony and likelihood optimality criteria. The hypothesis that Tolpis may have undergone a back-dispersal from an island to a continental habitat is also tested. Uncertainty in taxon cladograms owing to the presence of hybrid or allopolyploid taxa is characterized and a potential adjustment strategy evaluated. Averaging reconstruction results over all optimal phylogenetic trees and the manual pruning of cloned DNA sequences are found potential adjustment strategies against the impact of topological uncertainty owning to hybrid or allopolyploid taxa. Adjusted ancestral area reconstructions in Tolpis do not support the scenario that the genus has undergone a reverse colonization of the continent. In addition to the phylogenetic and biogeographic history of the genus, the diversity of symbiotic mycorrhizal fungi associated with Tolpis is characterized. A molecular survey using two nuclear ribosomal DNA markers and 454 pyrosequencing is performed. Particular emphasis is placed on the quality filtering of resulting fungal DNA sequences, the generation of operational taxonomic units, and their taxonomic assignment via similarity searches against DNA sequence databases. Numerous potentially novel fungal genotypes are identified. / text
|
5 |
Identification and Expression Analysis of Zebrafish Glypicans during Embryonic DevelopmentBrand, Michael, Gupta, Mansi 02 December 2015 (has links)
Heparan sulfate Proteoglycans (HSPG) are ubiquitous molecules with indispensable functions in various biological processes. Glypicans are a family of HSPG’s, characterized by a Gpi-anchor which directs them to the cell surface and/or extracellular matrix where they regulate growth factor signaling during development and disease. We report the identification and expression pattern of glypican genes from zebrafish. The zebrafish genome contains 10 glypican homologs, as opposed to six in mammals, which are highly conserved and are phylogenetically related to the mammalian genes. Some of the fish glypicans like Gpc1a, Gpc3, Gpc4, Gpc6a and Gpc6b show conserved synteny with their mammalian cognate genes. Many glypicans are expressed during the gastrulation stage, but their expression becomes more tissue specific and defined during somitogenesis stages, particularly in the developing central nervous system. Existence of multiple glypican orthologs in fish with diverse expression pattern suggests highly specialized and/or redundant function of these genes during embryonic development.
|
6 |
Tunable Protein Stabilization In Vivo Mediated by Shield-1 in Transgenic MedakaFroschauer, Alexander, Kube, Lisa, Kegler, Alexandra, Rieger, Christiane, Gutzeit, Herwig O. 07 January 2016 (has links) (PDF)
Techniques for conditional gene or protein expression are important tools in developmental biology and in the analysis of physiology and disease. On the protein level, the tunable and reversible expression of proteins can be achieved by the fusion of the protein of interest to a destabilizing domain (DD). In the absence of its specific ligand (Shield-1), the protein is degraded by the proteasome. The DD-Shield system has proven to be an excellent tool to regulate the expression of proteins of interests in mammalian systems but has not been applied in teleosts like the medaka. We present the application of the DD-Shield technique in transgenic medaka and show the ubiquitous conditional expression throughout life. Shield-1 administration to the water leads to concentration-dependent induction of a YFP reporter gene in various organs and in spermatogonia at the cellular level.
|
7 |
Tunable Protein Stabilization In Vivo Mediated by Shield-1 in Transgenic Medaka: Research ArticleFroschauer, Alexander, Kube, Lisa, Kegler, Alexandra, Rieger, Christiane, Gutzeit, Herwig O. 07 January 2016 (has links)
Techniques for conditional gene or protein expression are important tools in developmental biology and in the analysis of physiology and disease. On the protein level, the tunable and reversible expression of proteins can be achieved by the fusion of the protein of interest to a destabilizing domain (DD). In the absence of its specific ligand (Shield-1), the protein is degraded by the proteasome. The DD-Shield system has proven to be an excellent tool to regulate the expression of proteins of interests in mammalian systems but has not been applied in teleosts like the medaka. We present the application of the DD-Shield technique in transgenic medaka and show the ubiquitous conditional expression throughout life. Shield-1 administration to the water leads to concentration-dependent induction of a YFP reporter gene in various organs and in spermatogonia at the cellular level.
|
Page generated in 0.0441 seconds