In previous work, we presented GAMI, an approach to motif inference that uses a genetic algorithm search. GAMI is designed specifically to find putative conserved regulatory motifs in noncoding regions of divergent species and to allow analysis of long nucleotide sequences. In this work, we compare GAMI's performance when run with its original fitness function (a simple count of the number of matches) and when run with information content, as well as with several variations on these metrics. Results indicate that information content does not identify highly conserved regions and is therefore not an appropriate metric for this task, whereas variations on information content, as well as the original metric, succeed in identifying putative conserved regions.
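
To make the two kinds of fitness metric concrete, here is a minimal sketch in Python. It is not GAMI's implementation: the abstract gives no formulas, so the function names, the mismatch tolerance, the choice of the closest window in each sequence, and the uniform-background form of information content (2 bits minus the column entropy, summed over positions) are all assumptions made for illustration.

```python
import math
from collections import Counter

def best_window(motif, seq):
    """Return (mismatches, window) for the window of seq closest to motif.

    Assumes len(seq) >= len(motif); this helper is hypothetical, not from GAMI.
    """
    k = len(motif)
    best = None
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        mismatches = sum(a != b for a, b in zip(motif, window))
        if best is None or mismatches < best[0]:
            best = (mismatches, window)
    return best

def match_count(motif, seqs, max_mismatches=0):
    """Assumed match-count fitness: number of sequences that contain the motif
    within max_mismatches substitutions."""
    return sum(1 for s in seqs if best_window(motif, s)[0] <= max_mismatches)

def information_content(windows):
    """Assumed information-content fitness: sum over alignment columns of
    (2 bits - Shannon entropy), with a uniform background over A, C, G, T."""
    n = len(windows)
    total = 0.0
    for column in zip(*windows):
        counts = Counter(column)
        entropy = -sum((c / n) * math.log2(c / n) for c in counts.values())
        total += 2.0 - entropy
    return total

# Toy data (invented for illustration): three sequences share GACGTA, one does not.
seqs = ["TTGACGTAAC", "CCGACGTATT", "AGACGTAGGA", "TTTTTTTTTT"]
motif = "GACGTA"
windows = [best_window(motif, s)[1] for s in seqs]

print("match count:", match_count(motif, seqs))
print("information content (bits):", round(information_content(windows), 3))
```

In this sketch the match count depends only on how many sequences contain the candidate, while information content is computed from the column statistics of the aligned windows, so the two scores can rank the same candidate motifs differently.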


Source
http://dx.doi.org/10.1109/TCBB.2007.1059


