Evaluation of Glycine max mRNA clusters.

BMC Bioinformatics

Biological Sciences Department, University of Missouri-Rolla, Rolla, MO, USA.

Published: July 2005

Background: Clustering the ESTs from a large dataset representing a single species is a convenient starting point for a number of investigations into gene discovery, genome evolution, expression patterns, and alternatively spliced transcripts. Several methods have been developed to accomplish this, the most widely available being UniGene, a public domain collection of gene-oriented clusters for over 45 different species created and maintained by NCBI. The goal is for each cluster to represent a unique gene, but currently it is not known how closely the overall results represent that reality. UniGene's build procedure begins with initial mRNA clusters before joining ESTs. UniGene's results for soybean indicate a significant amount of redundancy among some sequences reported to be unique mRNAs. To establish a valid non-redundant known gene set for Glycine max we applied our algorithm to the clustering of only mRNA sequences. The mRNA dataset was run through the algorithm using two different matching stringencies. The resulting cluster compositions were compared to each other and to UniGene. Clusters exhibiting differences among the three methods were analyzed by 1) nucleotide and amino acid alignment and 2) submitting authors conclusions to determine whether members of a single cluster represented the same gene or not.

Results: Of the 12 clusters that were examined closely most contained examples of sequences that did not belong in the same cluster. However, neither the two stringencies of PECT nor UniGene had a significantly greater record of accuracy in placing paralogs into separate clusters.

Conclusion: Our results reveal that, although each method produces some errors, using multiple stringencies for matching or a sequential hierarchical method of increasing stringencies can provide more reliable results and therefore allow greater confidence in the vast majority of clusters that contain only ESTs and no mRNA sequences.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1637028PMC
http://dx.doi.org/10.1186/1471-2105-6-S2-S7DOI Listing

Publication Analysis

Top Keywords

glycine max
8
mrna clusters
8
mrna sequences
8
clusters
6
mrna
5
evaluation glycine
4
max mrna
4
clusters background
4
background clustering
4
clustering ests
4

Similar Publications

Objectives: Soybeans have various positive effects on health, including anti-inflammatory and preventing kidney damage. There is concern regarding the phytoestrogen content due to the high isoflavone content in soybeans. Various forms of soybean processing have been tried; in this study, the hydrolysis method will be used to obtain the active substance Arginine-Glycine-Aspartate (RGD) tripeptide in soybean protein hydrolyzed by bromelain (SPHB).

View Article and Find Full Text PDF

We generated soybean mutants related to two ß-amyrin synthase genes using DNA-free site-directed mutagenesis system. Our results suggested that one of the genes is predominant in the soyasaponin biosynthesis. Soyasaponins, which are triterpenoid saponins contained in soybean [Glycine max (L.

View Article and Find Full Text PDF

Synergistic effects of GmLFYa and GmLFYb on Compound Leaf Development in Soybean.

Physiol Plant

January 2025

School of Life Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China.

Legume leaves exhibit diverse compound forms, with various regulatory mechanisms underlying the development. The transcription factor-encoding KNOXI genes are required to promote leaflet initiation in most compound-leafed angiosperms. In non-IRLC (inverted repeat-lacking clade) legumes, KNOXI are expressed in compound leaf primordia but not in others (IRLC).

View Article and Find Full Text PDF

Variations in the proportions of the two major soybean [Glycine max (L.) Merr.] seed globulins, glycinin (11S) and β-conglycinin (7S), significantly affect the nutritional and functional properties of soy-based products, but comprehensive methods for the identification and quantification of individual subunits of these proteins are currently lacking.

View Article and Find Full Text PDF

Crop rotation effects on the population density of soybean soilborne pathogens under no-till cropping system.

Plant Dis

January 2025

USDA-ARS North Central Agricultural Research Laboratory, Brookings, South Dakota, United States;

Soilborne diseases are persistent problems in soybean production. Long-term crop rotation can contribute to soilborne disease management. However, the response of soilborne pathogens to crop rotation is inconsistent, and rotation efficacy is highly variable.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!