The current capacity for genome sequence analysis lags significantly behind rapidly evolving sequencing technologies. Retrieving biologically meaningful information from an ever-increasing amount of genome data would significantly benefit functional genomic studies. For example, the duplication, organization, evolution, and function of superfamily genes are arguably important in many aspects of life. However, the incompleteness of annotations in many sequenced genomes often results in biased conclusions in comparative genomic studies of superfamilies. Here, we present a Perl program, called Closing Target Trimming (CTT), for automatically identifying most, if not all, members of a gene family in any sequenced genome on the CentOS 7 platform. To support broader use on other operating systems, we also created a Docker application package, CTTdocker. Our test data on the F-box gene superfamily showed gene-finding accuracies of 78.2% and 79% in two well-annotated plant genomes, Arabidopsis thaliana and rice, respectively. To further demonstrate the effectiveness of this program, we ran it on 18 plant genomes and five non-plant genomes to compare the expansion of the F-box and BTB superfamilies. The program discovered that, on average, 12.7% and 9.3% of the total F-box and BTB members, respectively, are new loci in plant genomes, whereas it found only a small number of new members in vertebrate genomes. Therefore, different evolutionary and regulatory mechanisms of Cullin-RING ubiquitin ligases may be present in plants and animals. We also annotated and compared the Pkinase family members across a wide range of organisms, including 10 fungi, 10 metazoa, 10 vertebrates, and 10 additional plants, which were randomly selected from the Ensembl database. Our CTT annotation recovered on average 14% more loci, including pseudogenes, of the Pkinase superfamily in these 40 genomes, demonstrating its robust replicability and scalability in annotating superfamily members in any genome.
Download full-text PDF | Source
---|---
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6605638 | PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0209468 | PLOS
Genome
January 2025
USDA-ARS, Wheat, Sorghum & Forage Research Unit, Lincoln, Nebraska, United States.
(2n=2x=14, genome SS) is a wild relative of wheat and a donor of useful traits for wheat improvement. Several whole-genome studies compared genic regions of from the section and wheat and found that is most closely related to the wheat B subgenome but is not its direct progenitor. The results showed that a B subgenome ancestor diverged from more than 4 MYA and either has not yet been discovered, or is extinct.
Genome
January 2025
Damietta University Faculty of Science, New Damietta, Damietta, Egypt.
Polyamine oxidases (PAOs) are enzymes associated with polyamine catabolism and play important roles in the growth, development, and stress tolerance of plants. In the present study, genome-wide discovery and analysis of the PAO family in sorghum was carried out using model PAOs of Arabidopsis. Six PAO genes were found using publicly available genomic data.
Proc Natl Acad Sci U S A
February 2025
Molecular Genetics, Institute of Biology, Faculty of Life Sciences, Humboldt Universität zu Berlin, Berlin 10115, Germany.
The chloroplast genome encodes key components of the photosynthetic light reaction machinery as well as the large subunit of the enzyme central to carbon fixation, ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO). Its expression is predominantly regulated posttranscriptionally, with nuclear-encoded RNA-binding proteins (RBPs) playing a key role. Mutants of chloroplast gene expression factors often exhibit impaired chloroplast biogenesis, especially in cold conditions.
Microbiol Resour Announc
January 2025
Centre de Biotechnologies Végétales et Microbiennes, Biodiversité et Environnement, Faculty of Sciences, Mohammed V University in Rabat, Rabat, Morocco.
In this study, we present the complete genome of LLZ14, a nodule-forming bacterium isolated from root nodules with high plant growth-promoting abilities. This genome contains genes predicted to be involved in plant stress tolerance and growth promotion, including auxin production, phosphatase, and 1-aminocyclopropane-1-carboxylate deaminase.
Microbiol Resour Announc
January 2025
Institute of Grassland, Flowers and Ecology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China.
Here, we present the complete genome sequence of strain ZAPR22R, isolated from the petiole and tuber of calla lily (), infected with soft rot. The genome consists of a single chromosome (4,528,722 bp) with a G+C content of 41.1%.