Preference of simple sequence repeats in coding and non-coding regions of Arabidopsis thaliana.

Bioinformatics

Plant Biotechnology Research Center, Fudan-SJTU-Nottingham Plant Biotechnology R&D Center, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200030, People's Republic of China.

Published: May 2004

Motivation: Simple sequence repeats (SSRs), or microsatellites, are abundant in many genomes. However, the significance of their distribution preferences is not fully understood. The completed Arabidopsis genome sequence makes it possible to characterize microsatellites more thoroughly.

Results: Microsatellites were more abundant in the 5'-flanking regions of genes than expected from their frequency in the whole genome, with AG and AAG repeats over-represented. This distribution differed clearly from those in 3'-flanks and coding regions, where triplet frequencies corresponded to codon usage. We identified 1140 full-length genes with at least one AG or AAG repeat locus in their upstream sequences; the functional characteristics of these genes were significantly associated with the repeats. These observations indicate that selective pressure differed markedly among the three transcribed regions, with positive selection for AG and AAG repeats in the 5'-flanks of genes whose products are preferentially involved in transcription.
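
As an illustration of how AG and AAG repeat loci of the kind described above might be located in an upstream sequence, the following is a minimal Python sketch. It is not the authors' method: the abstract does not state their detection criteria, so the minimum copy numbers in MIN_REPEATS, the function names, and the choice to merge a motif with its rotations and reverse complement (so that CT is counted as AG and TTC as AAG) are all assumptions made for this example.

```python
import re
from collections import Counter

# Hypothetical thresholds (not from the paper): minimum number of
# tandem copies required for each motif length.
MIN_REPEATS = {2: 6, 3: 4}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical_motif(motif):
    """Return one name for a motif, its rotations and its reverse complement."""
    revcomp = motif.translate(_COMPLEMENT)[::-1]
    candidates = []
    for m in (motif, revcomp):
        candidates.extend(m[i:] + m[:i] for i in range(len(m)))
    return min(candidates)


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, canonical motif, copy number) tuples for perfect repeats."""
    seq = seq.upper()
    hits = []
    for size, min_n in min_repeats.items():
        # A motif of `size` bases followed by at least (min_n - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_n - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if len(set(motif)) == 1:
                continue  # skip homopolymer runs such as AAAAAA
            copies = len(match.group(0)) // size
            hits.append((match.start(), canonical_motif(motif), copies))
    return sorted(hits)


def count_motifs(hits):
    """Tally how many loci were found for each canonical motif."""
    return Counter(motif for _, motif, _ in hits)


if __name__ == "__main__":
    # Toy sequence, invented for illustration only.
    upstream = "TTT" + "AG" * 7 + "CCGT" + "AAG" * 5 + "TACG"
    for start, motif, copies in find_ssrs(upstream):
        print(start, motif, copies)
```

Applied to the upstream sequence of every annotated gene, a tally from count_motifs could then be compared with the same tally for 3'-flanks and coding sequences; only perfect, uninterrupted repeats are detected by this sketch.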

Source
http://dx.doi.org/10.1093/bioinformatics/bth043

Publication Analysis

Top Keywords

aag repeats: 12
simple sequence: 8
sequence repeats: 8
repeats: 6
preference simple: 4
repeats coding: 4
coding non-coding: 4
non-coding regions: 4
regions arabidopsis: 4
arabidopsis thaliana: 4

Similar Publications

Development of Roselle (Hibiscus sabdariffa L.) Transcriptome-Based Simple Sequence Repeat Markers and Their Application in Roselle.

Plants (Basel)

December 2024

Key Laboratory of Ministry of Education for Genetics, Breeding and Multiple Utilization of Crops, Fujian Agriculture and Forestry University, Fuzhou 350002, China.

Roselle (Hibiscus sabdariffa L.) simple sequence repeat (SSR) markers were developed using RNA sequencing technology, providing a foundation for genetic analysis and the identification of roselle varieties. In this study, 10 785 unigenes containing 12 994 SSR loci with an average of one SSR locus per 6.

Genome-wide identification of SSR markers from coding regions for endangered Argania spinosa L. skeels and construction of SSR database: AsSSRdb.

Database (Oxford)

November 2024

Microbiology and Molecular Biology Team, Center of Plant and Microbial Biotechnologies, Biodiversity and Environment, Faculty of Sciences, Mohammed V University, 4 Avenue Ibn Batouta, B.P. 1014, Rabat 10000, Morocco.

Article Synopsis
  • Microsatellites, or simple sequence repeats (SSRs), are important genetic markers in plants, but a thorough identification in the Argania spinosa L. genome has not been done until now.
  • This study identified 5351 SSRs from 66,280 coding sequences (CDSs) in A. spinosa, with tri-nucleotide motifs being the most common.
  • An online database (AsSSRdb) was created to share these SSRs, providing resources for DNA fingerprinting and genetic studies in argan and similar species.

Interleukin 1 receptor antagonist (IL1RN) is a competitive inhibitor of interleukin 1 (IL-1). Natural killer cells (NK cells) contribute to the elimination of viruses through their antiviral effector function, which depends on a balance between inhibitory and activating receptor genes such as NKG2D and NKG2A. Using polymerase chain reaction restriction fragment length polymorphism (PCR-RFLP) assays, the association of intronic single-nucleotide polymorphisms (SNPs) in these genes with viral infection was assessed in 111 patients with hepatitis E virus (HEV) infection and 222 HEV-naive healthy controls.

First genome assembly and characterization of .

Front Plant Sci

October 2024

School of Pharmaceutical Sciences, Yunnan Key Laboratory of Pharmacology for Natural Products, and Yunnan College of Modern Biomedical Industry, Kunming Medical University, Kunming, Yunnan, China.

Article Synopsis
  • Kalm ex L. is an evergreen shrub valuable for its methyl salicylate content and ornamental and medicinal properties, but lacks comprehensive genomic data, prompting this study.
  • Researchers conducted high-throughput sequencing to assemble the genome, obtaining 417 Mb of data with high quality (47.94 Gb) and identifying over 26,000 protein-coding genes and numerous SSRs (simple sequence repeats).
  • The study also identified thousands of transcription factors, transcription regulators, and protein kinases, while performing phylogenetic analyses to gain insights into the genetic relationships among species.
Article Synopsis
  • Kunze Wiltshire 1933 is a fungus that causes leaf blotch disease in plants, and the study focused on assembling its mitochondrial genome from isolate AT-1224.
  • The mitochondrial genome has a size of 57,475 bp, featuring 12 coding genes, 15 hypothetical proteins, 34 tRNA genes, and 2 rRNA genes, along with various repetitive sequences.
  • Phylogenetic analysis indicated close relationships with two species and placed it in a sister clade with six other species, providing important insights for future identification and evolutionary studies related to this fungus in apple orchards.