Normalization of gene/protein names in biological literatures using Vector-Space Model.

Annu Int Conf IEEE Eng Med Biol Soc

Lifeinformatics Team, Electronics and Telecommunication Research Institute, Gajeong-Dong, Yuseon-Gu, Daejeon, 305-700, Korea.

Published: March 2008

As the volume of biological literature grows exponentially, the need for text mining systems increases. In text mining, normalization is the task of mapping gene/protein names to entries in a database. It is necessary for combining information extracted from different publications and for building a database or an ontology from the literature. Previous normalization research relied on direct comparison between database entries and the names found in the literature, an approach that is weak against the highly variable gene/protein names that appear in text. In this paper, we therefore propose a normalization method based on the Vector-Space Model. For each gene/protein name, we rank identifiers using the Vector-Space Model and select the identifier most similar to the name. Experimental results show that the proposed method achieves an F-measure of 70.7%.
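
The ranking step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the features or term weighting used, so everything below is an assumption made for illustration: each identifier is represented by a TF-IDF vector built from the tokens of its synonyms, a mention is compared by cosine similarity, and the identifiers, synonyms, and function names (`tokenize`, `build_index`, `rank_identifiers`) are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into runs of letters or digits."""
    return re.findall(r"[a-z]+|\d+", text.lower())


def build_index(lexicon):
    """Build TF-IDF vectors, one per identifier.

    lexicon maps an identifier to a list of synonym strings (assumed layout).
    Returns (idf, vectors), where vectors maps identifier -> {token: weight}.
    """
    term_counts = {
        ident: Counter(tok for name in names for tok in tokenize(name))
        for ident, names in lexicon.items()
    }
    n_docs = len(term_counts)
    doc_freq = Counter(tok for counts in term_counts.values() for tok in counts)
    # Smoothed IDF; the weighting scheme is an assumption, not the paper's.
    idf = {tok: math.log((1 + n_docs) / (1 + df)) + 1.0 for tok, df in doc_freq.items()}
    vectors = {
        ident: {tok: freq * idf[tok] for tok, freq in counts.items()}
        for ident, counts in term_counts.items()
    }
    return idf, vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(tok, 0.0) for tok, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_identifiers(mention, idf, vectors):
    """Return (score, identifier) pairs sorted from most to least similar."""
    counts = Counter(tokenize(mention))
    # Tokens never seen in the lexicon carry no weight in the query vector.
    query = {tok: freq * idf[tok] for tok, freq in counts.items() if tok in idf}
    scored = [(cosine(query, vec), ident) for ident, vec in vectors.items()]
    return sorted(scored, reverse=True)


# Hypothetical toy lexicon; the identifiers are placeholders, not real database IDs.
LEXICON = {
    "ID:1": ["tumor protein p53", "TP53", "p53"],
    "ID:2": ["tumor necrosis factor", "TNF", "TNF-alpha"],
    "ID:3": ["interleukin 6", "IL6", "IL-6"],
}

if __name__ == "__main__":
    idf, vectors = build_index(LEXICON)
    for score, ident in rank_identifiers("IL 6", idf, vectors):
        print(f"{ident}\t{score:.3f}")
```

Because similarity is computed over token vectors rather than whole strings, a mention such as "IL 6" can still match an identifier whose synonyms are written "IL6" or "IL-6", which is the kind of surface variation that direct string comparison handles poorly.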

Source
http://dx.doi.org/10.1109/IEMBS.2007.4352306

