Named entity recognition of rice genes and phenotypes based on BiGRU neural networks.

Comput Biol Chem

School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China. Electronic address:

Published: February 2024

Named Entity Recognition (NER) is a fundamental but crucial task in natural language processing (NLP) and big data analysis, with wide application range. NER for rice genes and phenotypes is a technique to identify genes and phenotypes from a large amount of text. NER for rice genes and phenotypes can facilitate the acquisition of information in the field of crops and provide references for our research on higher quality crops. At the same time, named entity recognition still faces many challenges. In this paper, we propose an improved bidirectional gated recurrent unit neural network (BI-GRU) method, which is used to automatically identify the required entities (i.e. gene names, rice phenotypes) from relevant rice literature and patents. The neural network model is combined with the Softmax function to directly output the probabilities of labels, forming the BI-GRU-SF model. With the ability of deep learning methods, the semantic information in the context can be learned without the need for feature engineering. Finally, we conducted experiments, and the results showed that our proposed model provided better performance compared to other models. All datasets and resource codes of BI-GRU-SF are available at https://github.com/qqeeqq/NER for academic use.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiolchem.2023.107977DOI Listing

Publication Analysis

Top Keywords

genes phenotypes
16
named entity
12
entity recognition
12
rice genes
12
ner rice
8
neural network
8
rice
5
phenotypes
5
recognition rice
4
genes
4

Similar Publications

Deletion of metal transporter Zip14 reduces major histocompatibility complex II expression in murine small intestinal epithelial cells.

Proc Natl Acad Sci U S A

January 2025

Center for Nutritional Sciences, Food Science and Human Nutrition Department, College of Agricultural and Life Sciences, University of Florida, Gainesville, FL 32611.

Documented worldwide, impaired immunity is a cardinal signature resulting from loss of dietary zinc, an essential micronutrient. A steady supply of zinc to meet cellular requirements is regulated by an array of zinc transporters. Deletion of the transporter Zip14 (Slc39a14) in mice produced intestinal inflammation.

View Article and Find Full Text PDF

Introduction: Varenicline is an α4β2 nicotinic acetylcholine receptor partial agonist with the highest therapeutic efficacy of any pharmacological smoking cessation aid and a 12-month cessation rate of 26%. Genetic variation may be associated with varenicline response, but to date no genome-wide association studies of varenicline response have been published.

Methods: In this study, we investigated the genetic contribution to varenicline effectiveness using two electronic health record-derived phenotypes.

View Article and Find Full Text PDF

Closed genomes of commercial inoculant rhizobia provide a blueprint for management of legume inoculation.

Appl Environ Microbiol

January 2025

Legume Rhizobium Sciences, Food Futures Institute, Murdoch University, Murdoch, Western Australia, Australia.

Unlabelled: Rhizobia are soil bacteria capable of establishing symbiosis within legume root nodules, where they reduce atmospheric N into ammonia and supply it to the plant for growth. Australian soils often lack rhizobia compatible with introduced agricultural legumes, so inoculation with exotic strains has become a common practice for over 50 years. While extensive research has assessed the N-fixing capabilities of these inoculants, their genomics, taxonomy, and core and accessory gene phylogeny are poorly characterized.

View Article and Find Full Text PDF

Vascular smooth muscle cell (SMC) relaxation by guanylyl cyclases (GCs) and cGMP is mediated by NO and its receptor soluble GC (sGC) or natriuretic peptides (NPs) ANP/BNP and CNP with the receptors GC-A and GC-B, respectively. It is commonly accepted that cultured SMCs differ from those in intact vessels. Nevertheless, cell culture often remains the first step for signaling investigations and drug testing.

View Article and Find Full Text PDF

Copy number variations (CNVs) have become widely acknowledged as a significant source of genomic variability and phenotypic variance. To understand the genetic variants in horses, CNVs from six Indian horse breeds, Manipuri, Zanskari, Bhutia, Spiti, Kathiawari and Marwari were discovered using Axiom Equine Genotyping Array. These breeds differed in agro-climatic adaptation with distinct phenotypic characters.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!