A corpus of plant-disease relations in the biomedical domain.

PLoS One

School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Buk-gu, Gwangju, South Korea.

Published: March 2020

Background: Many new medicines have been derived from natural sources such as plants, which have a long history of use in disease treatment. Their benefits and side effects have therefore been studied, and plant-related information, including plant-disease relations, has accumulated in Medline articles. Because these articles are numerous and written in natural language, text mining is needed to extract this information. However, no corpus of plant-disease relations is available yet. We therefore aimed to construct such a corpus.

Methods and Results: In this study, we designed and annotated a plant-disease relations corpus and proposed a computational model to predict plant-disease relations using the corpus. We categorized plant and disease relations into four types: treatments of diseases, causes of diseases, associations, and negative relations. To construct the corpus, we first created annotation guidelines and randomly selected 200 Medline abstracts. From these abstracts, we identified 1,405 plant mentions and 1,755 disease mentions, annotated to 105 unique plant identifiers and 237 unique disease identifiers, respectively. Restricting the data to sentences containing at least one plant and one disease mention yielded 878 plant and 1,077 disease entities and a final corpus of 1,309 relations from 199 abstracts. To verify the effectiveness of the corpus, we proposed a convolutional neural network model with the shortest dependency path (SDP-CNN) and applied it to the constructed corpus. The micro F-score with ten-fold cross-validation was 0.764. We also applied the proposed SDP-CNN model to all Medline abstracts. On 483 randomly selected plant-disease co-occurring sentences, the model achieved a precision of 0.707.
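To illustrate the shortest dependency path (SDP) idea mentioned above, the following is a minimal sketch rather than the authors' implementation. It assumes a dependency parse is already available as (head, dependent) pairs; the example sentence, the hand-written edges, and the function name `shortest_dependency_path` are all hypothetical. The path between a plant mention and a disease mention is found by breadth-first search over the parse treated as an undirected graph.

```python
from collections import deque


def shortest_dependency_path(edges, source, target):
    """Return the tokens on the shortest path from source to target, or None.

    Assumption: every token string is unique within the sentence, so tokens
    can serve directly as graph node identifiers.
    """
    # Treat the dependency tree as an undirected graph.
    graph = {}
    for head, dependent in edges:
        graph.setdefault(head, set()).add(dependent)
        graph.setdefault(dependent, set()).add(head)

    # Breadth-first search, remembering each token's predecessor.
    previous = {source: None}
    queue = deque([source])
    while queue:
        token = queue.popleft()
        if token == target:
            # Walk the predecessor links back to the source.
            path = []
            while token is not None:
                path.append(token)
                token = previous[token]
            return path[::-1]
        for neighbour in sorted(graph.get(token, ())):
            if neighbour not in previous:
                previous[neighbour] = token
                queue.append(neighbour)
    return None


# Hypothetical parse of "Ginseng extract reduced symptoms of diabetes in rats".
EDGES = [
    ("reduced", "extract"),
    ("extract", "Ginseng"),
    ("reduced", "symptoms"),
    ("symptoms", "diabetes"),
    ("diabetes", "of"),
    ("reduced", "rats"),
    ("rats", "in"),
]

path = shortest_dependency_path(EDGES, "Ginseng", "diabetes")
print(" -> ".join(path))
```

In an SDP-based model, only the tokens on this path (here, skipping "of", "in", and "rats") would be passed to the classifier, which keeps the words most likely to express the relation between the two entities.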

Conclusion: The plant-disease relations corpus is unique and represents an important resource for biomedical text-mining. The corpus of plant and disease relations is available at http://gcancer.org/pdr/.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6713337
PLOS: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0221582

Publication Analysis

Top Keywords

plant disease: 28
plant-disease relations: 24
disease relations: 16
corpus plant-disease: 12
relations: 12
relations corpus: 12
corpus: 10
disease: 9
plant: 8
text-mining corpus: 8

Similar Publications

The use of biological control agents is one of the best strategies available to combat plant diseases in an eco-friendly manner. Biocontrol bacteria capable of benefiting crop plant growth and health have been developed for several decades. This highlights the need for a deeper understanding of the colonization mechanisms employed by biocontrol bacteria to enhance their efficacy in plant pathogen control.

Alzheimer's disease (AD) is a common central neurodegenerative disorder characterized primarily by cognitive impairment and non-cognitive neuropsychiatric symptoms that significantly impact patients' daily lives and behavioral functioning. The pathogenesis of AD remains unclear, and current Western medicine treatments are purely symptomatic, with a single pathway, limited efficacy, and substantial toxicity and side effects. In recent years, as research into AD has deepened, there has been a gradual increase in the exploration and application of medicinal plants for the treatment of AD.

Wheat viruses are major yield-reducing factors, with mixed infections causing substantial economic losses. Determining field virus populations is crucial for effective management and developing virus-resistant cultivars. This study utilized the high-throughput Oxford Nanopore sequencing technique (ONT) to characterize wheat viral populations in major wheat-growing counties of Kansas from 2019 to 2021.

, a medicinal herbaceous plant documented in the Chinese Pharmacopoeia, is a promising candidate for research into plant-derived pharmaceuticals. However, the study of newly emerging viruses that threaten the cultivation of remains limited. In this study, plants exhibiting symptoms such as leaf yellowing, mottled leaves, and vein chlorosis were collected and subjected to RNA sequencing to identify potential viral pathogens.

A Survey of Wild Indigenous Orchid Populations in Western Australia Reveals Spillover of Exotic Viruses.

Viruses

January 2025

School of Medical, Molecular and Forensic Sciences, College of Environmental and Life Sciences, Murdoch University, 90 South Street, Perth 6150, Australia.

is a terrestrial orchid endemic to southwestern Australia. The virus status of has not been studied. Eighty-three samples from 16 populations were collected, and sequencing was used to identify RNA viruses from them.
