Computing on Phenotypic Descriptions for Candidate Gene Discovery and Crop Improvement.

Plant Phenomics

Interdepartmental Bioinformatics and Computational Biology, Iowa State University, Ames, IA 50011, USA.

Published: May 2020

Many newly observed phenotypes are first described, then experimentally manipulated. These language-based descriptions appear in both the literature and in community datastores. To standardize phenotypic descriptions and enable simple data aggregation and analysis, controlled vocabularies and specific data architectures have been developed. Such simplified descriptions have several advantages over natural language: they can be rigorously defined for a particular context or problem, they can be assigned and interpreted programmatically, and they can be organized in a way that allows for semantic reasoning (inference of implicit facts). Because researchers generally report phenotypes in the literature using natural language, curators have been translating phenotypic descriptions into controlled vocabularies for decades to make the information computable. Unfortunately, this methodology is highly dependent on human curation, which does not scale to the scope of all publications available across all of plant biology. Simultaneously, researchers in other domains have been working to enable computation on natural language. This has resulted in new, automated methods for computing on language that are now available, with early analyses showing great promise. Natural language processing (NLP) coupled with machine learning (ML) allows for the use of unstructured language for direct analysis of phenotypic descriptions. Indeed, we have found that these automated methods can be used to create data structures that perform as well or better than those generated by human curators on tasks such as predicting gene function and biochemical pathway membership. Here, we describe current and ongoing efforts to provide tools for the plant phenomics community to explore novel predictions that can be generated using these techniques. We also describe how these methods could be used along with mobile speech-to-text tools to collect and analyze in-field spoken phenotypic descriptions for association genetics and breeding applications.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7706311PMC
http://dx.doi.org/10.34133/2020/1963251DOI Listing

Publication Analysis

Top Keywords

phenotypic descriptions
20
natural language
16
controlled vocabularies
8
automated methods
8
descriptions
7
language
6
computing phenotypic
4
descriptions candidate
4
candidate gene
4
gene discovery
4

Similar Publications

Objectives: We hypothesized that semiquantitative visual scoring of lung MRI is suitable for GOLD-grade specific characterization of parenchymal and airway disease in COPD and that MRI scores correlate with quantitative CT (QCT) and pulmonary function test (PFT) parameters.

Methods: Five hundred ninety-eight subjects from the COSYCONET study (median age = 67 (60-72)) at risk for COPD or with GOLD1-4 underwent PFT, same-day paired inspiratory/expiratory CT, and structural and contrast-enhanced MRI. QCT assessed total lung volume (TLV), emphysema, and air trapping by parametric response mapping (PRM, PRM) and airway disease by wall percentage (WP).

View Article and Find Full Text PDF

Clinical and preclinical evidence of psilocybin as antidepressant. A narrative review.

Prog Neuropsychopharmacol Biol Psychiatry

January 2025

Department of Pharmacology, University of the Basque Country UPV/EHU, Leioa, Bizkaia, Spain; Centro de Investigación Biomédica en Red de Salud Mental, Instituto de Salud Carlos III, Spain; Biobizkaia Health Research Institute, Barakaldo, Bizkaia, Spain. Electronic address:

In the rapidly growing field of psychedelic research, psilocybin (and active metabolite psilocin) has been proposed as a promising candidate in the search for novel treatments for neuropsychiatric disorders. Clinical trials have revealed that psilocybin has a large, rapid, and persistent effect in the improvement of symptoms of depression and anxiety. The safety profile is considered favourable, with low toxicity and good tolerance.

View Article and Find Full Text PDF

Background: Frailty is a recognisable clinical measure of impaired physiological reserve and vulnerability to adverse outcomes that is validated among patients with kidney disease. Practice patterns reveal inconsistent use of objective frailty measures by nephrologists, with clinicians prioritising subjective clinical impressions, possibly risking misclassification and discrimination.

Aims: The aim of this study was to examine correlations between subjective and objective measures of frailty in a cohort of patients attending routine nephrologist review.

View Article and Find Full Text PDF

Familial hypercholesterolemia (FH) is a genetic disease, usually with onset during childhood, characterized by elevated blood LDL cholesterol levels and potentially associated with severe cardiovascular complications. Concerning mutated genes in FH, such as , a small subset of FH patients presents a homozygous genotype, resulting in homozygous FH (HoFH) disease with a generally aggressive phenotype. Besides statins, ezetimibe and PCSK9 inhibitors, lomitapide (an anti-ApoB therapy) was also approved in 2012-2013 as an adjunctive treatment for HoFH.

View Article and Find Full Text PDF

The Candida Genome Database (CGD; www.candidagenome.org) is unique in being both a model organism database and a fungal pathogen database.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!