Ever-larger molecular databases and the need to integrate data scalably present a major challenge for the use of phenotypic data. Morphology is currently described primarily in discrete publications, locked in text that is not computer-readable, and integrating it across large numbers of taxa and studies requires enormous investments of time and resources. Here we present a new methodology, using ontology-based reasoning systems working with the Phenoscape Knowledgebase (KB; kb.phenoscape.org), to automatically integrate large amounts of evolutionary character state descriptions into a synthetic character matrix of neomorphic (presence/absence) data. Using the KB, which includes more than 55 studies of sarcopterygian taxa, we generated a synthetic supermatrix of 639 variable characters scored for 1051 taxa, resulting in over 145,000 populated cells. Of these characters, over 76% were made variable through the addition of inferred presence/absence states derived by machine reasoning over the formal semantics of the source ontologies. Inferred data reduced the missing data in the variable-character subset from 98.5% to 78.2%. Machine reasoning also enables the isolation of conflicts in the data, that is, cells where both presence and absence are indicated; reports on the provenance of conflicting data can be generated automatically. Further, reasoning enables quantification and new visualizations of the data, here, for example, allowing identification of character space that has been undersampled across the fin-to-limb transition. The approach and methods demonstrated here to compute synthetic presence/absence supermatrices are applicable to any taxonomic and phenotypic slice across the tree of life, provided the data are semantically annotated. Because such data can also be linked to model organism genetics through computational scoring of phenotypic similarity, they open a rich set of future research questions into phenotype-to-genome relationships.
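The inference and conflict-detection steps described above can be illustrated with a minimal sketch. It is not the Phenoscape KB's reasoner or API: the three-term part-of hierarchy, the taxon names, and the helper functions are all hypothetical, and the two rules used (presence of a part implies presence of the whole; absence of a whole implies absence of its parts) are a simplified assumption about the kind of inference an ontology supports.

```python
from itertools import product

# Hypothetical toy ontology: each structure maps to the structure it is part of.
PART_OF = {"digit": "autopod", "autopod": "limb"}

def ancestors(entity):
    """Yield every structure that `entity` is (transitively) part of."""
    while entity in PART_OF:
        entity = PART_OF[entity]
        yield entity

def descendants(entity):
    """Return every structure that is (transitively) part of `entity`."""
    return [e for e in PART_OF if entity in ancestors(e)]

def infer(asserted):
    """Add inferred states to asserted {(taxon, entity): {"present", "absent"}} cells."""
    cells = {key: set(states) for key, states in asserted.items()}
    for (taxon, entity), states in asserted.items():
        if "present" in states:  # assumed rule: a part is present, so the whole is present
            for whole in ancestors(entity):
                cells.setdefault((taxon, whole), set()).add("present")
        if "absent" in states:   # assumed rule: the whole is absent, so its parts are absent
            for part in descendants(entity):
                cells.setdefault((taxon, part), set()).add("absent")
    return cells

def missing_fraction(cells, taxa, entities):
    """Fraction of taxon-by-character cells that have no state at all."""
    filled = sum(1 for key in product(taxa, entities) if key in cells)
    return 1 - filled / (len(taxa) * len(entities))

def conflicts(cells):
    """Cells where both presence and absence are indicated."""
    return sorted(key for key, states in cells.items() if {"present", "absent"} <= states)

# Hypothetical asserted annotations, as if taken from separate publications.
taxa = ["Taxon A", "Taxon B", "Taxon C"]
entities = ["digit", "autopod", "limb"]
asserted = {
    ("Taxon A", "digit"): {"present"},
    ("Taxon B", "limb"): {"absent"},
    ("Taxon C", "autopod"): {"absent"},
    ("Taxon C", "digit"): {"present"},
}

inferred = infer(asserted)
print("missing before:", round(missing_fraction(asserted, taxa, entities), 3))
print("missing after: ", round(missing_fraction(inferred, taxa, entities), 3))
print("conflicting cells:", conflicts(inferred))
```

In this toy case the two contradictory assertions for Taxon C surface as conflicting cells, which is the situation in which a provenance report would point back to the source publications.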


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4604830
DOI: http://dx.doi.org/10.1093/sysbio/syv031


