Traditional methods for deriving property-based representations of concepts from text have focused on either extracting only a subset of possible relation types, such as hyponymy/hypernymy (e.g., car is-a vehicle) or meronymy/metonymy (e.g., car has wheels), or unspecified relations (e.g., car--petrol). We propose a system for the challenging task of automatic, large-scale acquisition of unconstrained, human-like property norms from large text corpora, and discuss the theoretical implications of such a system. We employ syntactic, semantic, and encyclopedic information to guide our extraction, yielding concept-relation-feature triples (e.g., car be fast, car require petrol, car cause pollution), which approximate property-based conceptual representations. Our novel method extracts candidate triples from parsed corpora (Wikipedia and the British National Corpus) using syntactically and grammatically motivated rules, then reweights triples with a linear combination of their frequency and four statistical metrics. We assess our system output in three ways: lexical comparison with norms derived from human-generated property norm data, direct evaluation by four human judges, and a semantic distance comparison with both WordNet similarity data and human-judged concept similarity ratings. Our system offers a viable and performant method of plausible triple extraction: Our lexical comparison shows comparable performance to the current state-of-the-art, while subsequent evaluations exhibit the human-like character of our generated properties.

Download full-text PDF

Source
http://dx.doi.org/10.1111/cogs.12091DOI Listing

Publication Analysis

Top Keywords

large text
8
text corpora
8
lexical comparison
8
car
5
automatic extraction
4
extraction property
4
property norm-like
4
norm-like data
4
data large
4
corpora traditional
4

Similar Publications

Background And Aim: Visual data from images is essential for many medical diagnoses. This study evaluates the performance of multimodal Large Language Models (LLMs) in integrating textual and visual information for diagnostic purposes.

Methods: We tested GPT-4o and Claude Sonnet 3.

View Article and Find Full Text PDF

Background Understanding the attitudes and perceptions of the general population is necessary for organizing health promotion initiatives. During outbreaks, social media has a significant impact on creating social perceptions. This study aims to identify and examine the emotions expressed and topics of discussion among Indian citizens related to COVID-19 third wave, from the messages posted on Twitter using text mining techniques.

View Article and Find Full Text PDF

Mesenchymal stromal cells-derived extracellular vesicles in cartilage regeneration: potential and limitations.

Stem Cell Res Ther

January 2025

Grupo de Investigación en Terapia Celular y Medicina Regenerativa, Instituto de Investigación Biomédica de A Coruña (INIBIC), Fundación Pública Gallega de Investigación Biomédica INIBIC, Complexo Hospitalario Universitario de A Coruña (CHUAC), Servizo Galego de Saúde (SERGAS), A Coruña, 15006, Spain.

Background: Articular cartilage injuries can lead to pain, stiffness, and reduced mobility, and may eventually progress to osteoarthritis (OA). Despite substantial research efforts, effective therapies capable of regenerating cartilage are still lacking. Mesenchymal stromal cells (MSCs) are known for their differentiation and immunomodulatory capabilities, yet challenges such as limited survival post-injection and inconsistent therapeutic outcomes hinder their clinical application.

View Article and Find Full Text PDF

Enhancing doctor-patient communication using large language models for pathology report interpretation.

BMC Med Inform Decis Mak

January 2025

Department of Thoracic Surgery, Guizhou Provincial People's Hospital, No. 83, Zhongshan East Road, Guiyang, Guizhou, 550000, China.

Background: Large language models (LLMs) are increasingly utilized in healthcare settings. Postoperative pathology reports, which are essential for diagnosing and determining treatment strategies for surgical patients, frequently include complex data that can be challenging for patients to comprehend. This complexity can adversely affect the quality of communication between doctors and patients about their diagnosis and treatment options, potentially impacting patient outcomes such as understanding of their condition, treatment adherence, and overall satisfaction.

View Article and Find Full Text PDF

Pterosaurs were the first vertebrates to evolve active flight. The lack of many well-preserved pterosaur fossils limits our understanding of the functional anatomy and behavior of these flight pioneers, particularly from their early history (Triassic to Middle Jurassic). Here we describe in detail the osteology of an exceptionally preserved Middle Jurassic pterosaur, the holotype of Dearc sgiathanach from the Isle of Skye, Scotland.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!