Automatic learning of the morphology of medical language using information compression.

AMIA Annu Symp Proc

Department of Biomedical Informatics, Columbia University, New York, NY, USA.

Published: December 2004

Conversion of free-text strings in a natural language to a standard representation (codes) is an important reoccurring problem in biomedical informatics. Determining the content of a string involves identifying its meaningful constituents (morphemes). One current method of identifying these constituents is to look them up in a preexisting table (lexicon). Manual construction of lexicons and grammars in complex domains such as biomedicine is extremely laborious. As an alternative to the lexico-grammatical approach, we introduce a segmentation algorithm that automatically learns lexical and structural preferences from corpora via information compression. The method is based on the Minimum Description Length (MDL) principle from classic information theory.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1480252PMC

Publication Analysis

Top Keywords

automatic learning
4
learning morphology
4
morphology medical
4
medical language
4
language compression
4
compression conversion
4
conversion free-text
4
free-text strings
4
strings natural
4
natural language
4

Similar Publications

Nursing activity recognition has immense importance in the development of smart healthcare management and is an extremely challenging area of research in human activity recognition. The main reasons are an extreme class-imbalance problem and intra-class variability depending on both the subject and the recipient. In this paper, we apply a unique two-step feature extraction, coupled with an intermediate feature 'Angle' and a new feature called mean min max sum to render the features robust against intra-class variation.

View Article and Find Full Text PDF

Manual segmentation of lesions, required for radiotherapy planning and follow-up, is time-consuming and error-prone. Automatic detection and segmentation can assist radiologists in these tasks. This work explores the automated detection and segmentation of brain metastases (BMs) in longitudinal MRIs.

View Article and Find Full Text PDF

Background: Automatic classification of arrhythmias based on electrocardiography (ECG) data faces several significant challenges, particularly due to the substantial volume of clinical data involved in ECG signal analysis. The volume of clinical data has increased considerably, especially with the emergence of new clinical symptoms and signs in various arrhythmia conditions. These symptoms and signs, which serve as distinguishing features, can number in the tens of thousands.

View Article and Find Full Text PDF

Towards safe and reliable deep learning for lung nodule malignancy estimation using out-of-distribution detection.

Comput Biol Med

December 2024

Diagnostic Imaging Analysis Group, Medical Imaging Department, Radboud University Medical Center, Geert Grooteplein Zuid 10, 6525 GA, Nijmegen, the Netherlands.

Artificial Intelligence (AI) models may fail or suffer from reduced performance when applied to unseen data that differs from the training data distribution, referred to as dataset shift. Automatic detection of out-of-distribution (OOD) data contributes to safe and reliable clinical implementation of AI models. In this study, we propose a recognized OOD detection method that utilizes the Mahalanobis distance (MD) and compare its performance to widely known classical methods.

View Article and Find Full Text PDF

Background: Wireless capsule endoscopy (WCE) has become an important noninvasive and portable tool for diagnosing digestive tract diseases and has been propelled by advancements in medical imaging technology. However, the complexity of the digestive tract structure, and the diversity of lesion types, results in different sites and types of lesions distinctly appearing in the images, posing a challenge for the accurate identification of digestive tract diseases.

Aim: To propose a deep learning-based lesion detection model to automatically identify and accurately label digestive tract lesions, thereby improving the diagnostic efficiency of doctors, and creating significant clinical application value.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!