Leveraging LLaMA2 for improved document classification in English.

PeerJ Comput Sci

School of Foreign Languages, Taizhou University, Taizhou City, Jiangsu Province, China.

Published: February 2025

Document classification is an important component of natural language processing, with applications that include sentiment analysis, content recommendation, and information retrieval. This article investigates the potential of Large Language Model Meta AI (LLaMA2), a cutting-edge language model, to enhance document classification in English. Our experiments show that LLaMA2 outperforms traditional classification methods, achieving higher precision and recall values on the WOS-5736 dataset. Additionally, we analyze the interpretability of LLaMA2's classification process to reveal the most pertinent features for categorization and the model's decision-making. These results emphasize the potential of advanced language models to enhance classification outcomes and provide a more profound comprehension of document structures, thereby contributing to the advancement of natural language processing methodologies.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11888901
DOI: http://dx.doi.org/10.7717/peerj-cs.2740

Publication Analysis

Top Keywords

document classification: 12
classification english: 8
natural language: 8
language processing: 8
language model: 8
classification: 6
language: 5
leveraging llama2: 4
llama2 improved: 4
document: 4

Similar Publications

Neuroendocrine tumors (NET) of the lung constitute a rare entity of primary lung malignancies that often exhibit an indolent clinical course. Epigenetics-related differences have been described previously for lung NET, but the clinical significance remains unclear. In this study, we performed genome-wide methylation analysis using the Infinium MethylationEPIC BeadChip technology on FFPE tissues from lung NET treated at two academic centers.

Classifying Continuous Glucose Monitoring Documents From Electronic Health Records.

J Diabetes Sci Technol

March 2025

Department of Population Health, Grossman School of Medicine, New York University, New York, NY, USA.

Background: Clinical use of continuous glucose monitoring (CGM) is increasing storage of CGM-related documents in electronic health records (EHR); however, the standardization of CGM storage is lacking. We aimed to evaluate the sensitivity and specificity of CGM Ambulatory Glucose Profile (AGP) classification criteria.

Methods: We randomly chose 2244 (18.

Background: Research on patients with persistent symptoms despite prior treatment for Lyme disease can be challenging to interpret given the diversity of criteria selected to characterize Lyme disease and to define the syndrome of those with persistent symptoms. Because most research studies only include patients with well-documented prior Lyme disease, the generalizability of the study results is limited, excluding the larger group of patients often seen in community practice who do not meet these stringent enrollment criteria. Researchers at the Lyme and other Tick-borne Diseases Clinical Trials Network (LTD-CTN) recognized early on that a research classification system was needed to facilitate the design of studies that are more inclusive.

An evaluation of Resistance Spot Welding (RSW) that guarantees satisfactory mechanical performance without altering physical properties can be achieved by modeling the input parameters with which each unit was built, such as current, welding time, and applied force, and correlating them with digital images of the surface and infrared images that make it possible to identify variations in the parameters that modify the quality of the welding spot [1]. In this way, mechanical and surface characteristics can be detected without a mechanical test that modifies the structure of the unit. The database serves as a comprehensive record of the welding spot process, including the monitoring of crucial input parameters such as current and force.

Diversity, management, and uses of edible plants in a Ñäñho community of Southern Querétaro, Mexico.

J Ethnobiol Ethnomed

March 2025

Instituto de Ecología, A. C., Centro Regional del Bajío, Av. Lázaro Cárdenas 253, CP 61600, Pátzcuaro, Michoacán, Mexico.

Background: Mexico is one of the countries with the highest cultural, biological, and agrobiological diversity. However, an accelerated loss of ancestral knowledge about the management of agrobiodiversity, native seeds, and other edible plant species is affecting food sovereignty. This loss of knowledge was documented in the Ñäñho region of southern Querétaro, where our study took place.
