Chinese technical terminology extraction based on DC-value and information entropy.

Sci Rep

School of Management and Engineering, Capital University of Economics and Business, Beijing, 100070, China.

Published: November 2022

China's technology is developing rapidly, and the number of patent applications has surged. Therefore, there is an urgent need for technical managers and researchers that how to apply computer technology to conduct in-depth mining and analysis of lots of Chinese patent documents to efficiently use patent information, perform technological innovation and avoid R&D risks. Automatic term extraction is the basis of patent mining and analysis, but many existing approaches focus on extracting domain terms in English, which are difficult to extend to Chinese due to the distinctions between Chinese and English languages. At the same time, some common Chinese technical terminology extraction methods focus on the high-frequency characteristics, while technical domain correlation characteristic and the unithood feature of terminology are given less attention. Aiming at these problems, this paper proposes a Chinese technical terminology method based on DC-value and information entropy to achieve automatic extraction of technical terminology in Chinese patents. The empirical results show that the presented algorithm can effectively extract the technical terminology in Chinese patent literatures and has a better performance than the C-value method, the log-likelihood ratio method and the mutual information method, which has theoretical significance and practical application value.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9681760PMC
http://dx.doi.org/10.1038/s41598-022-23209-6DOI Listing

Publication Analysis

Top Keywords

technical terminology
20
chinese technical
12
chinese
8
terminology extraction
8
based dc-value
8
dc-value entropy
8
mining analysis
8
chinese patent
8
terminology chinese
8
terminology
6

Similar Publications

Purpose: The parents of children who are deaf or hard-of-hearing may require a spoken language interpreter to access early-intervention services. This research sought to describe speech-language pathologists' perspectives regarding collaboration with interpreters in this space.

Method: Twenty-seven speech-language pathologists working in Australia completed a cross-sectional mixed-method online survey.

View Article and Find Full Text PDF

Distinguishing between endo- and exo-type enzymes within the glycoside hydrolase (GH) classification presents significant challenges. Traditional methods, often based on endpoint activity measurements, do not capture the full range of products generated, leading to inconsistencies in classification. Not all exo-acting fructanases and glucanases produce monosaccharides (like fructose or glucose), while endo-acting enzymes do not solely produce higher-degree polymerization oligosaccharides.

View Article and Find Full Text PDF

Background/objectives: The increasing medical and nursing care complexity in hospitalized children represents a significant challenge for healthcare systems. However, the link between these two dimensions remains partially explored. This study aims to decipher the relationship between Diagnosis-Related Group (DRG) weight and nursing care complexity in hospitalized children and to identify the determinants of medical complexity.

View Article and Find Full Text PDF

: Earlier detection of severe immune-related hematological adverse events (irHAEs) in cancer patients treated with a PD-1 or PD-L1 inhibitor is critical to improving treatment outcomes. The study aimed to develop a simple machine learning (ML) model for predicting irHAEs associated with PD-1/PD-L1 inhibitors. : We utilized the Observational Medical Outcomes Partnership-Common Data Model based on electronic medical records from a tertiary (KHMC) and a secondary (KHNMC) hospital in South Korea.

View Article and Find Full Text PDF

Drug development is a lengthy process with considerable uncertainty at each milestone. Several trials are needed to progress to confirmatory evaluation and establish a positive benefit-risk balance. One of the critical milestones is the decision to progress to phase III based on phase II trial results.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!