Publications by authors named "Yi-Yu Hsu"

The BioRED track at BioCreative VIII calls for a community effort to identify, semantically categorize, and highlight the novelty factor of the relationships between biomedical entities in unstructured text. Relation extraction is crucial for many biomedical natural language processing (NLP) applications, from drug discovery to custom medical solutions. The BioRED track simulates a real-world application of biomedical relationship extraction, and as such, considers multiple biomedical entity types, normalized to their specific corresponding database identifiers, as well as defines relationships between them in the documents.

View Article and Find Full Text PDF

Background: To evaluate nasal carriage, antibiotic susceptibility and molecular characteristics of methicillin-resistant Staphylococcus aureus (MRSA), as well as the risk factors of MRSA colonization, in human immunodeficiency virus (HIV)-infected patients in northern Taiwan.

Methods: From September 2014 to November 2015, HIV-infected patients seeking outpatient care at four hospitals were eligible for this study. A nasal specimen was obtained from each subject for the detection of S.

View Article and Find Full Text PDF

Tracking scientific research publications on the evaluation, utility and implementation of genomic applications is critical for the translation of basic research to impact clinical and population health. In this work, we utilize state-of-the-art machine learning approaches to identify translational research in genomics beyond bench to bedside from the biomedical literature. We apply the convolutional neural networks (CNNs) and support vector machines (SVMs) to the bench/bedside article classification on the weekly manual annotation data of the Public Health Genomics Knowledge Base database.

View Article and Find Full Text PDF

The text-mining services for kinome curation track, part of BioCreative VI, proposed a competition to assess the effectiveness of text mining to perform literature triage. The track has exploited an unpublished curated data set from the neXtProt database. This data set contained comprehensive annotations for 300 human protein kinases.

View Article and Find Full Text PDF
Article Synopsis
  • - In response to the overwhelming amount of published articles in bio-literature, a new document triage system was developed to enhance the efficiency of retrieving relevant articles for biocurators, specifically focusing on human kinase proteins.
  • - The system utilizes text mining and various machine learning methods to extract detailed information for improving the curation process, leveraging tools for concept extraction and co-occurrence analysis.
  • - Evaluation results from BioCreative VI show that the system significantly outperforms the existing neXtA5 system, with improved metrics in retrieving articles related to disease and biological processes, indicating its potential to effectively assist biocurators.
View Article and Find Full Text PDF

Diseases play central roles in many areas of biomedical research and healthcare. Consequently, aggregating the disease knowledge and treatment research reports becomes an extremely critical issue, especially in rapid-growth knowledge bases (e.g.

View Article and Find Full Text PDF

Named-entity recognition (NER) plays an important role in the development of biomedical databases. However, the existing NER tools produce multifarious named-entities which may result in both curatable and non-curatable markers. To facilitate biocuration with a straightforward approach, classifying curatable named-entities is helpful with regard to accelerating the biocuration workflow.

View Article and Find Full Text PDF

Background: Determining the semantic relatedness of two biomedical terms is an important task for many text-mining applications in the biomedical field. Previous studies, such as those using ontology-based and corpus-based approaches, measured semantic relatedness by using information from the structure of biomedical literature, but these methods are limited by the small size of training resources. To increase the size of training datasets, the outputs of search engines have been used extensively to analyze the lexical patterns of biomedical terms.

View Article and Find Full Text PDF

In recent years, there was a rapid increase in the number of medical articles. The number of articles in PubMed has increased exponentially. Thus, the workload for biocurators has also increased exponentially.

View Article and Find Full Text PDF

A PHP Error was encountered

Severity: Warning

Message: fopen(/var/lib/php/sessions/ci_sessionkchgdod8ji6dlrbvgfb79flvp56mhd2g): Failed to open stream: No space left on device

Filename: drivers/Session_files_driver.php

Line Number: 177

Backtrace:

File: /var/www/html/index.php
Line: 316
Function: require_once

A PHP Error was encountered

Severity: Warning

Message: session_start(): Failed to read session data: user (path: /var/lib/php/sessions)

Filename: Session/Session.php

Line Number: 137

Backtrace:

File: /var/www/html/index.php
Line: 316
Function: require_once