The curation of neuroscience entities is crucial to ongoing efforts in neuroinformatics and computational neuroscience, such as those being deployed in the context of continuing large-scale brain modelling projects. However, manually sifting through thousands of articles for new information about modelled entities is a painstaking and low-reward task. Text mining can be used to help a curator extract relevant information from this literature in a systematic way. We propose the application of text mining methods for the neuroscience literature. Specifically, two computational neuroscientists annotated a corpus of entities pertinent to neuroscience using active learning techniques to enable swift, targeted annotation. We then trained machine learning models to recognise the entities that have been identified. The entities covered are Neuron Types, Brain Regions, Experimental Values, Units, Ion Currents, Channels, and Conductances and Model organisms. We tested a traditional rule-based approach, a conditional random field and a model using deep learning named entity recognition, finding that the deep learning model was superior. Our final results show that we can detect a range of named entities of interest to the neuroscientist with a macro average precision, recall and F1 score of 0.866, 0.817 and 0.837 respectively. The contributions of this work are as follows: 1) We provide a set of Named Entity Recognition (NER) tools that are capable of detecting neuroscience entities with performance above or similar to prior work. 2) We propose a methodology for training NER tools for neuroscience that requires very little training data to get strong performance. This can be adapted for any sub-domain within neuroscience. 3) We provide a small corpus with annotations for multiple entity types, as well as annotation guidelines to help others reproduce our experiments.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6594987PMC
http://dx.doi.org/10.1007/s12021-018-9404-yDOI Listing

Publication Analysis

Top Keywords

text mining
12
deep learning
12
neuroscience
8
computational neuroscience
8
neuroscience entities
8
named entity
8
entity recognition
8
ner tools
8
entities
7
learning
5

Similar Publications

Identifying technologies in circular economy paradigm through text mining on scientific literature.

PLoS One

December 2024

Department of Energy, Systems, Territory, and Construction Engineering, Pisa, Italy.

Technological innovation serves as the catalyst for the shift towards circular practices. Technologies not only address technical challenges, facilitating the transition to a more circular economy, but they also enhance business efficiency and profitability. Furthermore, they promote inclusivity and create job opportunities, ultimately yielding positive societal impacts.

View Article and Find Full Text PDF

Methods: This is a mixed-method study using individual interviews (duration between 40-60 minutes) of 181 CNCP patients (71% females) in a tertiary Pain Care Unit, and applying the text mining methodology. Incomes (low or middle) and gender roles (productive vs. reproductive)".

View Article and Find Full Text PDF

Background: Health economic evaluations require cost data as a key input, and reimbursement policies and systems should incentivize valuable care. Subfertility is a growing global phenomenon, and Dutch per-treatment DRGs alone do not support value-based decision-making because they don't reflect patient-level variation or the impact of technologies on costs across entire patient pathways.

Methods: We present a real-world micro-costing analysis of subfertility patient pathways (n = 4.

View Article and Find Full Text PDF

Membrane engineering is a complex field involving the development of the most suitable membrane process for specific purposes and dealing with the design and operation of membrane technologies. This study analyzed 1424 articles on reverse osmosis (RO) membrane engineering from the Scopus database to provide guidance for future studies. The results show that since the first article was published in 1964, the domain has gained popularity, especially since 2009.

View Article and Find Full Text PDF

Digital-Tier Strategy Improves Newborn Screening for Glutaric Aciduria Type 1.

Int J Neonatal Screen

December 2024

Engineering Mathematics and Computing Lab (EMCL), Interdisciplinary Center for Scientific Computing (IWR), Heidelberg University, 69120 Heidelberg, Germany.

Glutaric aciduria type 1 (GA1) is a rare inherited metabolic disease increasingly included in newborn screening (NBS) programs worldwide. Because of the broad biochemical spectrum of individuals with GA1 and the lack of reliable second-tier strategies, NBS for GA1 is still confronted with a high rate of false positives. In this study, we aim to increase the specificity of NBS for GA1 and, hence, to reduce the rate of false positives through machine learning methods.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!