Towards User-centered Corpus Development: Lessons Learnt from Designing and Developing MedTator.

AMIA Annu Symp Proc

Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, USA.

Published: May 2023

A gold standard annotated corpus is usually indispensable when developing natural language processing (NLP) systems. Building a high-quality annotated corpus for clinical NLP requires considerable time and domain expertise during the annotation process. Existing annotation tools may provide powerful features to cover various needs of text annotation tasks, but the target end users tend to be trained annotators. It is challenging for clinical research teams to utilize those tools in their projects due to various factors such as the complexity of advanced features and data security concerns. To address those challenges, we developed MedTator, a serverless web-based annotation tool with an intuitive user-centered interface aiming to provide a lightweight solution for the core tasks in corpus development. Moreover, we present three lessons learned from the designing and developing MedTator, which will contribute to the research community's knowledge for future open-source tool development.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10148300PMC

Publication Analysis

Top Keywords

corpus development
8
designing developing
8
developing medtator
8
annotated corpus
8
user-centered corpus
4
development lessons
4
lessons learnt
4
learnt designing
4
medtator gold
4
gold standard
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!