Captioning Ultrasound Images Automatically.

Mohammad Alsharid Harshita Sharma Lior Drukker Pierre Chatelain Aris T Papageorghiou J Alison Noble

Med Image Comput Comput Assist Interv

Published: October 2019

We describe an automatic natural language processing (NLP)-based image captioning method to describe fetal ultrasound video content by modelling the vocabulary commonly used by sonographers and sonologists. The generated captions are similar to the words spoken by a sonographer when describing the scan experience in terms of visual content and performed scanning actions. Using full-length second-trimester fetal ultrasound videos and text derived from accompanying expert voice-over audio recordings, we train deep learning models consisting of convolutional neural networks and recurrent neural networks in merged configurations to generate captions for ultrasound video frames. We evaluate different model architectures using established general metrics (, ) and application-specific metrics. Results show that the proposed models can learn joint representations of image and text to generate relevant and descriptive captions for anatomies, such as the spine, the abdomen, the heart, and the head, in clinical fetal ultrasound scans.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6978141	PMC
http://dx.doi.org/10.1007/978-3-030-32251-9_37	DOI Listing

Publication Analysis

Top Keywords

fetal ultrasound

ultrasound video

neural networks

captioning ultrasound

ultrasound images

images automatically

automatically describe

describe automatic

automatic natural

natural language

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!