IEEE Trans Pattern Anal Mach Intell
December 2013
We present a system to automatically generate natural language descriptions from images. This system consists of two parts. The first part, content planning, smooths the output of computer vision-based detection and recognition algorithms with statistics mined from large pools of visually descriptive text to determine the best content words to use to describe an image.
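The abstract does not say how the smoothing is computed. As a rough illustration only, the sketch below assumes a simple log-linear mix of detector confidence and a text-derived prior for each candidate word. The function names, the `weight` parameter, and all numbers are hypothetical and are not taken from the paper.

```python
import math


def smooth_scores(detector_scores, text_prior, weight=0.5, floor=1e-6):
    """Mix detector confidences with text-derived priors in log space.

    Assumption: a log-linear interpolation; the paper's actual model may differ.
    """
    combined = {}
    for word, det in detector_scores.items():
        prior = text_prior.get(word, floor)  # unseen words get a tiny prior
        combined[word] = ((1 - weight) * math.log(max(det, floor))
                          + weight * math.log(max(prior, floor)))
    return combined


def pick_content_words(detector_scores, text_prior, k=2, weight=0.5):
    """Return the k candidate words with the highest smoothed score."""
    combined = smooth_scores(detector_scores, text_prior, weight)
    return sorted(combined, key=combined.get, reverse=True)[:k]


# Hypothetical detector confidences for one image.
detections = {"dog": 0.60, "wolf": 0.65, "sofa": 0.70}
# Hypothetical relative frequencies mined from descriptive text.
prior = {"dog": 0.30, "wolf": 0.01, "sofa": 0.20}

print(pick_content_words(detections, prior, k=2))
print(pick_content_words(detections, prior, k=2, weight=0.0))
```

With these made-up numbers, the detector alone slightly prefers "wolf" over "dog", but the text prior makes "dog" the more plausible content word once the two sources are combined.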