IEEE Trans Pattern Anal Mach Intell
January 2017
Object category localization is a challenging problem in computer vision. Standard supervised training requires bounding box annotations of object instances. This time-consuming annotation process is sidestepped in weakly supervised learning.
View Article and Find Full Text PDFIEEE Trans Pattern Anal Mach Intell
June 2016
The bag-of-words (BoW) model treats images as sets of local descriptors and represents them by visual word histograms. The Fisher vector (FV) representation extends BoW, by considering the first and second order statistics of local descriptors. In both representations local descriptors are assumed to be identically and independently distributed (iid), which is a poor assumption from a modeling perspective.
View Article and Find Full Text PDF