Publications by authors named "Diane Larlus"

Article Synopsis
  • The authors introduce Neural Feature Fusion Fields (N3F), a technique that enhances 2D image feature extractors for analyzing images that can be reconstructed into a 3D scene.
  • N3F works by using a 2D feature extractor as a teacher to train a 3D "student" network, capable of distilling these features while leveraging differentiable rendering methods.
  • The method improves tasks like 2D object retrieval and 3D segmentation without needing manual labels, showing better performance compared to traditional self-supervised 2D methods, especially in scenarios like long egocentric videos.
View Article and Find Full Text PDF

We propose an unsupervised method to learn the 3D geometry of object categories by looking around them. Differently from traditional approaches, this method does not require CAD models or manual supervision. Instead, using only video sequences showing object instances from a moving viewpoint, the method learns a deep neural network that can predict several aspects of the 3D geometry of such objects from single images.

View Article and Find Full Text PDF

This article deals with the detection of prominent objects in images. As opposed to the standard approaches based on sliding windows, we study a fundamentally different solution by formulating the supervised prediction of a bounding box as an image retrieval task. Indeed, given a global image descriptor, we find the most similar images in an annotated dataset, and transfer the object bounding boxes.

View Article and Find Full Text PDF

This paper considers scalable and unobtrusive activity recognition using on-body sensing for context awareness in wearable computing. Common methods for activity recognition rely on supervised learning requiring substantial amounts of labeled training data. Obtaining accurate and detailed annotations of activities is challenging, preventing the applicability of these approaches in real-world settings.

View Article and Find Full Text PDF

Color names are required in real-world applications such as image retrieval and image annotation. Traditionally, they are learned from a collection of labeled color chips. These color chips are labeled with color names within a well-defined experimental setup by human test subjects.

View Article and Find Full Text PDF