Video Object Segmentation, and video processing in general, has been historically dominated by methods that rely on the temporal consistency and redundancy in consecutive video frames. When the temporal smoothness is suddenly broken, such as when an object is occluded, or some frames are missing in a sequence, the result of these methods can deteriorate significantly. This paper explores the orthogonal approach of processing each frame independently, i.e., disregarding the temporal information. In particular, it tackles the task of semi-supervised video object segmentation: the separation of an object from the background in a video, given its mask in the first frame. We present Semantic One-Shot Video Object Segmentation (OSVOS$^\mathrm {S}$S), based on a fully-convolutional neural network architecture that is able to successively transfer generic semantic information, learned on ImageNet, to the task of foreground segmentation, and finally to learning the appearance of a single annotated object of the test sequence (hence one shot). We show that instance-level semantic information, when combined effectively, can dramatically improve the results of our previous method, OSVOS. We perform experiments on two recent single-object video segmentation databases, which show that OSVOS$^\mathrm {S}$S is both the fastest and most accurate method in the state of the art. Experiments on multi-object video segmentation show that OSVOS$^\mathrm {S}$S obtains competitive results.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TPAMI.2018.2838670 | DOI Listing |
Int J Psychoanal
December 2024
Rio de Janeiro.
The following text describes an analysis, ongoing for three years now, of a boy currently 12 years old, whose projective-expulsive functioning becomes evident through rude and vulgar words. The image of the Cretan labyrinth and its meanders, created by Daedalus as a "protection" against the ferocity of the Minotaur, were the inspiration for this narrative. The intricate defences that imprison the patient, with their characteristics of pathological organisation, resemble a labyrinth, and through this path, the analyst and the patient go on confronting the difficulties of the process.
View Article and Find Full Text PDFBMC Bioinformatics
January 2025
Biology Department, University of Massachusetts Amherst, Amherst, MA, USA.
Background: High-throughput behavioral analysis is important for drug discovery, toxicological studies, and the modeling of neurological disorders such as autism and epilepsy. Zebrafish embryos and larvae are ideal for such applications because they are spawned in large clutches, develop rapidly, feature a relatively simple nervous system, and have orthologs to many human disease genes. However, existing software for video-based behavioral analysis can be incompatible with recordings that contain dynamic backgrounds or foreign objects, lack support for multiwell formats, require expensive hardware, and/or demand considerable programming expertise.
View Article and Find Full Text PDFJ Imaging
January 2025
Science and Research Department, Moscow Technical University of Communications and Informatics, 111024 Moscow, Russia.
Object detection in images is a fundamental component of many safety-critical systems, such as autonomous driving, video surveillance systems, and robotics. Adversarial patch attacks, being easily implemented in the real world, provide effective counteraction to object detection by state-of-the-art neural-based detectors. It poses a serious danger in various fields of activity.
View Article and Find Full Text PDFJ Imaging
January 2025
School of Artificial Intelligence, Changchun University of Science and Technology, Changchun 130012, China.
For surveillance video management in university laboratories, issues such as occlusion and low-resolution face capture often arise. Traditional face recognition algorithms are typically static and rely heavily on clear images, resulting in inaccurate recognition for low-resolution, small-sized faces. To address the challenges of occlusion and low-resolution person identification, this paper proposes a new face recognition framework by reconstructing Retinaface-Resnet and combining it with Quality-Adaptive Margin (adaface).
View Article and Find Full Text PDFJ Pers Med
January 2025
Department of Thoracic Surgery, Sant'Andrea, Hospital, Sapienza University, 00189 Rome, Italy.
. The optimal surgical approach for thymoma resection is still an object of debate. The increasing experience in robotic-assisted thoracic surgery (RATS) has led to the progressive affirmation of this technique as a valid alternative to Sternotomy, Thoracotomy and Video-Assisted Thoracic Surgery (VATS) in this setting.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!