Using a model of human visual perception to improve deep learning.

Neural Netw

École Polytechnique Fédérale de Lausanne (EPFL), Switzerland; Purdue University, Department of Psychological Sciences, 703 Third Street, West Lafayette, IN 47906, United States.

Published: August 2018

Deep learning algorithms achieve human-level (or better) performance on many tasks, but there remain situations where humans learn better or faster. For image classification, we argue that some of these situations arise because the human visual system represents information in a format that promotes good training and classification. To demonstrate this idea, we show how occluding objects can impair the performance of a deep learning system trained to classify digits in the MNIST database. We describe a human-inspired segmentation and interpolation algorithm that attempts to reconstruct the occluded parts of an image, and we show that using this reconstruction algorithm to pre-process occluded images improves training and classification performance.
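The occlude-then-reconstruct idea in the abstract can be illustrated with a minimal sketch. This is not the authors' segmentation and interpolation algorithm, which the abstract does not specify. The horizontal-bar occluder, the column-wise linear interpolation, the synthetic 28x28 stroke standing in for an MNIST digit, and the function names `occlude_with_bars` and `interpolate_columns` are all assumptions made for illustration.

```python
import numpy as np


def occlude_with_bars(image, bar_height=2, spacing=6, value=1.0):
    """Draw horizontal bars over the image (hypothetical occluder).

    Returns the occluded image and a boolean mask of the hidden pixels.
    """
    occluded = image.copy()
    mask = np.zeros(image.shape, dtype=bool)
    for top in range(0, image.shape[0], spacing):
        mask[top:top + bar_height, :] = True
    occluded[mask] = value
    return occluded, mask


def interpolate_columns(occluded, mask):
    """Fill hidden pixels by linear interpolation along each column.

    A naive stand-in for a reconstruction step, not the paper's method.
    """
    restored = occluded.copy()
    rows = np.arange(occluded.shape[0])
    for col in range(occluded.shape[1]):
        hidden = mask[:, col]
        visible = ~hidden
        if hidden.any() and visible.any():
            restored[hidden, col] = np.interp(
                rows[hidden], rows[visible], occluded[visible, col]
            )
    return restored


# Synthetic 28x28 image with one vertical stroke (assumed stand-in for a digit).
image = np.zeros((28, 28))
image[4:24, 13:15] = 1.0

occluded, mask = occlude_with_bars(image)
restored = interpolate_columns(occluded, mask)

# Compare how far each version is from the unoccluded image.
error_occluded = np.abs(occluded - image).sum()
error_restored = np.abs(restored - image).sum()
print(error_occluded, error_restored)
```

In this toy setting the interpolated image is much closer to the original than the occluded one. A pre-processing step of this kind would sit in front of the classifier, so that both training and test images are reconstructed before being classified.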

Source
http://dx.doi.org/10.1016/j.neunet.2018.04.005

Similar Publications

Deep learning has emerged as a powerful tool in medical imaging, particularly for corneal topographic map classification. However, the scarcity of labeled data poses a significant challenge to achieving robust performance. This study investigates the impact of various data augmentation strategies on enhancing the performance of a customized convolutional neural network model for corneal topographic map classification.

Economic losses in cattle farms are frequently associated with failed pregnancies. Some studies found that the transcriptomic profiles of blood and endometrial tissues in cattle with varying pregnancy outcomes display discrepancies even before artificial insemination (AI) or embryo transfer (ET). In the study, 330 samples from seven distinct sources and two tissue types were integrated and divided into two groups based on the ability to establish and maintain pregnancy after AI or ET: P (pregnant) and NP (nonpregnant).

Objectives: This study aimed to develop an automated method for generating clearer, well-aligned panoramic views by creating an optimized three-dimensional (3D) reconstruction zone centered on the teeth. The approach focused on achieving high contrast and clarity in key dental features, including tooth roots, morphology, and periapical lesions, by applying a 3D U-Net deep learning model to generate an arch surface and align the panoramic view.

Methods: This retrospective study analyzed anonymized cone-beam CT (CBCT) scans from 312 patients (mean age 40 years; range 10-78; 41.

Bruises can affect the appearance and nutritional value of apples and cause economic losses. Therefore, the accurate detection of bruise levels and bruise time of apples is crucial. In this paper, we proposed a method that combines a self-designed multispectral imaging system with deep learning to accurately detect the level and time of bruising on apples.

Image retrieval is the process of retrieving images relevant to a query image from the internet with minimal search time. The problem with conventional Content-Based Image Retrieval (CBIR) systems is that they produce retrieval results for either colour images or greyscale images alone. Moreover, CBIR systems are complex and take considerable time to produce meaningful retrieval results.
