Scattering networks are a class of designed Convolutional Neural Networks (CNNs) with fixed weights. We argue they can serve as generic representations for modelling images. In particular, by working in scattering space, we achieve competitive results both for supervised and unsupervised learning tasks, while making progress towards constructing more interpretable CNNs. For supervised learning, we demonstrate that the early layers of CNNs do not necessarily need to be learned, and can be replaced with a scattering network instead. Indeed, using hybrid architectures, we achieve the best results with predefined representations to-date, while being competitive with end-to-end learned CNNs. Specifically, even applying a shallow cascade of small-windowed scattering coefficients followed by $1\times 1$1×1-convolutions results in AlexNet accuracy on the ILSVRC2012 classification task. Moreover, by combining scattering networks with deep residual networks, we achieve a single-crop top-5 error of 11.4 percent on ILSVRC2012. Also, we show they can yield excellent performance in the small sample regime on CIFAR-10 and STL-10 datasets, exceeding their end-to-end counterparts, through their ability to incorporate geometrical priors. For unsupervised learning, scattering coefficients can be a competitive representation that permits image recovery. We use this fact to train hybrid GANs to generate images. Finally, we empirically analyze several properties related to stability and reconstruction of images from scattering coefficients.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2018.2855738DOI Listing

Publication Analysis

Top Keywords

scattering networks
12
scattering coefficients
12
scattering
8
learning scattering
8
unsupervised learning
8
networks hybrid
4
hybrid representation
4
learning
4
representation learning
4
networks
4

Similar Publications

Objective: This study aims to evaluate the efficacy of wearable physiology and movement sensors in identifying a spectrum of challenging behaviors, including self-injurious behavior (SIB), in children and teenagers with autism spectrum disorder (ASD) in real-world settings.

Approach: We utilized a long-short-term memory (LSTM) network with features derived using the wavelet scatter transform to analyze physiological biosignals, including electrodermal activity and skin temperature, alongside three-dimensional movement data captured via accelerometers. The study was conducted in naturalistic environments, focusing on participants' daily activities.

View Article and Find Full Text PDF

Firefighters have a greater prevalence of posttraumatic stress disorder (PTSD) because of their greater risk of exposure to traumatic events. Network analysis offers novel perspectives for understanding PTSD. However, most previous network analysis studies were cross-sectional and failed to reveal the dynamics and causality of PTSD symptoms.

View Article and Find Full Text PDF

Hydration plays a crucial role in regulating the dispersion behavior of biomolecules in water, particularly in how pH-sensitive hydration water network forms around proteins. This study explores the conformation and hydration structure of Type-I tropocollagen using small- and wide-angle X-ray scattering (SWAXS) and molecular dynamics (MD) simulations. The results reveal that tropocollagen exhibits a significant softening conformation in solution, transitioning from its rod-like structure in tissues to a worm-like conformation, characterized by a reduced radius of gyration of 50 nm and a persistent length of 34 nm.

View Article and Find Full Text PDF

Metastable phases can exist within local minima in the potential energy landscape when they are kinetically "trapped" by various processing routes, such as thermal treatment, grain size reduction, chemical doping, interfacial stress, or irradiation. Despite the importance of metastable materials for many technological applications, little is known about the underlying structural mechanisms of the stabilization process and atomic-scale nature of the resulting defective metastable phase. Investigating ion-irradiated and nanocrystalline zirconia with neutron total scattering experiments, we show that metastable tetragonal ZrO consists of an underlying structure of ferroelastic, orthorhombic nanoscale domains stabilized by a network of domain walls.

View Article and Find Full Text PDF

Vision transformers have garnered substantial attention and attained impressive performance in image super-resolution tasks. Nevertheless, these networks face challenges associated with attention complexity and the effective capture of intricate, fine-grained details within images. These hurdles impede the efficient and scalable deployment of transformer models for image super-resolution tasks in real-world applications.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!