Publications by authors named "Guy Wolf"

Neurons in the brain have rich and adaptive input-output properties. Features such as heterogeneous f-I curves and spike frequency adaptation are known to place single neurons in optimal coding regimes when facing changing stimuli. Yet, it is still unclear how brain circuits exploit single-neuron flexibility, and how network-level requirements may have shaped such cellular function.

View Article and Find Full Text PDF

Big neuroscience datasets are not big small datasets when it comes to quantitative data analysis. Neuroscience has now witnessed the advent of many population cohort studies that deep-profile participants, yielding hundreds of measures, capturing dimensions of each individual's position in the broader society. Indeed, there is a rebalancing from small, strictly selected, and thus homogenized cohorts toward always larger, more representative, and thus diverse cohorts.

View Article and Find Full Text PDF

We aimed to implement four data partitioning strategies evaluated with four federated learning (FL) algorithms and investigate the impact of data distribution on FL model performance in detecting steatosis using B-mode US images. A private dataset (153 patients; 1530 images) and a public dataset (55 patient; 550 images) were included in this retrospective study. The datasets contained patients with metabolic dysfunction-associated fatty liver disease (MAFLD) with biopsy-proven steatosis grades and control individuals without steatosis.

View Article and Find Full Text PDF

Late onset Alzheimer's disease (AD) is a progressive neurodegenerative disease, with brain changes beginning years before symptoms surface. AD is characterized by neuronal loss, the classic feature of the disease that underlies brain atrophy. However, GWAS reports and recent single-nucleus RNA sequencing (snRNA-seq) efforts have highlighted that glial cells, particularly microglia, claim a central role in AD pathophysiology.

View Article and Find Full Text PDF
Article Synopsis
  • - The study analyzes plasma samples from 318 COVID-19 patients to understand how RNAemia, delayed antibody responses, and inflammation affect patient outcomes, revealing four distinct patient clusters based on severity and survival probability.
  • - Critically ill patients were categorized into good prognosis and high-fatality clusters, while non-critical survivors were divided into high and low early antibody responders, each showing different patterns in antibody development and inflammation.
  • - The findings indicate that high-fatality patients have specific genomic signatures linked to severe COVID-19, and both critical and non-critical patients with delayed antibody responses exhibit persistent interferon (IFN) activity, suggesting that high IFN levels might hinder the body's ability to build effective immunity.
View Article and Find Full Text PDF
Article Synopsis
  • Dimensionality reduction methods like PHATE, t-SNE, and UMAP help visualize complex biological data, but they often do so without the guidance of expert labels.
  • The new method RF-PHATE combines expert knowledge with unsupervised techniques by using random forests to create low-dimensional visualizations that emphasize important data relationships while filtering out irrelevant features.
  • RF-PHATE is effective for large datasets and has been successfully applied in multiple case studies, showing its ability to handle time-series data in multiple sclerosis research, analyze noisy Raman spectral data, and connect geometric structures with COVID-19 outcomes.
View Article and Find Full Text PDF

Efficient computation of optimal transport distance between distributions is of growing importance in data science. Sinkhorn-based methods are currently the state-of-the-art for such computations, but require computations. In addition, Sinkhorn-based methods commonly use an Euclidean ground distance between datapoints.

View Article and Find Full Text PDF

Background Screening for nonalcoholic fatty liver disease (NAFLD) is suboptimal due to the subjective interpretation of US images. Purpose To evaluate the agreement and diagnostic performance of radiologists and a deep learning model in grading hepatic steatosis in NAFLD at US, with biopsy as the reference standard. Materials and Methods This retrospective study included patients with NAFLD and control patients without hepatic steatosis who underwent abdominal US and contemporaneous liver biopsy from September 2010 to October 2019.

View Article and Find Full Text PDF

The complexity of the human brain gives the illusion that brain activity is intrinsically high-dimensional. Nonlinear dimensionality-reduction methods such as uniform manifold approximation and t-distributed stochastic neighbor embedding have been used for high-throughput biomedical data. However, they have not been used extensively for brain activity data such as those from functional magnetic resonance imaging (fMRI), primarily due to their inability to maintain dynamic structure.

View Article and Find Full Text PDF
Article Synopsis
  • - We introduce a technique called Manifold Interpolating Optimal-Transport Flow (MIOFlow) that uses neural ordinary differential equations to create continuous population dynamics from discrete snapshots taken at different times.
  • - MIOFlow combines dynamic models, manifold learning, and optimal transport, enhancing interpolation by using a geodesic autoencoder to maintain the geometry of the data.
  • - Our method outperforms traditional models like normalizing flows and Schrödinger bridges in effectively connecting different population states, as demonstrated through simulated data and real biological datasets.
View Article and Find Full Text PDF

Diffusion-based manifold learning methods have proven useful in representation learning and dimensionality reduction of modern high dimensional, high throughput, noisy datasets. Such datasets are especially present in fields like biology and physics. While it is thought that these methods preserve underlying manifold structure of data by learning a proxy for geodesic distances, no specific theoretical links have been established.

View Article and Find Full Text PDF

Due to commonalities in pathophysiology, age-related macular degeneration (AMD) represents a uniquely accessible model to investigate therapies for neurodegenerative diseases, leading us to examine whether pathways of disease progression are shared across neurodegenerative conditions. Here we use single-nucleus RNA sequencing to profile lesions from 11 postmortem human retinas with age-related macular degeneration and 6 control retinas with no history of retinal disease. We create a machine-learning pipeline based on recent advances in data geometry and topology and identify activated glial populations enriched in the early phase of disease.

View Article and Find Full Text PDF

In modern relational machine learning it is common to encounter large graphs that arise via interactions or similarities between observations in many domains. Further, in many cases the target entities for analysis are actually signals on such graphs. We propose to compare and organize such datasets of graph signals by using an earth mover's distance (EMD) with a geodesic cost over the underlying graph.

View Article and Find Full Text PDF

A fundamental task in data exploration is to extract low dimensional representations that capture intrinsic geometry in data, especially for faithfully visualizing data in two or three dimensions. Common approaches use kernel methods for manifold learning. However, these methods typically only provide an embedding of the input data and cannot extend naturally to new data points.

View Article and Find Full Text PDF

We propose a method called integrated diffusion for combining multimodal data, gathered via different sensors on the same system, to create a integrated data diffusion operator. As real world data suffers from both local and global noise, we introduce mechanisms to optimally calculate a diffusion operator that reflects the combined information in data by maintaining low frequency eigenvectors of each modality both globally and locally. We show the utility of this integrated operator in denoising and visualizing multimodal toy data as well as multi-omic data generated from blood cells, measuring both gene expression and chromatin accessibility.

View Article and Find Full Text PDF
Article Synopsis
  • * A set of genomic surveillance tools based on population genetics has been developed to analyze the virus's genetic diversity, utilizing data from 329,854 sequences in the GISAID database from the early pandemic phase.
  • * Innovative methods like haplotype networks and principal component analysis (PCA) allow for efficient lineage identification and visualization of mutation patterns, aiding in real-time monitoring and understanding of SARS-CoV-2 evolution.
View Article and Find Full Text PDF

As the biomedical community produces datasets that are increasingly complex and high dimensional, there is a need for more sophisticated computational tools to extract biological insights. We present Multiscale PHATE, a method that sweeps through all levels of data granularity to learn abstracted biological features directly predictive of disease outcome. Built on a coarse-graining process called diffusion condensation, Multiscale PHATE learns a data topology that can be analyzed at coarse resolutions for high-level summarizations of data and at fine resolutions for detailed representations of subsets.

View Article and Find Full Text PDF

The first confirmed case of COVID-19 in Quebec, Canada, occurred at Verdun Hospital on February 25, 2020. A month later, a localized outbreak was observed at this hospital. We performed tiled amplicon whole genome nanopore sequencing on nasopharyngeal swabs from all SARS-CoV-2 positive samples from 31 March to 17 April 2020 in 2 local hospitals to assess viral diversity (unknown at the time in Quebec) and potential associations with clinical outcomes.

View Article and Find Full Text PDF
GEOMETRIC SCATTERING ATTENTION NETWORKS.

Proc IEEE Int Conf Acoust Speech Signal Process

June 2021

Geometric scattering has recently gained recognition in graph representation learning, and recent work has shown that integrating scattering features in graph convolution networks (GCNs) can alleviate the typical oversmoothing of features in node representation learning. However, scattering often relies on handcrafted design, requiring careful selection of frequency bands via a cascade of wavelet transforms, as well as an effective weight sharing scheme to combine low- and band-pass information. Here, we introduce a new attention-based architecture to produce adaptive task-driven node representations by implicitly learning node-wise weights for combining multiple scattering and GCN channels in the network.

View Article and Find Full Text PDF

We propose a new graph neural network (GNN) module, based on relaxations of recently proposed geometric scattering transforms, which consist of a cascade of graph wavelet filters. Our learnable geometric scattering (LEGS) module enables adaptive tuning of the wavelets to encourage band-pass features to emerge in learned representations. The incorporation of our LEGS-module in GNNs enables the learning of longer-range graph relations compared to many popular GNNs, which often rely on encoding graph structure via smoothness or similarity between neighbors.

View Article and Find Full Text PDF

While generative models such as GANs have been successful at mapping from noise to specific distributions of data, or more generally from one distribution of data to another, they cannot isolate the transformation that is occurring and apply it to a new distribution not seen in training. Thus, they memorize the domain of the transformation, and cannot generalize the transformation . To address this, we propose a new neural network called a (NTNet) that isolates the signal representing the transformation itself from the other signals representing internal distribution variation.

View Article and Find Full Text PDF

The Euclidean scattering transform was introduced nearly a decade ago to improve the mathematical understanding of convolutional neural networks. Inspired by recent interest in geometric deep learning, which aims to generalize convolutional neural networks to manifold and graph-structured domains, we define a geometric scattering transform on manifolds. Similar to the Euclidean scattering transform, the geometric scattering transform is based on a cascade of wavelet filters and pointwise nonlinearities.

View Article and Find Full Text PDF

It is increasingly common to encounter data from dynamic processes captured by static cross-sectional measurements over time, particularly in biomedical settings. Recent attempts to model individual trajectories from this data use optimal transport to create pairwise matchings between time points. However, these methods cannot model continuous dynamics and non-linear paths that entities can take in these systems.

View Article and Find Full Text PDF
Article Synopsis
  • * Researchers collected and sequenced 264 viral genomes from 242 individuals between March 31 and April 17, 2020, revealing various viral subclades and hospital transmission patterns.
  • * Analysis showed two subclades that challenged standard classification methods and found certain symptoms like headache and sore throat were linked to better patient outcomes while also highlighting limitations in bioinformatics for handling diverse viral strains.
View Article and Find Full Text PDF

We propose a new fast method of measuring distances between large numbers of related high dimensional datasets called the Diffusion Earth Mover's Distance (EMD). We model the datasets as distributions supported on common data graph that is derived from the affinity matrix computed on the combined data. In such cases where the graph is a discretization of an underlying Riemannian closed manifold, we prove that Diffusion EMD is topologically equivalent to the standard EMD with a geodesic ground distance.

View Article and Find Full Text PDF