Publications by authors named "Hirn M"

The ability to measure gene expression at single-cell resolution has elevated our understanding of how biological features emerge from complex and interdependent networks at molecular, cellular, and tissue scales. As technologies have evolved that complement scRNAseq measurements with things like single-cell proteomic, epigenomic, and genomic information, it becomes increasingly apparent how much biology exists as a product of multimodal regulation. Biological processes such as transcription, translation, and post-translational or epigenetic modification impose both energetic and specific molecular demands on a cell and are therefore implicitly constrained by the metabolic state of the cell.

View Article and Find Full Text PDF

The purpose of this study was to assess the utility of a low-cost flow simulation tool for an indoor air modeling application by comparing its outputs with the results of a physical experiment, as well as those from a more advanced computational fluid dynamics (CFD) software package. Five aerosol dispersion tests were performed in two different classrooms by releasing a CO tracer gas from six student locations. Resultant steady-state concentrations were monitored at 13 locations around the periphery of the room.

View Article and Find Full Text PDF

In this paper, we generalize finite depth wavelet scattering transforms, which we formulate as norms of a cascade of continuous wavelet transforms (or dyadic wavelet transforms) and contractive nonlinearities. We then provide norms for these operators, prove that these operators are well-defined, and are Lipschitz continuous to the action of diffeomorphisms in specific cases. Lastly, we extend our results to formulate an operator invariant to the action of rotations and an operator that is equivariant to the action of rotations of .

View Article and Find Full Text PDF
Article Synopsis
  • The manifold scattering transform is a deep learning tool designed for analyzing data on Riemannian manifolds, extending concepts from convolutional neural networks to more complex geometries.
  • Previous research primarily examined its theoretical aspects but lacked practical numerical methods, particularly outside of two-dimensional settings.
  • This new work introduces diffusion map-based techniques for applying the transform to high-dimensional datasets, such as those found in single-cell genetics, demonstrating effectiveness in classifying signals and manifolds.
View Article and Find Full Text PDF

Due to commonalities in pathophysiology, age-related macular degeneration (AMD) represents a uniquely accessible model to investigate therapies for neurodegenerative diseases, leading us to examine whether pathways of disease progression are shared across neurodegenerative conditions. Here we use single-nucleus RNA sequencing to profile lesions from 11 postmortem human retinas with age-related macular degeneration and 6 control retinas with no history of retinal disease. We create a machine-learning pipeline based on recent advances in data geometry and topology and identify activated glial populations enriched in the early phase of disease.

View Article and Find Full Text PDF

Accurate earthquake location and magnitude estimation play critical roles in seismology. Recent deep learning frameworks have produced encouraging results on various seismological tasks (e.g.

View Article and Find Full Text PDF

Motivation: Accurately representing biological networks in a low-dimensional space, also known as network embedding, is a critical step in network-based machine learning and is carried out widely using node2vec, an unsupervised method based on biased random walks. However, while many networks, including functional gene interaction networks, are dense, weighted graphs, node2vec is fundamentally limited in its ability to use edge weights during the biased random walk generation process, thus under-using all the information in the network.

Results: Here, we present node2vec+, a natural extension of node2vec that accounts for edge weights when calculating walk biases and reduces to node2vec in the cases of unweighted graphs or unbiased walks.

View Article and Find Full Text PDF

We present a machine learning model for the analysis of randomly generated discrete signals, modeled as the points of an inhomogeneous, compound Poisson point process. Like the wavelet scattering transform introduced by Mallat, our construction is naturally invariant to translations and reflections, but it decouples the roles of scale and frequency, replacing wavelets with Gabor-type measurements. We show that, with suitable nonlinearities, our measurements distinguish Poisson point processes from common self-similar processes, and separate different types of Poisson point processes.

View Article and Find Full Text PDF

The scattering transform is a wavelet-based model of Convolutional Neural Networks originally introduced by S. Mallat. Mallat's analysis shows that this network has desirable stability and invariance guarantees and therefore helps explain the observation that the filters learned by early layers of a Convolutional Neural Network typically resemble wavelets.

View Article and Find Full Text PDF

The prevalence of graph-based data has spurred the rapid development of graph neural networks (GNNs) and related machine learning algorithms. Yet, despite the many datasets naturally modeled as directed graphs, including citation, website, and traffic networks, the vast majority of this research focuses on undirected graphs. In this paper, we propose , a GNN for directed graphs based on a complex Hermitian matrix known as the magnetic Laplacian.

View Article and Find Full Text PDF

As the biomedical community produces datasets that are increasingly complex and high dimensional, there is a need for more sophisticated computational tools to extract biological insights. We present Multiscale PHATE, a method that sweeps through all levels of data granularity to learn abstracted biological features directly predictive of disease outcome. Built on a coarse-graining process called diffusion condensation, Multiscale PHATE learns a data topology that can be analyzed at coarse resolutions for high-level summarizations of data and at fine resolutions for detailed representations of subsets.

View Article and Find Full Text PDF

We propose a nonlinear, wavelet-based signal representation that is translation invariant and robust to both additive noise and random dilations. Motivated by the multi-reference alignment problem and generalizations thereof, we analyze the statistical properties of this representation given a large number of independent corruptions of a target signal. We prove the nonlinear wavelet-based representation uniquely defines the power spectrum but allows for an unbiasing procedure that cannot be directly applied to the power spectrum.

View Article and Find Full Text PDF

The Euclidean scattering transform was introduced nearly a decade ago to improve the mathematical understanding of convolutional neural networks. Inspired by recent interest in geometric deep learning, which aims to generalize convolutional neural networks to manifold and graph-structured domains, we define a geometric scattering transform on manifolds. Similar to the Euclidean scattering transform, the geometric scattering transform is based on a cascade of wavelet filters and pointwise nonlinearities.

View Article and Find Full Text PDF

Background: Humans are exposed to tens of thousands of chemical substances that need to be assessed for their potential toxicity. Acute systemic toxicity testing serves as the basis for regulatory hazard classification, labeling, and risk management. However, it is cost- and time-prohibitive to evaluate all new and existing chemicals using traditional rodent acute toxicity tests.

View Article and Find Full Text PDF

This work examines methods for predicting the partition coefficient (log P) for a dataset of small molecules. Here, we use atomic attributes such as radius and partial charge, which are typically used as force field parameters in classical molecular dynamics simulations. These atomic attributes are transformed into index-invariant molecular features using a recently developed method called geometric scattering for graphs (GSG).

View Article and Find Full Text PDF

The dream of machine learning in materials science is for a model to learn the underlying physics of an atomic system, allowing it to move beyond the interpolation of the training set to the prediction of properties that were not present in the original training data. In addition to advances in machine learning architectures and training techniques, achieving this ambitious goal requires a method to convert a 3D atomic system into a feature representation that preserves rotational and translational symmetries, smoothness under small perturbations, and invariance under re-ordering. The atomic orbital wavelet scattering transform preserves these symmetries by construction and has achieved great success as a featurization method for machine learning energy prediction.

View Article and Find Full Text PDF

Big data often has emergent structure that exists at multiple levels of abstraction, which are useful for characterizing complex interactions and dynamics of the observations. Here, we consider multiple levels of abstraction via a multiresolution geometry of data points at different granularities. To construct this geometry we define a time-inhomogemeous diffusion process that effectively condenses data points together to uncover nested groupings at larger and larger granularities.

View Article and Find Full Text PDF

An amendment to this paper has been published and can be accessed via a link at the top of the paper.

View Article and Find Full Text PDF

The high-dimensional data created by high-throughput technologies require visualization tools that reveal data structure and patterns in an intuitive form. We present PHATE, a visualization method that captures both local and global nonlinear structure using an information-geometric distance between data points. We compare PHATE to other tools on a variety of artificial and biological datasets, and find that it consistently preserves a range of patterns in data, including continual progressions, branches and clusters, better than other tools.

View Article and Find Full Text PDF

We present a machine learning algorithm for the prediction of molecule properties inspired by ideas from density functional theory (DFT). Using Gaussian-type orbital functions, we create surrogate electronic densities of the molecule from which we compute invariant "solid harmonic scattering coefficients" that account for different types of interactions at different scales. Multilinear regressions of various physical properties of molecules are computed from these invariant coefficients.

View Article and Find Full Text PDF

Establishing and maintaining public water services in fragile states is a significant development challenge. In anticipation of water infrastructure investments, this study compares drinking water sources and quality between Port Harcourt, Nigeria, and Monrovia, Liberia, two cities recovering from political and economic instability. In both cities, access to piped water is low, and residents rely on a range of other private and public water sources.

View Article and Find Full Text PDF

The adsorption of two anionic dyes, Remazol Black B (RB5) and Acidol Red 2BE-NW (AR42), onto a microporous activated carbon felt was investigated. The characterization of carbon surface chemistry by X-ray microanalysis, Boehm titrations, and pH-PZC measurements indicates that the surface oxygenated groups are mainly acidic. The rate of adsorption depends on the pH and the experimental data fit the intraparticle diffusion model.

View Article and Find Full Text PDF

Aim: The aims of this project were to analyze the factors that influence quality and safety of tissues for transplantation and to develop the method to ensure standards of quality and safety in relation to tissue banking as demanded by European Directive 2004/23/EC and its technical annexes. It is organized in 4 Working Groups, the objectives of each one being focused in a specific area.

Standards: The Guide of Recommendations for Tissue Banking is structured into 4 parts: (1) quality systems that apply to tissue banking and general quality system requirements, (2) regulatory framework in Europe, (3) standards available, and (4) recommendations of the fundamental quality and safety keypoints.

View Article and Find Full Text PDF