Publications by Erfaneh Gharavi

Publications by authors named "Erfaneh Gharavi"

Page 1 of 1

Methods for evaluating unsupervised vector representations of genomic regions.

Guangtao Zheng Julia Rymuza Erfaneh Gharavi Nathan J LeRoy Aidong Zhang

NAR Genom Bioinform

September 2024

Article Synopsis

Representation learning models are essential in genomics, creating vector representations (or embeddings) of biological entities like cells and genes.
Unsupervised methods can uncover relationships among genomic regions and derive meaningful insights without relying on curated metadata.
To assess the quality of these region embeddings, four evaluation metrics are proposed: CTS, RCS, GDSS, and NPS, which measure clustering ability and how well genomic relationships are captured in the embeddings.

View Article and Find Full Text PDF

Fast clustering and cell-type annotation of scATAC data using pre-trained embeddings.

Nathan J LeRoy Jason P Smith Guangtao Zheng Julia Rymuza Erfaneh Gharavi

NAR Genom Bioinform

September 2024

Data from the single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) are now widely available. One major computational challenge is dealing with high dimensionality and inherent sparsity, which is typically addressed by producing lower dimensional representations of single cells for downstream clustering tasks. Current approaches produce such individual cell embeddings directly through a one-step learning process.

View Article and Find Full Text PDF

Joint Representation Learning for Retrieval and Annotation of Genomic Interval Sets.

Erfaneh Gharavi Nathan J LeRoy Guangtao Zheng Aidong Zhang Donald E Brown

Bioengineering (Basel)

March 2024

As available genomic interval data increase in scale, we require fast systems to search them. A common approach is simple string matching to compare a search term to metadata, but this is limited by incomplete or inaccurate annotations. An alternative is to compare data directly through genomic region overlap analysis, but this approach leads to challenges like sparsity, high dimensionality, and computational expense.

View Article and Find Full Text PDF

Embeddings of genomic region sets capture rich biological associations in lower dimensions.

Erfaneh Gharavi Aaron Gu Guangtao Zheng Jason P Smith Hyun Jae Cho

Bioinformatics

December 2021

Motivation: Genomic region sets summarize functional genomics data and define locations of interest in the genome such as regulatory regions or transcription factor binding sites. The number of publicly available region sets has increased dramatically, leading to challenges in data analysis.

Results: We propose a new method to represent genomic region sets as vectors, or embeddings, using an adapted word2vec approach.

View Article and Find Full Text PDF

Publications by authors named "Erfaneh Gharavi"

Methods for evaluating unsupervised vector representations of genomic regions.

Article Synopsis

Fast clustering and cell-type annotation of scATAC data using pre-trained embeddings.

Joint Representation Learning for Retrieval and Annotation of Genomic Interval Sets.

Embeddings of genomic region sets capture rich biological associations in lower dimensions.

A PHP Error was encountered

A PHP Error was encountered