Publications by authors named "Luis F Paulin"

The abundance of Lp(a) protein holds significant implications for the risk of cardiovascular disease (CVD), which is directly impacted by the copy number (CN) of KIV-2, a 5.5 kbp sub-region. KIV-2 is highly polymorphic in the population and accurate analysis is challenging.

View Article and Find Full Text PDF
Article Synopsis
  • The Long-Read Personalized OncoGenomics (POG) dataset features 189 patient tumors and 41 matched normal samples, sequenced with Oxford Nanopore Technologies, providing a comprehensive resource for cancer research.
  • It highlights the advantages of long-read sequencing in identifying complex structural variants, viral integrations, and specific DNA behaviors, such as prominent methylation patterns associated with various cancers.
  • The findings underscore the potential of this dataset in precision medicine, serving as a tool for advancing analytical techniques in cancer genomics.
View Article and Find Full Text PDF
Article Synopsis
  • The Genome in a Bottle Consortium (GIAB) is creating matched tumor-normal samples that are publicly consented for sharing genomic data and cell lines, focusing on pancreatic ductal adenocarcinoma (PDAC).
  • They provide a comprehensive genomic dataset from the first individual, combining high-depth DNA from tumor and normal cells using advanced whole genome sequencing technologies.
  • This open-access resource aims to help develop benchmarks for detecting genetic variants in cancer, fostering innovation in genome measurement and analysis tools.
View Article and Find Full Text PDF

The duplication-triplication/inverted-duplication (DUP-TRP/INV-DUP) structure is a complex genomic rearrangement (CGR). Although it has been identified as an important pathogenic DNA mutation signature in genomic disorders and cancer genomes, its architecture remains unresolved. Here, we studied the genomic architecture of DUP-TRP/INV-DUP by investigating the DNA of 24 patients identified by array comparative genomic hybridization (aCGH) on whom we found evidence for the existence of 4 out of 4 predicted structural variant (SV) haplotypes.

View Article and Find Full Text PDF

The complexities of cancer genomes are becoming more easily interpreted due to advancements in sequencing technologies and improved bioinformatic analysis. Structural variants (SVs) represent an important subset of somatic events in tumors. While detection of SVs has been markedly improved by the development of long-read sequencing, somatic variant identification and annotation remains challenging.

View Article and Find Full Text PDF
Article Synopsis
  • * It is 11.8 times faster and 29% more accurate than existing SV callers, effectively handling various sequencing technologies and SV types while also generating fully genotyped VCF files for population-level analysis.
  • * The tool successfully identified complex SVs around the MECP2 gene and detected diverse mosaic SVs in brain tissue, revealing significant implications for genes related to neuron function in conditions like multiple system atrophy.
View Article and Find Full Text PDF

Complete, telomere-to-telomere (T2T) genome assemblies promise improved analyses and the discovery of new variants, but many essential genomic resources remain associated with older reference genomes. Thus, there is a need to translate genomic features and read alignments between references. Here we describe a method called levioSAM2 that performs fast and accurate lift-over between assemblies using a whole-genome map.

View Article and Find Full Text PDF

Background: The duplication-triplication/inverted-duplication (DUP-TRP/INV-DUP) structure is a type of complex genomic rearrangement (CGR) hypothesized to result from replicative repair of DNA due to replication fork collapse. It is often mediated by a pair of inverted low-copy repeats (LCR) followed by iterative template switches resulting in at least two breakpoint junctions . Although it has been identified as an important mutation signature of pathogenicity for genomic disorders and cancer genomes, its architecture remains unresolved and is predicted to display at least four structural variation (SV) haplotypes.

View Article and Find Full Text PDF

The human Y chromosome has been notoriously difficult to sequence and assemble because of its complex repeat structure that includes long palindromes, tandem repeats and segmental duplications. As a result, more than half of the Y chromosome is missing from the GRCh38 reference sequence and it remains the last human chromosome to be finished. Here, the Telomere-to-Telomere (T2T) consortium presents the complete 62,460,029-base-pair sequence of a human Y chromosome from the HG002 genome (T2T-Y) that corrects multiple errors in GRCh38-Y and adds over 30 million base pairs of sequence to the reference, showing the complete ampliconic structures of gene families TSPY, DAZ and RBMY; 41 additional protein-coding genes, mostly from the TSPY family; and an alternating pattern of human satellite 1 and 3 blocks in the heterochromatic Yq12 region.

View Article and Find Full Text PDF
Article Synopsis
  • The human reference genome, GRCh38, has errors like duplicated and collapsed regions, which can affect important genes related to health.
  • FixItFelix is a new method that helps correct these errors quickly and easily without changing the original data's layout.
  • The improvements help scientists study different populations better and understand how genes can affect traits and diseases.
View Article and Find Full Text PDF

Background: Recent population studies are ever growing in number of samples to investigate the diversity of a population or species. These studies reveal new polymorphism that lead to important insights into the mechanisms of evolution, but are also important for the interpretation of these variations. Nevertheless, while the full catalog of variations across entire species remains unknown, we can predict which regions harbor additional not yet detected variations and investigate their properties, thereby enhancing the analysis for potentially missed variants.

View Article and Find Full Text PDF

In October 2021, 59 scientists from 14 countries and 13 U.S. states collaborated virtually in the Third Annual Baylor College of Medicine & DNANexus Structural Variation hackathon.

View Article and Find Full Text PDF

Meiosis is a specialized cell division that gives rise to genetically distinct gametic cells. Meiosis relies on the tightly controlled formation of DNA double-strand breaks (DSBs) and their repair via homologous recombination for correct chromosome segregation. Like all forms of DNA damage, meiotic DSBs are potentially harmful and their formation activates an elaborate response to inhibit excessive DNA break formation and ensure successful repair.

View Article and Find Full Text PDF

Poly(ADP-ribosyl)ation is a reversible post-translational modification synthetized by ADP-ribose transferases and removed by poly(ADP-ribose) glycohydrolase (PARG), which plays important roles in DNA damage repair. While well-studied in somatic tissues, much less is known about poly(ADP-ribosyl)ation in the germline, where DNA double-strand breaks are introduced by a regulated program and repaired by crossover recombination to establish a tether between homologous chromosomes. The interaction between the parental chromosomes is facilitated by meiotic specific adaptation of the chromosome axes and cohesins, and reinforced by the synaptonemal complex.

View Article and Find Full Text PDF

Recent evidence suggests that most influenza A virus gene segments can contribute to the pathogenicity of the virus. In this regard, the hemagglutinin (HA) subtype of the circulating strains has been closely surveyed, but the reassortment of internal gene segments is usually not monitored as a potential source of an increased pathogenicity. In this work, an oligonucleotide DNA microarray (PhyloFlu) designed to determine the phylogenetic origins of the eight segments of the influenza virus genome was constructed and validated.

View Article and Find Full Text PDF