Publications by authors named "Carla M Mann"

We seek to transform how new and emergent variants of pandemic-causing viruses, specifically SARS-CoV-2, are identified and classified. By adapting large language models (LLMs) for genomic data, we build genome-scale language models (GenSLMs) which can learn the evolutionary landscape of SARS-CoV-2 genomes. By pre-training on over 110 million prokaryotic gene sequences and fine-tuning a SARS-CoV-2-specific model on 1.

View Article and Find Full Text PDF

Efficient precision genome engineering requires high frequency and specificity of integration at the genomic target site. Multiple design strategies for zebrafish gene targeting have previously been reported with widely varying frequencies for germline recovery of integration alleles. The GeneWeld protocol and pGTag (plasmids for Gene Tagging) vector series provide a set of resources to streamline precision gene targeting in zebrafish.

View Article and Find Full Text PDF
Article Synopsis
  • This research presents a method for effectively integrating reporter genes in zebrafish and mammalian cells using short DNA sequences for high precision.
  • The approach utilizes a series of plasmids, known as pGTag, which incorporate CRISPR technology, allowing easy release of DNA sequences at targeted locations.
  • Results showed high rates of successful gene targeting in zebrafish and efficient integration in human and pig cells, making the method practical and budget-friendly for various gene editing applications.
View Article and Find Full Text PDF

The discovery and development of DNA-editing nucleases (Zinc Finger Nucleases, TALENs, CRISPR/Cas systems) has given scientists the ability to precisely engineer or edit genomes as never before. Several different platforms, protocols and vectors for precision genome editing are now available, leading to the development of supporting web-based software. Here we present the Gene Sculpt Suite (GSS), which comprises three tools: (i) GTagHD, which automatically designs and generates oligonucleotides for use with the GeneWeld knock-in protocol; (ii) MEDJED, a machine learning method, which predicts the extent to which a double-stranded DNA break site will utilize the microhomology-mediated repair pathway; and (iii) MENTHU, a tool for identifying genomic locations likely to give rise to a single predominant microhomology-mediated end joining allele (PreMA) repair outcome.

View Article and Find Full Text PDF

One key problem in precision genome editing is the unpredictable plurality of sequence outcomes at the site of targeted DNA double stranded breaks (DSBs). This is due to the typical activation of the versatile Non-homologous End Joining (NHEJ) pathway. Such unpredictability limits the utility of somatic gene editing for applications including gene therapy and functional genomics.

View Article and Find Full Text PDF

Experimental methods for identifying protein(s) bound by a specific promoter-associated RNA (paRNA) of interest can be expensive, difficult, and time-consuming. This chapter describes a general computational framework for identifying potential binding partners in RNA-protein complexes or RNA-protein interaction networks. Protocols for using three web-based tools to predict RNA-protein interaction partners are outlined.

View Article and Find Full Text PDF

Efforts to predict interfacial residues in protein-RNA complexes have largely focused on predicting RNA-binding residues in proteins. Computational methods for predicting protein-binding residues in RNA sequences, however, are a problem that has received relatively little attention to date. Although the value of sequence motifs for classifying and annotating protein sequences is well established, sequence motifs have not been widely applied to predicting interfacial residues in macromolecular complexes.

View Article and Find Full Text PDF