MacSyFinder: a program to mine genomes for molecular systems with an application to CRISPR-Cas systems.

PLoS One

Microbial Evolutionary Genomics, Institut Pasteur, Paris, France; UMR3525, CNRS, Paris, France.

Published: December 2015

AI Article Synopsis

  • MacSyFinder is a tool created for biologists to identify molecular systems and their homologs in genomic data, providing a flexible framework for modeling molecular properties and evolutionary connections.* -
  • It uses Hidden Markov models to search for sequence similarities and determine system assignments based on a defined model's content and structure, along with a graphical interface (MacSyView) for result visualization.* -
  • The tool is implemented in Python and is available for free, requiring specific software versions, and includes a "Cas-finder" to detect CRISPR-Cas systems with user-friendly access to protein profiles.*

Article Abstract

Motivation: Biologists often wish to use their knowledge on a few experimental models of a given molecular system to identify homologs in genomic data. We developed a generic tool for this purpose.

Results: Macromolecular System Finder (MacSyFinder) provides a flexible framework to model the properties of molecular systems (cellular machinery or pathway) including their components, evolutionary associations with other systems and genetic architecture. Modelled features also include functional analogs, and the multiple uses of a same component by different systems. Models are used to search for molecular systems in complete genomes or in unstructured data like metagenomes. The components of the systems are searched by sequence similarity using Hidden Markov model (HMM) protein profiles. The assignment of hits to a given system is decided based on compliance with the content and organization of the system model. A graphical interface, MacSyView, facilitates the analysis of the results by showing overviews of component content and genomic context. To exemplify the use of MacSyFinder we built models to detect and class CRISPR-Cas systems following a previously established classification. We show that MacSyFinder allows to easily define an accurate "Cas-finder" using publicly available protein profiles.

Availability And Implementation: MacSyFinder is a standalone application implemented in Python. It requires Python 2.7, Hmmer and makeblastdb (version 2.2.28 or higher). It is freely available with its source code under a GPLv3 license at https://github.com/gem-pasteur/macsyfinder. It is compatible with all platforms supporting Python and Hmmer/makeblastdb. The "Cas-finder" (models and HMM profiles) is distributed as a compressed tarball archive as Supporting Information.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4201578PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0110726PLOS

Publication Analysis

Top Keywords

molecular systems
12
systems
8
crispr-cas systems
8
macsyfinder
5
macsyfinder program
4
program mine
4
mine genomes
4
molecular
4
genomes molecular
4
systems application
4

Similar Publications

Benzothiadiazole (BT) has shown promising applications in fullerene solar cells. However, few BT-based polymer donors exhibited a noticeable power conversion efficiency (PCE) for the fused-ring small molecular acceptor-based polymer solar cells (PSCs). Herein, we developed a D-A (D: donor, A: acceptor) polymer donor F-1 based on fluorinated BT (ffBT) as A unit and chlorinated benzo [1,2-b:4,5-b'] dithiophene (BDT-2Cl) as D unit.

View Article and Find Full Text PDF

A bis(triarylamine) (BTA) radical cation, bridged by two o-terphenylene moieties, was prepared and characterized to explore the impact of the double-π-bridge on the intramolecular charge/spin transfer process in the 2-site organic mixed-valence (MV) compound. Spectroscopic analyses on optically and thermally assisted intervalence charge-transfer (IVCT) processes revealed that the doubly π-bridging enhanced the charge delocalization between two nitrogen redox-active centers, whereas the electronic coupling was not so strengthened, in comparison with the singly π-bridging reference compound.

View Article and Find Full Text PDF

Nanographenes and polycyclic aromatic hydrocarbons, both finite forms of graphene, are promising organic semiconducting materials because their optoelectronic and magnetic properties can be modulated through precise control of their molecular peripheries. Several atomically precise edge structures have been prepared by bottom-up synthesis; however, no systematic elucidation of these edge topologies at the molecular level has been reported. Herein, we describe rationally designed modular syntheses of isomeric dibenzoixenes with diverse molecular peripheries, including cove, zigzag, bay, fjord, and gulf structured.

View Article and Find Full Text PDF

Purpose/background: Clozapine is the recommended drug for treatment-resistant schizophrenia. Drug response could be affected by numerous factors such as age, sex, body mass index, co-medication, consumption of xanthine-containing beverages, smoking, and genetic variants of the enzymes involved in clozapine metabolism (CYP1A2, CYP3A4, and, to a lesser extent, CYP2C19 and CYP2D6). This study evaluated genetic and nongenetic variables that may affect clozapine plasma concentrations in Uruguayan patients with schizophrenia.

View Article and Find Full Text PDF

Multiyear and seasonal wide-scale indicators for French surface waters contamination by WFD substances.

Environ Sci Pollut Res Int

December 2024

Office Français de la Biodiversité (OFB), 5 Allée Félix Nadar, 94300, Vincennes, France.

This study offers an unprecedented valuation of the French surface waters WFD chemical monitoring dataset, covering 101 substances (metals, industrial and persistent organic pollutants (POPs), plant protection product (PPP) and biocides active substances, combustion residues) measured monthly on 4000 sites of the 6 main continental river basins, during 12 years (2009-2020). The concentration data were first made comparable through an original process removing the bias induced by the space-and-time heterogeneity of the monitoring labs performance, to gather a reference workable set of monthly contamination indicators. These were then used to display the substances' seasonal and interannual timeseries, revealing, e.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!

A PHP Error was encountered

Severity: Notice

Message: fwrite(): Write of 34 bytes failed with errno=28 No space left on device

Filename: drivers/Session_files_driver.php

Line Number: 272

Backtrace:

A PHP Error was encountered

Severity: Warning

Message: session_write_close(): Failed to write session data using user defined save handler. (session.save_path: /var/lib/php/sessions)

Filename: Unknown

Line Number: 0

Backtrace: