Publications by authors named "Matthew Vaughn"

CyVerse, the largest publicly funded open-source research cyberinfrastructure for life sciences, has played a crucial role in advancing data-driven research since the 2010s. As the technology landscape evolved with the emergence of cloud computing platforms, machine learning, and artificial intelligence (AI) applications, CyVerse has enabled access by providing interfaces, Software as a Service (SaaS), and cloud-native Infrastructure as Code (IaC) to leverage new technologies. CyVerse services enable researchers to integrate institutional and private computational resources and custom software, perform analyses, and publish data in accordance with open science principles.

Computational tools addressing various components of design-build-test-learn (DBTL) loops for the construction of synthetic genetic networks exist but do not generally cover the entire DBTL loop. This manuscript introduces an end-to-end sequence of tools that together form a DBTL loop called Design Assemble Round Trip (DART). DART provides rational selection and refinement of genetic parts to construct and test a circuit.

We describe an experimental campaign that replicated the performance assessment of logic gates engineered into cells of Saccharomyces cerevisiae by Gander et al. Our experimental campaign used a novel high-throughput experimentation framework developed under the Defense Advanced Research Projects Agency's Synergistic Discovery and Design program: a remote robotic lab at Strateos executed a parameterized experimental protocol. Using this protocol and robotic execution, we generated two orders of magnitude more flow cytometry data than the original experiments. We discuss our results, which largely, but not completely, agree with the original report, and make some remarks about lessons learned.

Sequencing technologies, in particular RNA-seq, have become critical tools in the design, build, test, and learn cycle of synthetic biology. They provide a better understanding of synthetic designs, and they help identify ways to improve and select designs. While these data are beneficial to design, their collection and analysis form a complex, multistep process that has implications for both the discovery and the reproducibility of experiments.

Engineered proteins generally must possess a stable structure in order to achieve their designed function. Stable designs, however, are astronomically rare within the space of all possible amino acid sequences. As a consequence, many designs must be tested computationally and experimentally in order to find stable ones, which is expensive in terms of time and resources.

Synthetic biology is a complex discipline that involves creating detailed, purpose-built designs from genetic parts. This process is often phrased as a Design-Build-Test-Learn loop, where iterative design improvements can be made, implemented, measured, and analyzed. Automation can potentially improve both the end-to-end duration of the process and the utility of data produced by the process.

Microbes drive myriad ecosystem processes, but under strong influence from viruses. Because studying viruses in complex systems requires different tools than those for microbes, they remain underexplored. To combat this, we previously aggregated double-stranded DNA (dsDNA) virus analysis capabilities and resources into 'iVirus' on the CyVerse collaborative cyberinfrastructure.

Motivation: Applications in synthetic and systems biology can benefit from measuring whole-cell response to biochemical perturbations. Execution of experiments to cover all possible combinations of perturbations is infeasible. In this paper, we present the host response model (HRM), a machine learning approach that maps response of single perturbations to transcriptional response of the combination of perturbations.

Plant cells undergo two types of cell cycles: the mitotic cycle, in which DNA replication is coupled to mitosis, and the endocycle, in which DNA replication occurs in the absence of cell division. To investigate DNA replication programs in these two types of cell cycles, we pulse-labeled intact root tips of maize (Zea mays) with 5-ethynyl-2'-deoxyuridine (EdU) and used flow sorting of nuclei to examine DNA replication timing (RT) during the transition from a mitotic cycle to an endocycle. Comparison of the sequence-based RT profiles showed that most regions of the maize genome replicate at the same time during S phase in mitotic and endocycling cells, despite the need to replicate twice as much DNA in the endocycle and the fact that endocycling is typically associated with cell differentiation.

The selection and firing of DNA replication origins play key roles in ensuring that eukaryotes accurately replicate their genomes. This process is not well documented in plants due in large measure to difficulties in working with plant systems. We developed a new functional assay to label and map very early replicating loci that must, by definition, include at least a subset of replication origins.

Improvements in next-generation sequencing technologies have resulted in dramatically reduced sequencing costs. This has led to an explosion of '-seq'-based methods, of which RNA sequencing (RNA-seq) for generating transcriptomic data is the most popular. By analysing global patterns of gene expression in organs/tissues/cells of interest or in response to chemical or environmental perturbations, researchers can better understand an organism's biology.

DNA methylation is a chromatin modification that can provide epigenetic regulation of gene and transposon expression. Plants utilize several pathways to establish and maintain DNA methylation in specific sequence contexts. The chromomethylase (CMT) genes maintain CHG (where H = A, C or T) methylation.
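
As an illustration of the sequence contexts named above, the following Python sketch classifies a cytosine as CG, CHG, or CHH from the two bases downstream of it on the same strand. The function name `cytosine_context` and the "unknown" label for ambiguous or truncated contexts are assumptions made for this example; they are not taken from the study.

```python
# H stands for any base except G, as in the abstract (H = A, C or T).
H_BASES = {"A", "C", "T"}


def cytosine_context(seq: str, i: int) -> str:
    """Return the methylation context of the cytosine at index i.

    The sequence is read 5'->3' on the strand that carries the cytosine.
    Returns "CG", "CHG", "CHH", or "unknown" when the downstream bases
    are missing or ambiguous (for example N).
    """
    seq = seq.upper()
    if seq[i] != "C":
        raise ValueError("position is not a cytosine")

    downstream = seq[i + 1 : i + 3]  # at most two bases after the C
    if len(downstream) >= 1 and downstream[0] == "G":
        return "CG"
    if len(downstream) == 2 and downstream[0] in H_BASES:
        if downstream[1] == "G":
            return "CHG"
        if downstream[1] in H_BASES:
            return "CHH"
    return "unknown"


if __name__ == "__main__":
    for s in ("CAG", "CTT", "CGA"):
        print(s, cytosine_context(s, 0))
```

Cytosines on the reverse strand would be classified the same way after reverse-complementing the sequence.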

Biomedical data are quickly growing in volume and in variety, providing clinicians an opportunity for better clinical decision support. Here, we demonstrate a robust platform that uses software automation and high performance computing (HPC) resources to achieve real-time analytics of clinical data, specifically magnetic resonance imaging (MRI) data. We used the Agave application programming interface to facilitate communication, data transfer, and job control between an MRI scanner and an off-site HPC resource.

Eukaryotes use a temporally regulated process, known as the replication timing program, to ensure that their genomes are fully and accurately duplicated during S phase. Replication timing programs are predictive of genomic features and activity and are considered to be functional readouts of chromatin organization. Although replication timing programs have been described for yeast and animal systems, much less is known about the temporal regulation of plant DNA replication or its relationship to genome sequence and chromatin structure.

Training for experimental plant biologists needs to combine bioinformatics, quantitative approaches, computational biology, and training in the art of collaboration, best achieved through fully integrated curriculum development.

All plants and animals must replicate their DNA, using a regulated process to ensure that their genomes are completely and accurately replicated. DNA replication timing programs have been extensively studied in yeast and animal systems, but much less is known about the replication programs of plants. We report a novel adaptation of the "Repli-seq" assay for use in intact root tips of maize (Zea mays) that includes several different cell lineages, and we present whole-genome replication timing profiles from cells in early, mid, and late S phase of the mitotic cell cycle.

Background: Replication timing experiments that use label incorporation and high throughput sequencing produce peaked data similar to ChIP-Seq experiments. However, the differences in experimental design, coverage density, and possible results make traditional ChIP-Seq analysis methods inappropriate for use with replication timing.

Results: To accurately detect and classify regions of replication across the genome, we present Repliscan.

ThaleMine (https://apps.araport.org/thalemine/) is a comprehensive data warehouse that integrates a wide array of genomic information of the model plant Arabidopsis thaliana.

Epigenetic modification of DNA through methylation is known to be involved in multiple biological processes such as gene suppression. However, the exact mechanism by which DNA methylation acts is still unclear. In mammals, CpG islands (CGIs) have been studied extensively for their involvement in cancer.

Background: Drinking can occur because of expectations to drink (reasoned pathway) or because of willingness to drink under certain circumstances (reactive pathway). These pathways are thought to be influenced by different cognitions such as alcohol-related attitudes, norms, or drinking prototypes (Gerrard et al., 2008).

Objective: WHO and UNICEF recommend cup feeding for neonates unable to breastfeed in low-resource settings. In developed countries, cup feeding in lieu of bottle feeding in the neonatal period is hypothesized to improve breastfeeding outcomes for those initially unable to breastfeed. Our aim was to synthesize the entire body of evidence on cup feeding.

The iPlant Collaborative provides life science research communities access to comprehensive, scalable, and cohesive computational infrastructure for data management; identity management; collaboration tools; and cloud, high-performance, and high-throughput computing. iPlant provides training, learning material, and best practice resources to help all researchers make the best use of their data, expand their computational skill set, and effectively manage their data and computation when working as distributed teams. iPlant's platform permits researchers to easily deposit and share their data and deploy new computational tools and analysis workflows, allowing the broader community to easily use and reuse those data and computational analyses.

The maize genome is relatively large (~2.3 Gb) and has a complex organization of interspersed genes and transposable elements, which necessitates frequent boundaries between different types of chromatin. The examination of maize genes and conserved noncoding sequences revealed that many of these are flanked by regions of elevated asymmetric CHH (where H is A, C, or T) methylation (termed mCHH islands).

This article develops a conceptual framework for implementation of value-based payment (VBP) reform and then draws on that framework to systematically examine six distinct multi-stakeholder coalition VBP initiatives in three different regions of the United States. The VBP initiatives deploy the following payment models: reference pricing, "shadow" primary care capitation, bundled payment, pay for performance, shared savings within accountable care organizations, and global payment. The conceptual framework synthesizes prior models of VBP implementation.

DNA methylation is a stable modification of chromatin that can contribute to epigenetic variation through the regulation of genes or transposons. Profiling of DNA methylation in five maize (Zea mays) inbred lines found that while DNA methylation levels for more than 99% of the analyzed genomic regions are similar, there are still 5,000 to 20,000 context-specific differentially methylated regions (DMRs) between any two genotypes. The analysis of identical-by-state genomic regions that have limited genetic variation provided evidence that DMRs can occur without local sequence variation, but they are less common than in regions with genetic variation.
