105 results match your criteria: "Language Technologies Institute[Affiliation]"

Eliciting and receiving online support: using computer-aided content analysis to examine the dynamics of online social support.

J Med Internet Res

April 2015

Language Technologies Institute, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, United States.

Background: Although many people with serious diseases participate in online support communities, little research has investigated how participants elicit and provide social support on these sites.

Objective: The first goal was to propose and test a model of the dynamic process through which participants in online support communities elicit and provide emotional and informational support. The second was to demonstrate the value of computer coding of conversational data using machine learning techniques (1) by replicating results derived from human-coded data about how people elicit support and (2) by answering questions that are intractable with small samples of human-coded data, namely how exposure to different types of social support predicts continued participation in online support communities.

View Article and Find Full Text PDF

Logsum using Garbled Circuits.

PLoS One

March 2016

INESC-ID, Lisbon, Portugal; Instituto Superior Técnico, Universidade de Lisboa, Lisbon, Portugal.

Secure multiparty computation allows for a set of users to evaluate a particular function over their inputs without revealing the information they possess to each other. Theoretically, this can be achieved using fully homomorphic encryption systems, but so far they remain in the realm of computational impracticability. An alternative is to consider secure function evaluation using homomorphic public-key cryptosystems or Garbled Circuits, the latter being a popular trend in recent times due to important breakthroughs.

View Article and Find Full Text PDF

We consider the problem of building a model to predict protein-protein interactions (PPIs) between the bacterial species Salmonella Typhimurium and the plant host Arabidopsis thaliana which is a host-pathogen pair for which no known PPIs are available. To achieve this, we present approaches, which use homology and statistical learning methods called "transfer learning." In the transfer learning setting, the task of predicting PPIs between Arabidopsis and its pathogen S.

View Article and Find Full Text PDF

Comparing human-Salmonella with plant-Salmonella protein-protein interaction predictions.

Front Microbiol

February 2015

Klein-Seetharaman Laboratory, Division of Metabolic and Vascular Health, Warwick Medical School, University of Warwick, Coventry, UK.

Salmonellosis is the most frequent foodborne disease worldwide and can be transmitted to humans by a variety of routes, especially via animal and plant products. Salmonella bacteria are believed to use not only animal and human but also plant hosts despite their evolutionary distance. This raises the question if Salmonella employs similar mechanisms in infection of these diverse hosts.

View Article and Find Full Text PDF

Insights from computational modeling in inflammation and acute rejection in limb transplantation.

PLoS One

May 2015

Department of Visceral, Transplant and Thoracic Surgery, Innsbruck Medical University, Innsbruck, Austria; Department of Plastic and Reconstructive Surgery, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America.

Acute skin rejection in vascularized composite allotransplantation (VCA) is the major obstacle for wider adoption in clinical practice. This study utilized computational modeling to identify biomarkers for diagnosis and targets for treatment of skin rejection. Protein levels of 14 inflammatory mediators in skin and muscle biopsies from syngeneic grafts [n = 10], allogeneic transplants without immunosuppression [n = 10] and allografts treated with tacrolimus [n = 10] were assessed by multiplexed analysis technology.

View Article and Find Full Text PDF

GWAS in a box: statistical and visual analytics of structured associations via GenAMap.

PLoS One

August 2015

Human Computer Interaction Institute, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America.

With the continuous improvement in genotyping and molecular phenotyping technology and the decreasing typing cost, it is expected that in a few years, more and more clinical studies of complex diseases will recruit thousands of individuals for pan-omic genetic association analyses. Hence, there is a great need for algorithms and software tools that could scale up to the whole omic level, integrate different omic data, leverage rich structure information, and be easily accessible to non-technical users. We present GenAMap, an interactive analytics software platform that 1) automates the execution of principled machine learning methods that detect genome- and phenome-wide associations among genotypes, gene expression data, and clinical or other macroscopic traits, and 2) provides new visualization tools specifically designed to aid in the exploration of association mapping results.

View Article and Find Full Text PDF

Belief polarization occurs when 2 people with opposing prior beliefs both strengthen their beliefs after observing the same data. Many authors have cited belief polarization as evidence of irrational behavior. We show, however, that some instances of polarization are consistent with a normative account of belief revision.

View Article and Find Full Text PDF

Abstract 102: inflammatory mediators modulate alloreactive T cell susceptibility to immune-regulation in reconstructive transplantation.

Plast Reconstr Surg

March 2014

Johns Hopkins University School of Medicine, Dept of Plastic & Reconstructive Surgery, Baltimore, MD, University of Pittsburgh Medical Center, Starzl Transplantation Institute, Pittsburgh, PA, Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA.

View Article and Find Full Text PDF

Objective: Coding of clinical communication for fine-grained features such as speech acts has produced a substantial literature. However, annotation by humans is laborious and expensive, limiting application of these methods. We aimed to show that through machine learning, computers could code certain categories of speech acts with sufficient reliability to make useful distinctions among clinical encounters.

View Article and Find Full Text PDF

Multitask learning for host-pathogen protein interactions.

Bioinformatics

July 2013

Language Technologies Institute, School of Computer Science, Carnegie Mellon University, 5000 Forbes Ave, PA 15213, USA.

Article Synopsis
  • The research focuses on understanding how different infectious diseases work by analyzing host-pathogen interactions using systems biology and multitask learning.
  • The study specifically predicts human host interactions with four bacterial pathogens, using advanced optimization techniques to improve model accuracy.
  • Results indicate that this integrated approach is more effective than traditional methods, with insights gained from the protein interaction predictions being available online.
View Article and Find Full Text PDF

The emerging field of vascular composite allotransplantation (VCA) has become a clinical reality. Building upon cutting edge understandings of transplant surgery and immunology, complex grafts such as hands and faces can now be transplanted with success. Many of the challenges that have historically been limiting factors in transplantation, such as rejection and the morbidity of immunosuppression, remain challenges in VCA.

View Article and Find Full Text PDF

We introduce three algorithms for learning generative models of molecular structures from molecular dynamics simulations. The first algorithm learns a Bayesian-optimal undirected probabilistic model over user-specified covariates (e.g.

View Article and Find Full Text PDF

TVNViewer: an interactive visualization tool for exploring networks that change over time or space.

Bioinformatics

July 2011

Joint CMU-Pitt PhD Program in Computational Biology, Lane Center for Computational Biology, School of Computer Science, Language Technologies Institute and Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA 15213, USA.

Unlabelled: The relationship between genes and proteins is a dynamic relationship that changes across time and differs in different cells. The study of these differences can reveal various insights into biological processes and disease progression, especially with the aid of proper tools for network visualization. Toward this purpose, we have developed TVNViewer, a novel visualization tool, which is specifically designed to aid in the exploration and analysis of dynamic networks.

View Article and Find Full Text PDF

We introduce a new approach to learning statistical models from multiple sequence alignments (MSA) of proteins. Our method, called GREMLIN (Generative REgularized ModeLs of proteINs), learns an undirected probabilistic graphical model of the amino acid composition within the MSA. The resulting model encodes both the position-specific conservation statistics and the correlated mutation statistics between sequential and long-range pairs of residues.

View Article and Find Full Text PDF

Structured Literature Image Finder: Parsing Text and Figures in Biomedical Literature.

Web Semant

July 2010

Machine Learning Department, Carnegie Mellon University ; Joint Carnegie Mellon University-University of Pittsburgh Ph.D. Program in Computational Biology ; Center for Bioimage Informatics, Carnegie Mellon University ; Lane Center for Computational Biology, Carnegie Mellon University ; Department of Biological Sciences, Carnegie Mellon University ; Department of Biomedical Engineering, Carnegie Mellon University.

The SLIF project combines text-mining and image processing to extract structured information from biomedical literature. SLIF extracts images and their captions from published papers. The captions are automatically parsed for relevant biological entities (protein and cell type names), while the images are classified according to their type (e.

View Article and Find Full Text PDF

Recent multivariate analyses of fMRI activation have shown that discriminative classifiers such as Support Vector Machines (SVM) are capable of decoding fMRI-sensed neural states associated with the visual presentation of categories of various objects. However, the lack of a generative model of neural activity limits the generality of these discriminative classifiers for understanding the underlying neural representation. In this study, we propose a generative classifier that models the hidden factors that underpin the neural representation of objects, using a multivariate multiple linear regression model.

View Article and Find Full Text PDF

Alternative paths in HIV-1 targeted human signal transduction pathways.

BMC Genomics

December 2009

Language Technologies Institute, School of Computer Science, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213, USA.

Background: Human immunodeficiency virus-1 (HIV-1) has a minimal genome of only 9 genes, which encode 15 proteins. HIV-1 thus depends on the human host for virtually every aspect of its life cycle. The universal language of communication in biological systems, including between pathogen and host, is via signal transduction pathways.

View Article and Find Full Text PDF

Semi-automatically labeling objects in images.

IEEE Trans Image Process

June 2009

Language Technologies Institute, School of Computer Science at Carnegie Mellon University, Pittsburgh, PA 15213, USA.

Labeling objects in images plays a crucial role in many visual learning and recognition applications that need training data, such as image retrieval, object detection and recognition. Manually creating object labels in images is time consuming and, thus, becomes impossible for labeling a large image dataset. In this paper, we present a family of semi-automatic methods based on a graph-based semi-supervised learning algorithm for labeling objects in images.

View Article and Find Full Text PDF

Loops connecting the transmembrane (TM) alpha-helices in membrane proteins are expected to affect the structural organization of the thereby connected helices and the helical bundles as a whole. This effect, which has been largely ignored previously, is studied here by analyzing the x-ray structures of 41 alpha-helical membrane proteins. First we define the loop flexibility ratio, R, and find that 53% of the loops are stretched, where a stretched loop constrains the distance between the two connected helices.

View Article and Find Full Text PDF

Background: Prediction of transmembrane (TM) helices by statistical methods suffers from lack of sufficient training data. Current best methods use hundreds or even thousands of free parameters in their models which are tuned to fit the little data available for training. Further, they are often restricted to the generally accepted topology "cytoplasmic-transmembrane-extracellular" and cannot adapt to membrane proteins that do not conform to this topology.

View Article and Find Full Text PDF

Unlabelled: TMpro is a transmembrane (TM) helix prediction algorithm that uses language processing methodology for TM segment identification. It is primarily based on the analysis of statistical distributions of properties of amino acids in transmembrane segments. This article describes the availability of TMpro on the internet via a web interface.

View Article and Find Full Text PDF

Comparison of stability predictions and simulated unfolding of rhodopsin structures.

Photochem Photobiol

October 2007

Language Technologies Institute, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.

Developing a better mechanistic understanding of membrane protein folding is urgently needed because of the discovery of an increasing number of human diseases, where membrane protein instability and misfolding is involved. Towards this goal, we investigated folding and stability of 7-transmembrane (TM) helical bundles by computational methods. We compared the results of three different algorithms for predicting changes in stability of proteins against an experimental mutation dataset obtained for bacteriorhodopsin (BR) and mammalian rhodopsin and find that 61.

View Article and Find Full Text PDF

Inferring pairwise regulatory relationships from multiple time series datasets.

Bioinformatics

March 2007

Machine Learning Department, Language Technologies Institute, Computer Science Department and Department of Biological Sciences, Carnegie Mellon University, 5000 Forbes Ave., Pittsburgh, PA 15213, USA.

Motivation: Time series expression experiments have emerged as a popular method for studying a wide range of biological systems under a variety of conditions. One advantage of such data is the ability to infer regulatory relationships using time lag analysis. However, such analysis in a single experiment may result in many false positives due to the small number of time points and the large number of genes.

View Article and Find Full Text PDF

A discriminative learning framework with pairwise constraints for video object classification.

IEEE Trans Pattern Anal Mach Intell

April 2006

Language Technologies Institute, Carnegie Mellon University, 5000 Forbes Ave., Pittsburgh, PA 15213, USA.

To deal with the problem of insufficient labeled data in video object classification, one solution is to utilize additional pairwise constraints that indicate the relationship between two examples, i.e., whether these examples belong to the same class or not.

View Article and Find Full Text PDF

BLMT: statistical sequence analysis using N-grams.

Appl Bioinformatics

December 2005

Language Technologies Institute, Carnegie Mellon University, 4400 Fifth Avenue, Pittsburgh, PA 15213, USA.

Unlabelled: Statistical analysis of amino acid and nucleotide sequences, especially sequence alignment, is one of the most commonly performed tasks in modern molecular biology. However, for many tasks in bioinformatics, the requirement for the features in an alignment to be consecutive is restrictive and "n-grams" (aka k-tuples) have been used as features instead. N-grams are usually short nucleotide or amino acid sequences of length n, but the unit for a gram may be chosen arbitrarily.

View Article and Find Full Text PDF