Publications by authors named "Andrew R Dalby"

The origin of recent parallel outbreaks of the high pathogenicity H5N8 avian flu virus in Europe and in Japan can be traced to a single source population, which has most likely been spread by migratory birds. By using Bayesian coalescent methods to analyze the DNA sequences of the virus to find the times for divergence and combining this sequence data with bird migration data we can show the most likely locations and migratory pathways involved in the origin of the current outbreak. This population was most likely located in the Siberian summer breeding grounds of long-range migratory birds.

View Article and Find Full Text PDF

A complete phylogenetic analysis of all of the H9N2 hemagglutinin sequences that were collected between 1966 and 2012 was carried out in order to build a picture of the geographical and host specific evolution of the hemagglutinin protein. To improve the quality and applicability of the output data the sequences were divided into subsets based upon location and host species. The phylogenetic analysis of hemagglutinin reveals that the protein has distinct lineages between China and the Middle East, and that wild birds in both regions retain a distinct form of the H9 molecule, from the same lineage as the ancestral hemagglutinin.

View Article and Find Full Text PDF

Microarray data from cell lines of Non-Small Cell Lung Carcinoma (NSCLC) can be used to look for differences in gene expression between the cell lines derived from different tumour samples, and to investigate if these differences can be used to cluster the cell lines into distinct groups. Dividing the cell lines into classes can help to improve diagnosis and the development of screens for new drug candidates. The micro-array data is first subjected to quality control analysis and then subsequently normalised using three alternate methods to reduce the chances of differences being artefacts resulting from the normalisation process.

View Article and Find Full Text PDF

Background: MicroRNAs (miRNAs) constitute a class of single-stranded RNAs which play a crucial role in regulating development and controlling gene expression by targeting mRNAs and triggering either translation repression or messenger RNA (mRNA) degradation. miRNAs are widespread in eukaryotes and to date over 14,000 miRNAs have been identified by computational and experimental approaches. Several miRNAs are highly conserved across species.

View Article and Find Full Text PDF

Background: Biological systems are inherently inhomogeneous and spatial effects play a significant role in processes such as pattern formation. At the cellular level proteins are often localised either through static attachment or via a dynamic equilibrium. As well as spatial heterogeneity many cellular processes exhibit stochastic fluctuations and so to make inferences about the location of molecules there is a need for spatial stochastic models.

View Article and Find Full Text PDF

Background: Microsatellites have been used extensively in the field of comparative genomics. By studying microsatellites in coding regions we have a simple model of how genotypic changes undergo selection as they are directly expressed in the phenotype as altered proteins. The simplest of these tandem repeats in coding regions are the tri-nucleotide repeats which produce a repeat of a single amino acid when translated into proteins.

View Article and Find Full Text PDF

Previous molecular dynamic simulations have reported elongation of the existing beta-sheet in prion proteins. Detailed examination has shown that these elongations do not extend beyond the proline residues flanking these beta-sheets. In addition, proline has also been suggested to possess a possible structural role in preserving protein interaction sites by preventing invasion of neighboring secondary structures.

View Article and Find Full Text PDF

Background: Single amino acid repeats make up a significant proportion in all of the proteomes that have currently been determined. They have been shown to be functionally and medically significant, and are associated with cancers and neuro-degenerative diseases such as Huntington's Chorea, where a poly-glutamine repeat is responsible for causing the disease. The COPASAAR database is a new tool to facilitate the rapid analysis of single amino acid repeats at a proteome level.

View Article and Find Full Text PDF

Accurately predicting phosphorylation sites in proteins is an important issue in postgenomics, for which how to efficiently extract the most predictive features from amino acid sequences for modeling is still challenging. Although both the distributed encoding method and the bio-basis function method work well, they still have some limits in use. The distributed encoding method is unable to code the biological content in sequences efficiently, whereas the bio-basis function method is a nonparametric method, which is often computationally expensive.

View Article and Find Full Text PDF

Fatal familial insomnia (FFI) and Creutzfeldt-Jakob disease (CJD) are associated to the same mutation at codon 178 but differentiate into clinicopathologically distinct diseases determined by this mutation and a naturally occurring methionine-valine polymorphism at codon 129 of the prion protein gene. It has been suggested that the clinical and pathological difference between FFI and CJD is caused by different conformations of the prion protein. Using molecular dynamics (MD), we investigated the effect of the mutation at codon 178 and the polymorphism at codon 129 on prion protein dynamics and conformation at normal and elevated temperatures.

View Article and Find Full Text PDF

Feature selection is a key step in Quantitative Structure Activity Relationship (QSAR) analysis. Chance correlations and multicollinearity are two major problems often encountered when attempting to find generalized QSAR models for use in drug design. Optimal QSAR models require an objective variable relevance analysis step for producing robust classifiers with low complexity and good predictive accuracy.

View Article and Find Full Text PDF

Motivation: In order to design effective HIV inhibitors, studying and understanding the mechanism of HIV protease cleavage specification is critical. Various methods have been developed to explore the specificity of HIV protease cleavage activity. However, success in both extracting discriminant rules and maintaining high prediction accuracy is still challenging.

View Article and Find Full Text PDF

Protein phosphorylation is a post-translational modification performed by a group of enzymes known as the protein kinases or phosphotransferases (Enzyme Commission classification 2.7). It is essential to the correct functioning of both proteins and cells, being involved with enzyme control, cell signalling and apoptosis.

View Article and Find Full Text PDF