Massive amounts of metagenomics data are currently being produced, and in all such projects a sizeable fraction of the resulting data shows no or little homology to known sequences. It is likely that this fraction contains novel viruses, but identification is challenging since they frequently lack homology to known viruses. To overcome this problem, we developed a strategy to detect ORFan protein families in shotgun metagenomics data, using similarity-based clustering and a set of filters to extract bona fide protein families.
View Article and Find Full Text PDFBackground: The AKT/mTORC1/S6K pathway is frequently overstimulated in breast cancer, constituting a promising therapeutic target. The benefit from mTOR inhibitors varies, likely as a consequence of tumour heterogeneity, and upregulation of several compensatory feed-back mechanisms. The mTORC1 downstream effectors S6K1, S6K2, and 4EBP1 are amplified and overexpressed in breast cancer, associated with a poor outcome and divergent endocrine treatment benefit.
View Article and Find Full Text PDFBMC Bioinformatics
September 2012
Background: Roche 454 sequencing is the leading sequencing technology for producing long read high throughput sequence data. Unlike most methods where sequencing errors translate to base uncertainties, 454 sequencing inaccuracies create nucleotide gaps. These gaps are particularly troublesome for translated search tools such as BLASTx where they introduce frame-shifts and result in regions of decreased identity and/or terminated alignments, which affect further analysis.
View Article and Find Full Text PDFInfections during pregnancy have been suggested to be involved in childhood leukemias. We used high-throughput sequencing to describe the viruses most readily detectable in serum samples of pregnant women. Serum DNA of 112 mothers to leukemic children was amplified using whole genome amplification.
View Article and Find Full Text PDFThe human respiratory tract is heavily exposed to microorganisms. Viral respiratory tract pathogens, like RSV, influenza and rhinoviruses cause major morbidity and mortality from respiratory tract disease. Furthermore, as viruses have limited means of transmission, viruses that cause pathogenicity in other tissues may be transmitted through the respiratory tract.
View Article and Find Full Text PDFBMC Res Notes
October 2011
Background: Roche 454 is one of the major 2nd generation sequencing platforms. The particular characteristics of 454 sequence data pose new challenges for bioinformatic analyses, e.g.
View Article and Find Full Text PDFBackground: High throughput pyrosequencing (454 sequencing) is the major sequencing platform for producing long read high throughput data. While most other sequencing techniques produce reading errors mainly comparable with substitutions, pyrosequencing produce errors mainly comparable with gaps. These errors are less efficiently detected by most conventional alignment programs and may produce inaccurate alignments.
View Article and Find Full Text PDFBackground: Chronic fatigue syndrome is an idiopathic syndrome widely suspected of having an infectious or immune etiology. We applied an unbiased metagenomic approach to try to identify known or novel infectious agents in the serum of 45 cases with chronic fatigue syndrome or idiopathic chronic fatigue. Controls were the unaffected monozygotic co-twins of cases, and serum samples were obtained at the same place and time.
View Article and Find Full Text PDF