Mass spectral libraries are collections of reference spectra, usually associated with specific analytes from which the spectra were generated, that are used for further downstream analysis of new spectra. There are many different formats used for encoding spectral libraries, but none have undergone a standardization process to ensure broad applicability to many applications. As part of the Human Proteome Organization Proteomics Standards Initiative (PSI), we have developed a standardized format for encoding spectral libraries, called mzSpecLib (https://psidev.
View Article and Find Full Text PDFRegulatory T cells (Tregs) play a key role in suppressing systemic effector immune responses, thereby preventing autoimmune diseases but also potentially contributing to tumor progression. Thus, there is great interest in clinically manipulating Tregs, but the precise mechanisms governing in vitro-induced Treg (iTreg) differentiation are not yet fully understood. Here, we used multiparametric mass cytometry to phenotypically profile human iTregs during the early stages of in vitro differentiation at single-cell level.
View Article and Find Full Text PDFThe function of hydroxysteroid dehydrogenase 12 (HSD17B12) in lipid metabolism is poorly understood. To study this further, we created mice with hepatocyte-specific knockout of HSD17B12 (LiB12cKO). From 2 months on, these mice showed significant fat accumulation in their liver.
View Article and Find Full Text PDFAims: Heterogeneity in the rate of β-cell loss in newly diagnosed type 1 diabetes patients is poorly understood and creates a barrier to designing and interpreting disease-modifying clinical trials. Integrative analyses of baseline multi-omics data obtained after the diagnosis of type 1 diabetes may provide mechanistic insight into the diverse rates of disease progression after type 1 diabetes diagnosis.
Methods: We collected samples in a pan-European consortium that enabled the concerted analysis of five different omics modalities in data from 97 newly diagnosed patients.
and dietary factors make important contributions toward health and development in early childhood. In this respect, serum proteomics of maturing infants can provide insights into studies of childhood diseases, which together with perinatal proteomes could reveal further biological perspectives. Accordingly, to determine differences between feeding groups and changes in infancy, serum proteomics analyses of mother-infant dyads with HLA-conferred susceptibility to type 1 diabetes ( = 22), weaned to either an extensively hydrolyzed or regular cow's milk formula, were made.
View Article and Find Full Text PDFPrevious studies have revealed heterogeneity in the progression to clinical type 1 diabetes in children who develop islet-specific antibodies either to insulin (IAA) or glutamic acid decarboxylase (GADA) as the first autoantibodies. Here, we test the hypothesis that children who later develop clinical disease have different early immune responses, depending on the type of the first autoantibody to appear (GADA-first or IAA-first). We use mass cytometry for deep immune profiling of peripheral blood mononuclear cell samples longitudinally collected from children who later progressed to clinical disease (IAA-first, GADA-first, ≥2 autoantibodies first groups) and matched for age, sex, and HLA controls who did not, as part of the Type 1 Diabetes Prediction and Prevention study.
View Article and Find Full Text PDFAims/hypothesis: There is a growing need for markers that could help indicate the decline in beta cell function and recognise the need and efficacy of intervention in type 1 diabetes. Measurements of suitably selected serum markers could potentially provide a non-invasive and easily applicable solution to this challenge. Accordingly, we evaluated a broad panel of proteins previously associated with type 1 diabetes in serum from newly diagnosed individuals during the first year from diagnosis.
View Article and Find Full Text PDFBackground: Type 1 diabetes is a complex heterogenous autoimmune disease without therapeutic interventions available to prevent or reverse the disease. This study aimed to identify transcriptional changes associated with the disease progression in patients with recent-onset type 1 diabetes.
Methods: Whole-blood samples were collected as part of the INNODIA study at baseline and 12 months after diagnosis of type 1 diabetes.
Background: In coeliac disease (CoD), the role of B-cells has mainly been considered to be production of antibodies. The functional role of B-cells has not been analysed extensively in CoD.
Methods: We conducted a study to characterize gene expression in B-cells from children developing CoD early in life using samples collected before and at the diagnosis of the disease.
Quantitative proteomics has matured into an established tool and longitudinal proteomics experiments have begun to emerge. However, no effective, simple-to-use differential expression method for longitudinal proteomics data has been released. Typically, such data is noisy, contains missing values, and has only few time points and biological replicates.
View Article and Find Full Text PDFTranscriptome level expression data connected to the spatial organization of the cells and molecules would allow a comprehensive understanding of how gene expression is connected to the structure and function in the biological systems. The spatial transcriptomics platforms may soon provide such information. However, the current platforms still lack spatial resolution, capture only a fraction of the transcriptome heterogeneity, or lack the throughput for large scale studies.
View Article and Find Full Text PDFThe coronavirus disease 2019 (COVID-19) caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is spreading across the world despite vast global vaccination efforts. Consequently, many studies have looked for potential human host factors and immune mechanisms associated with the disease. However, most studies have focused on comparing COVID-19 patients to healthy controls, while fewer have elucidated the specific host factors distinguishing COVID-19 from other infections.
View Article and Find Full Text PDFMass spectrometry-based metaproteomics is a relatively new field of research that enables the characterization of the functionality of microbiota. Recently, we demonstrated the applicability of data-independent acquisition (DIA) mass spectrometry to the analysis of complex metaproteomic samples. This allowed us to circumvent many of the drawbacks of the previously used data-dependent acquisition (DDA) mass spectrometry, mainly the limited reproducibility when analyzing samples with complex microbial composition.
View Article and Find Full Text PDFMass spectrometry proteomics has become an important part of modern immunology, making major contributions to understanding protein expression levels, subcellular localizations, posttranslational modifications, and interactions in various immune cell populations. New developments in both experimental and computational techniques offer increasing opportunities for exploring the immune system and the molecular mechanisms involved in immune responses. Here, we focus on current computational approaches to infer relevant information from large mass spectrometry based protein profiling datasets, covering the different steps of the analysis from protein identification and quantification to further mining and modelling of the protein abundance data.
View Article and Find Full Text PDFLarge-scale phosphoproteome profiling using mass spectrometry (MS) provides functional insight that is crucial for disease biology and drug discovery. However, extracting biological understanding from these data is an arduous task requiring multiple analysis platforms that are not adapted for automated high-dimensional data analysis. Here, we introduce an integrated pipeline that combines several R packages to extract high-level biological understanding from large-scale phosphoproteomic data by seamless integration with existing databases and knowledge resources.
View Article and Find Full Text PDFBackground: The existing risk prediction models for chemotherapy-induced febrile neutropenia (FN) do not necessarily apply to real-life patients in different healthcare systems and the external validation of these models are often lacking. Our study evaluates whether a machine learning-based risk prediction model could outperform the previously introduced models, especially when validated against real-world patient data from another institution not used for model training.
Methods: Using Turku University Hospital electronic medical records, we identified all patients who received chemotherapy for non-hematological cancer between the years 2010 and 2017 (N = 5879).
Breast cancer is now globally the most frequent cancer and leading cause of women's death. Two thirds of breast cancers express the luminal estrogen receptor-positive (ERα + ) phenotype that is initially responsive to antihormonal therapies, but drug resistance emerges. A major barrier to the understanding of the ERα-pathway biology and therapeutic discoveries is the restricted repertoire of luminal ERα + breast cancer models.
View Article and Find Full Text PDFMetagenomic approaches focus on taxonomy or gene annotation but lack power in defining functionality of gut microbiota. Therefore, metaproteomics approaches have been introduced to overcome this limitation. However, the common metaproteomics approach uses data-dependent acquisition mass spectrometry, which is known to have limited reproducibility when analyzing samples with complex microbial composition.
View Article and Find Full Text PDFData-independent acquisition (DIA) mode of mass spectrometry, such as the SWATH-MS technology, enables accurate and consistent measurement of proteins, which is crucial for comparative proteomics studies. However, there is lack of free and easy to implement data analysis protocols that can handle the different data processing steps from raw spectrum files to peptide intensity matrix and its downstream analysis. Here, we provide a data analysis protocol, named diatools, covering all these steps from spectral library building to differential expression analysis of DIA proteomics data.
View Article and Find Full Text PDFMotivation: Mass spectrometry combined with enrichment strategies for phosphorylated peptides has been successfully employed for two decades to identify sites of phosphorylation. However, unambiguous phosphosite assignment is considered challenging. Given that site-specific phosphorylation events function as different molecular switches, validation of phosphorylation sites is of utmost importance.
View Article and Find Full Text PDFMotivation: Global centering-based normalization is a commonly used normalization approach in mass spectrometry-based label-free proteomics. It scales the peptide abundances to have the same median intensities, based on an assumption that the majority of abundances remain the same across the samples. However, especially in phosphoproteomics, this assumption can introduce bias, as the samples are enriched during sample preparation which can mask the underlying biological changes.
View Article and Find Full Text PDFWe describe a new reproducibility-optimization method ROPECA for statistical analysis of proteomics data with a specific focus on the emerging data-independent acquisition (DIA) mass spectrometry technology. ROPECA optimizes the reproducibility of statistical testing on peptide-level and aggregates the peptide-level changes to determine differential protein-level expression. Using a 'gold standard' spike-in data and a hybrid proteome benchmark data we show the competitive performance of ROPECA over conventional protein-based analysis as well as state-of-the-art peptide-based tools especially in DIA data with consistent peptide measurements.
View Article and Find Full Text PDF