Analysis of large-scale data-independent acquisition mass spectrometry metaproteomics data remains a computational challenge. Here, we present a computational pipeline called metaExpertPro for metaproteomics data analysis. This pipeline encompasses spectral library generation using data-dependent acquisition MS, protein identification and quantification using data-independent acquisition mass spectrometry, functional and taxonomic annotation, as well as quantitative matrix generation for both microbiota and hosts. By integrating FragPipe and DIA-NN, metaExpertPro offers compatibility with both Orbitrap and timsTOF MS instruments. To evaluate the depth and accuracy of identification and quantification, we conducted extensive assessments using human fecal samples and benchmark tests. Performance tests conducted on human fecal samples indicated that metaExpertPro quantified an average of 45,000 peptides in a 60-min diaPASEF injection. Notably, metaExpertPro outperformed three existing software tools by characterizing a higher number of peptides and proteins. Importantly, metaExpertPro maintained a low factual false discovery rate of approximately 5% for protein groups across four benchmark tests. Applying a filter of five peptides per genus, metaExpertPro achieved relatively high accuracy (F-score = 0.67-0.90) in genus diversity and showed a high correlation (r = 0.73-0.82) between the measured and true genus relative abundance in benchmark tests. Additionally, the quantitative results at the protein, taxonomy, and function levels exhibited high reproducibility and consistency across the commonly adopted public human gut microbial protein databases IGC and UHGP. In a metaproteomic analysis of dyslipidemia patients, metaExpertPro revealed characteristic alterations in microbial functions and potential interactions between the microbiota and the host.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.mcpro.2024.100840 | DOI Listing |
Biotechnol J
January 2025
Biologics Process Research & Development, Merck & Co., Inc., Rahway, New Jersey, USA.
Chinese hamster ovary (CHO) cells are widely used to produce recombinant proteins, including monoclonal antibodies (mAbs), through various process modes. While fed-batch (FB) processes have been the standard, a shift toward high-density perfusion processes is being driven by increased productivity, flexible facility footprints, and lower costs. Ensuring the clearance of process-related impurities, such as host cell proteins (HCPs), is crucial in biologics manufacturing.
View Article and Find Full Text PDFNat Protoc
January 2025
Department Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany.
Deep and accurate proteome analysis is crucial for understanding cellular processes and disease mechanisms; however, it is challenging to implement in routine settings. In this protocol, we combine a robust chromatographic platform with a high-performance mass spectrometric setup to enable routine yet in-depth proteome coverage for a broad community. This entails tip-based sample preparation and pre-formed gradients (Evosep One) combined with a trapped ion mobility time-of-flight mass spectrometer (timsTOF, Bruker).
View Article and Find Full Text PDFTalanta
January 2025
State Key Laboratory of Component-based Chinese Medicine, Tianjin Key Laboratory of TCM Chemistry and Analysis, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Jinghai District, Tianjin, 301617, PR China; Haihe Laboratory of Modern Chinese Medicine, Tianjin, 301617, PR China. Electronic address:
Metabolites identification is the major bottleneck in untargeted LC-MS metabolomics, primarily due to the limited availability of MS information for most detected metabolites in data dependent acquisition (DDA) mode. To solve this problem, we have integrated the iterative, interval, and segmented window acquisition concepts to develop an innovative non-fixed segmented window interval data dependency acquisition (NFSWI-DDA) mode, which achieves comparable MS coverage to data independent acquisition (DIA) mode. This acquisition strategy harnesses the strengths of both DDA and DIA, which could provide extensive coverage and excellent reproducibility of MS spectra.
View Article and Find Full Text PDFAnal Chem
January 2025
Department of Applied Biology and Chemical Technology, State Key Laboratory of Chemical Biology and Drug Discovery, Hong Kong Polytechnic University, Hong Kong 999077, China.
Alternative proteins (AltProts) are a class of proteins encoded by DNA sequences previously classified as noncoding. Despite their historically being overlooked, recent studies have highlighted their widespread presence and distinctive biological roles. So far, direct detection of AltProt has been relying on data-dependent acquisition (DDA) mass spectrometry (MS).
View Article and Find Full Text PDFJ Agric Food Chem
January 2025
Guangdong Subcenter of the National Center for Soybean Improvement, College of Agriculture, South China Agricultural University, Guangzhou 510642, China.
As complex quantitative traits, soybean seed oil and protein contents are governed by dynamic proteome networks that remain largely unknown. Here, we investigated the dynamic changes of the proteome during seed maturation across two soybean varieties with contrasting seed oil and protein content. Through optimizing the detectability of low-abundance proteins and utilizing library-free data-independent acquisition (directDIA) mass spectrometry, we unprecedentedly identified 7414 proteins and 3975 protein groups (PGs), substantially expanding the soybean seed proteome.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!