Project description:A Metaproteomic Workflow for Sample Preparation and Data Analysis Applied to Mouse Faeces: 1 MTD project_description Many diseases have been associated with gut microbiome abnormalities. The root cause of such diseases is not only due to bacterial dysbiosis, but also to change in bacterial functions, which are best studied by proteomic approaches. Although bacterial proteomics is well established, metaproteomics is hindered by challenges associated with the sample physical structure, contaminating proteins, the simultaneous analysis of hundreds of species and the subsequent data analysis. Here, we present a systematic assessment of sample preparation and data analysis methodologies applied to LC-MS/MS metaproteomics experiment. We could show that low speed centrifugation (LSC) has a significant impact on both peptide identifications and reproducibility. LSC led to increase in peptide and proteins identifications compare to no LSC. Notably, the dominant bacterial phyla, i.e. Firmicutes and Bacteroidetes, showed divergent representation between LSC and no-LSC. In terms of data processing, protein sequence databases derived from mouse faeces metagenome provided at least four times more MS/MS identification compared to databases of concatenated single organisms. We also demonstrated that two-steps database search strategy comes at the expense of a dramatic rise in number of false positives compared to single-step strategy. Overall, we found a positive correlation between matching metaproteome and metagenome abundance, which could be linked to core microbial functions, such as glycolysis-gluconeogenesis, citrate cycle and carbon metabolism. We observed significant overlap and correlation at the phylum, class, order and family taxonomic levels between taxonomy-derived from metagenome and metaproteome. Notably, nearly all functional categories (e.g., membrane transport, translation, transcription) were differentially abundant in the metaproteome (activity) compared to what would be expected from the metagenome (potential). In conclusion, these results highlight the need to perform metaproteomics when studying complex microbiome samples.