Dataset Information

CAncer bioMarker Prediction Pipeline (CAMPP)-A standardized framework for the analysis of quantitative biological data.

ABSTRACT: With the improvement of -omics and next-generation sequencing (NGS) methodologies, along with the lowered cost of generating these types of data, the analysis of high-throughput biological data has become standard both for forming and testing biomedical hypotheses. Our knowledge of how to normalize datasets to remove latent undesirable variances has grown extensively, making for standardized data that are easily compared between studies. Here we present the CAncer bioMarker Prediction Pipeline (CAMPP), an open-source R-based wrapper (https://github.com/ELELAB/CAncer-bioMarker-Prediction-Pipeline -CAMPP) intended to aid bioinformatic software-users with data analyses. CAMPP is called from a terminal command line and is supported by a user-friendly manual. The pipeline may be run on a local computer and requires little or no knowledge of programming. To avoid issues relating to R-package updates, a renv .lock file is provided to ensure R-package stability. Data-management includes missing value imputation, data normalization, and distributional checks. CAMPP performs (I) k-means clustering, (II) differential expression/abundance analysis, (III) elastic-net regression, (IV) correlation and co-expression network analyses, (V) survival analysis, and (VI) protein-protein/miRNA-gene interaction networks. The pipeline returns tabular files and graphical representations of the results. We hope that CAMPP will assist in streamlining bioinformatic analysis of quantitative biological data, whilst ensuring an appropriate bio-statistical framework.

SUBMITTER: Terkelsen T

PROVIDER: S-EPMC7108742 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

CAncer bioMarker Prediction Pipeline (CAMPP)-A standardized framework for the analysis of quantitative biological data.

Terkelsen Thilde T Krogh Anders A Papaleo Elena E

PLoS computational biology 20200316 3

With the improvement of -omics and next-generation sequencing (NGS) methodologies, along with the lowered cost of generating these types of data, the analysis of high-throughput biological data has become standard both for forming and testing biomedical hypotheses. Our knowledge of how to normalize datasets to remove latent undesirable variances has grown extensively, making for standardized data that are easily compared between studies. Here we present the CAncer bioMarker Prediction Pipeline ( ...[more]

PMID: 32176694

Dataset Information

CAncer bioMarker Prediction Pipeline (CAMPP)-A standardized framework for the analysis of quantitative biological data.

Publications

CAncer bioMarker Prediction Pipeline (CAMPP)-A standardized framework for the analysis of quantitative biological data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

CancerDiscover: an integrative pipeline for cancer biomarker and cancer class prediction from high-throughput sequencing data.
| S-EPMC5788660 | biostudies-literature

Systematic data analysis pipeline for quantitative morphological cell phenotyping.
| S-EPMC11298594 | biostudies-literature

TRAPLINE: a standardized and automated pipeline for RNA sequencing data analysis, evaluation and annotation.
| S-EPMC4702420 | biostudies-literature

Data-driven learning of structure augments quantitative prediction of biological responses.
| S-EPMC11233023 | biostudies-literature

Evaluation of polygenic prediction methodology within a reference-standardized framework.
| S-EPMC8121285 | biostudies-literature

A standardized immune phenotyping and automated data analysis platform for multicenter biomarker studies.
| S-EPMC6328091 | biostudies-literature

A computational framework for extracting biological insights from SRA cancer data.
| S-EPMC11890766 | biostudies-literature

Design and implementation of a standardized framework to generate and evaluate patient-level prediction models using observational healthcare data.
| S-EPMC6077830 | biostudies-literature

scAN1.0: A reproducible and standardized pipeline for processing 10X single cell RNAseq data.
| S-EPMC10741331 | biostudies-literature

PyQuant: A Versatile Framework for Analysis of Quantitative Mass Spectrometry Data.
| S-EPMC4974355 | biostudies-literature