Gene expression profiling in Inflammatory Bowel Disease identifies characteristics for anti-TNF-alpha response and targets for alternative therapies
Ontology highlight
ABSTRACT: The objective of this study was to identify pre-treatment colonic gene expression patterns in patients with inflammatory bowel disease (Ulcerative Colitis and colonic Crohn’s disease) that are predictive of a good versus poor response to treatment with the bioloic anti-TNFa. By identifying key pathways that are altered in responders versus non-responders we also aimed to propose novel targets for therapy. Pre-treatment colonic endoscopic biopsies were collected in RNAlater from 22 patients (14 CD and 8 UC) who were starting anti-TNFa therapy. RNA was extracted using RNeasy columns (Qiagen) and quantified by Qubit (Life Technologies). 0.5ng RNA was used to prepare uniquely indexed cDNA QIAseq UPX 3’ Transcriptome libraries according to manufacturer’s instructions (Qiagen). Libraries were quantified and quality-controlled using the QIAseq Library Quant Assay Kit and tapestation analysis and sequenced on the Miseq and Nextseq Illumina platforms using the illumina sequencing primer and PhiX 15% to a depth of 1-3milion reads/sample. Read numbers were set as 101 bp (R1) x 51 bp (R2). Fastq files were obtained through BaseSpace and reads de-multiplexed, aligned to human genome GRCh38, quantified and normalised by the TPM method using the CLC Genomics Workbench (Qiagen). The following differential expression procedure was performed for participants classified as responders, partial responders or non-responders according to their endoscopic presentation 12 weeks after therapy. For UC patients the Mayo score was used: Responders Mayo= 0, Partial responders 1<= Mayo <3 and Non Responders Mayo >=3. For CD patients % reduction in the SES-CD score was used: Responders >=50% of SES-CD, Partial responders 50% 75%. Normalised values were log-transformed with a pseudocount of 1 added. Transcripts with < 1 transformed count in any sample were excluded from further analysis, as were transcripts with low variance (defined as less than 10% unique counts across both conditions and greater than a 19:1 ratio of the most frequent count to the second most frequent across both conditions). Differential expression analysis between conditions was conducted with the limma package. Library size was estimated using reduced maximum likelihood estimator with 500 iterations. Initial fitting was performed using a robust M-estimation, and moderated test statistics computed by empirical Bayes. A FDR corrected p value <0.05 was considered and filtered for further downstream data analysis. Partial least squares discriminant analysis (PLS-DA) modelling was performed on these genes to calculate their variable Importance in Projection (VIP) score. To understand the biological significance and pathways represented in the DEGS, we also performed enrichment analysis using the DAVID Gene Functional Classification Tool (REF I,II). Finally, strongest indicators of response were predicted by Random Forest Area Under the Curve (AUC) analysis.
INSTRUMENT(S): NextSeq 500
ORGANISM(S): Homo sapiens
SUBMITTER: Louisa Jeffery
PROVIDER: E-MTAB-9944 | biostudies-arrayexpress |
REPOSITORIES: biostudies-arrayexpress
ACCESS DATA