Unknown

Dataset Information

0

Using machine learning to detect the differential usage of novel gene isoforms.


ABSTRACT:

Background

Differential isoform usage is an important driver of inter-individual phenotypic diversity and is linked to various diseases and traits. However, accurately detecting the differential usage of different gene transcripts between groups can be difficult, in particular in less well annotated genomes where the spectrum of transcript isoforms is largely unknown.

Results

We investigated whether machine learning approaches can detect differential isoform usage based purely on the distribution of reads across a gene region. We illustrate that gradient boosting and elastic net approaches can successfully identify large numbers of genes showing potential differential isoform usage between Europeans and Africans, that are enriched among relevant biological pathways and significantly overlap those identified by previous approaches. We demonstrate that diversity at the 3' and 5' ends of genes are primary drivers of these differences between populations.

Conclusion

Machine learning methods can effectively detect differential isoform usage from read fraction data, and can provide novel insights into the biological differences between groups.

SUBMITTER: Zhang X 

PROVIDER: S-EPMC8764765 | biostudies-literature | 2022 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Using machine learning to detect the differential usage of novel gene isoforms.

Zhang Xiaopu X   Hassan Musa A MA   Prendergast James G D JGD  

BMC bioinformatics 20220118 1


<h4>Background</h4>Differential isoform usage is an important driver of inter-individual phenotypic diversity and is linked to various diseases and traits. However, accurately detecting the differential usage of different gene transcripts between groups can be difficult, in particular in less well annotated genomes where the spectrum of transcript isoforms is largely unknown.<h4>Results</h4>We investigated whether machine learning approaches can detect differential isoform usage based purely on  ...[more]

Similar Datasets

2021-09-03 | GSE183191 | GEO
| S-EPMC9823370 | biostudies-literature
2021-09-03 | GSE183189 | GEO
2021-09-03 | GSE183190 | GEO
| S-EPMC2945940 | biostudies-literature
| S-EPMC10248971 | biostudies-literature
| S-EPMC10467215 | biostudies-literature
2013-01-01 | E-GEOD-29210 | biostudies-arrayexpress
| S-EPMC8164276 | biostudies-literature
| S-EPMC7314597 | biostudies-literature