Dataset Information

MiRBind: a Deep Learning Method for miRNA Binding Classification

ABSTRACT: miRBind: a Deep Learning Method for miRNA Binding Classification

PROVIDER: PRJNA903902 | ENA |

REPOSITORIES: ENA

ACCESS DATA

Dataset's files

Source:

			Action	DRS
	SRR22360854.fastq.gz	Fastqsanger.gz
	SRR22360855.fastq.gz	Fastqsanger.gz

Items per page:

1 - 2 of 2

Similar Datasets

Project description:Background: Primary knee osteoarthritis (KOA) is a heterogeneous disease with clinical and molecular contributors. Biofluids contain microRNAs and metabolites that can be measured by omic technologies. Deep learning captures complex non-linear associations within multimodal data but, to date, has not been used for multi-omic-based endotyping of KOA patients. We developed a novel multimodal deep learning framework for clustering of multi-omic data from three subject-matched biofluids to identify distinct KOA endotypes and classify one-year post-total knee arthroplasty (TKA) pain/function responses. Materials and Methods: In 414 KOA patients, subject-matched plasma, synovial fluid and urine were analyzed by microRNA sequencing or metabolomics. Integrating 4 high-dimensional datasets comprising metabolites from plasma (n=151 features), along with microRNAs from plasma (n=421), synovial fluid (n=930), or urine (n=1225), a multimodal deep learning variational autoencoder architecture with K-means clustering was employed. Features influencing cluster assignment were identified and pathway analyses conducted. An integrative machine learning framework combining 4 molecular domains and a clinical domain was then used to classify WOMAC pain/function responses post-TKA within each cluster. Findings: Multimodal deep learning-based clustering of subjects across 4 domains yielded 3 distinct patient clusters. Feature signatures comprising microRNAs and metabolites across biofluids included 30, 16, and 24 features associated with Clusters 1-3, respectively. Pathway analyses revealed distinct pathways associated with each cluster. Integration of 4 multi-omic domains along with clinical data improved response classification performance, with Cluster 3 achieving AUC=0·879 for subject pain response classification and Cluster 2 reaching AUC=0·808 for subject function response, surpassing individual domain classifications by 12% and 15% respectively. Interpretation: We have developed a deep learning-based multimodal clustering model capable of integrating complex multi-fluid, multi-omic data to assist in KOA patient endotyping and test outcome response to TKA surgery.

Project description:Fusarium head blight (FHB) incited by Fusarium graminearum Schwabe is a devastating disease of barley and other cereal crops worldwide. Fusarium head blight is associated with trichothecene mycotoxins such as deoxynivalenol (DON), where contaminated grains are unfit for malting or animal feed industries. While genetically resistant cultivars offer the best economic and environmentally responsible means to mitigate disease, parent lines with adequate resistance are limited in barley. Resistancebreeding based upon quantitative genetic gains has been slow to date, due to intensive labour requirements of disease nurseries. The development of high throughput genome-wide molecular markers, allow application in genomic prediction models. A diverse genomic panel consisting of 400 two-row spring barley lines was assembled to focus on Canadian barley breeding programs. The panel was evaluated for FHB and DON content in three environments and over two years. Moreover, it was genotyped using an Illumina Infinium HTS iSelect custom beadchip array of single nucleotide polymorphic molecular markers (50K SNP), where over 23K molecular markers were polymorphic. Genomic prediction has been successfully demonstrated for reducing FHB and DON content in cereals using various statistically-based models of different underlying assumptions. Herein, we have studied an alternative method basedon machine learning and compare it with a statistical approach. Two encoding techniques were utilized (categorical or Hardy-Weinberg frequencies), followed by selecting essential genomic markers for phenotype prediction. Subsequently, we applied a transformer-based deep learning algorithm to predict FHB and DON. Apart from the transformer method, we also implemented a Residual Fully Connected Neural Network (RFCNN). Pearson correlation coefficients were calculated to compare true vs. predicted outputs. Under most model scenarios, the use of all markers vs. selected markers marginally improved prediction performance except for RFCNN method for FHB (27.6%). Hardy-Weinberg encoding generally improved correlation for FHB (6.9%) and DON (9.6%) for transformer. This study suggests the potential of the transformer based method for genomic prediction of complex traits such as FHB or DON, having performed better or equally compared with existing machine learning and statistical method. To genomic prediction in barley for Fusarium head blight and deoxynivalenol content using a custom Illumina Infinium array (BarleySNP50-JHI) (www.illumina.com). Sample types included leaves from 400 barley genotypes mostly of Canadian origin. This series includes 400 genotypes assayed on an Illumina infinium HTS platform 50K BeadChip.

Dataset Information

MiRBind: a Deep Learning Method for miRNA Binding Classification

Dataset's files

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets