Unknown

Dataset Information

0

PENGUINN: Precise Exploration of Nuclear G-Quadruplexes Using Interpretable Neural Networks.


ABSTRACT: G-quadruplexes (G4s) are a class of stable structural nucleic acid secondary structures that are known to play a role in a wide spectrum of genomic functions, such as DNA replication and transcription. The classical understanding of G4 structure points to four variable length guanine strands joined by variable length nucleotide stretches. Experiments using G4 immunoprecipitation and sequencing experiments have produced a high number of highly probable G4 forming genomic sequences. The expense and technical difficulty of experimental techniques highlights the need for computational approaches of G4 identification. Here, we present PENGUINN, a machine learning method based on Convolutional neural networks, that learns the characteristics of G4 sequences and accurately predicts G4s outperforming state-of-the-art methods. We provide both a standalone implementation of the trained model, and a web application that can be used to evaluate sequences for their G4 potential.

SUBMITTER: Klimentova E 

PROVIDER: S-EPMC7653191 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

altmetric image

Publications

PENGUINN: Precise Exploration of Nuclear G-Quadruplexes Using Interpretable Neural Networks.

Klimentova Eva E   Polacek Jakub J   Simecek Petr P   Alexiou Panagiotis P  

Frontiers in genetics 20201027


G-quadruplexes (G4s) are a class of stable structural nucleic acid secondary structures that are known to play a role in a wide spectrum of genomic functions, such as DNA replication and transcription. The classical understanding of G4 structure points to four variable length guanine strands joined by variable length nucleotide stretches. Experiments using G4 immunoprecipitation and sequencing experiments have produced a high number of highly probable G4 forming genomic sequences. The expense an  ...[more]

Similar Datasets

| S-EPMC8211802 | biostudies-literature
| S-EPMC11297229 | biostudies-literature
| S-EPMC7336835 | biostudies-literature
| S-EPMC8790755 | biostudies-literature
| S-EPMC9325818 | biostudies-literature
| S-EPMC11366030 | biostudies-literature
| S-EPMC4988787 | biostudies-literature
| S-EPMC6423721 | biostudies-literature
| S-EPMC11009346 | biostudies-literature