Unknown

Dataset Information

0

New methodology for repetitive sequences identification in human X and Y chromosomes.


ABSTRACT: Repetitive DNA sequences occupy the major proportion of DNA in the human genome and even in the other species' genomes. The importance of each repetitive DNA type depends on many factors: structural and functional roles, positions, lengths and numbers of these repetitions are clear examples. Conserving such DNA sequences or not in different locations in the chromosome remains a challenge for researchers in biology. Detecting their location despite their great variability and finding novel repetitive sequences remains a challenging task. To side-step this problem, we developed a new method based on signal and image processing tools. In fact, using this method we could find repetitive patterns in DNA images regardless of the repetition length. This new technique seems to be more efficient in detecting new repetitive sequences than bioinformatics tools. In fact, the classical tools present limited performances especially in case of mutations (insertion or deletion). However, modifying one or a few numbers of pixels in the image doesn't affect the global form of the repetitive pattern. As a consequence, we generated a new repetitive patterns database which contains tandem and dispersed repeated sequences. The highly repetitive sequences, we have identified in X and Y chromosomes, are shown to be located in other human chromosomes or in other genomes. The data we have generated is then taken as input to a Convolutional neural network classifier in order to classify them. The system we have constructed is efficient and gives an average of 94.4% as recognition score.

SUBMITTER: Touati R 

PROVIDER: S-EPMC7572123 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

New methodology for repetitive sequences identification in <i>human</i> X and Y chromosomes.

Touati Rabeb R   Tajouri Asma A   Mesaoudi Imen I   Oueslati Afef Elloumi AE   Lachiri Zied Z   Kharrat Maher M  

Biomedical signal processing and control 20201019


Repetitive DNA sequences occupy the major proportion of DNA in the human genome and even in the other species' genomes. The importance of each repetitive DNA type depends on many factors: structural and functional roles, positions, lengths and numbers of these repetitions are clear examples. Conserving such DNA sequences or not in different locations in the chromosome remains a challenge for researchers in biology. Detecting their location despite their great variability and finding novel repeti  ...[more]

Similar Datasets

| S-EPMC3876203 | biostudies-literature
| S-EPMC318249 | biostudies-other
| S-EPMC4570811 | biostudies-literature
| S-EPMC309273 | biostudies-other
| S-EPMC7963086 | biostudies-literature
| S-EPMC6856378 | biostudies-literature
| S-EPMC97424 | biostudies-literature
| S-EPMC328716 | biostudies-other
| S-EPMC5672077 | biostudies-literature
| S-EPMC4736629 | biostudies-literature