Unknown

Dataset Information

0

Geno2proteo, a Tool for Batch Retrieval of DNA and Protein Sequences from Any Genomic or Protein Regions.


ABSTRACT: The interconversion of sequences that constitute the genome and the proteome is becoming increasingly important due to the generation of large amounts of DNA sequence data. Following mapping of DNA segments to the genome, one fundamentally important task is to find the amino acid sequences which are coded within a list of genomic sections. Conversely, given a series of protein segments, an important task is to find the genomic loci which code for a list of protein regions. To perform these tasks on a region by region basis is extremely laborious when a large number of regions are being studied. We have therefore implemented an R package geno2proteo which performs the two mapping tasks and subsequent sequence retrieval in a batch fashion. In order to make the tool more accessible to users, we have created a web interface of the R package which allows the users to perform the mapping tasks by going to the web page http://sharrocksresources.manchester.ac.uk/tofigaps and using the web service.

SUBMITTER: Li Y 

PROVIDER: S-EPMC6798850 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Geno2proteo, a Tool for Batch Retrieval of DNA and Protein Sequences from Any Genomic or Protein Regions.

Li Yaoyong Y   Aguilar-Martinez Elisa E   Sharrocks Andrew D AD  

Journal of integrative bioinformatics 20190713 3


The interconversion of sequences that constitute the genome and the proteome is becoming increasingly important due to the generation of large amounts of DNA sequence data. Following mapping of DNA segments to the genome, one fundamentally important task is to find the amino acid sequences which are coded within a list of genomic sections. Conversely, given a series of protein segments, an important task is to find the genomic loci which code for a list of protein regions. To perform these tasks  ...[more]

Similar Datasets

| S-EPMC4166778 | biostudies-literature
| S-EPMC8761809 | biostudies-literature
| S-EPMC5549384 | biostudies-literature
| S-EPMC2093931 | biostudies-literature
| S-EPMC168949 | biostudies-literature
| S-EPMC5054166 | biostudies-literature
| S-EPMC6642542 | biostudies-literature
| S-EPMC6761934 | biostudies-literature
| S-EPMC6101578 | biostudies-literature
| S-EPMC4920124 | biostudies-literature