Unknown

Dataset Information

0

HUPAN: a pan-genome analysis pipeline for human genomes.


ABSTRACT: The human reference genome is still incomplete, especially for those population-specific or individual-specific regions, which may have important functions. Here, we developed a HUman Pan-genome ANalysis (HUPAN) system to build the human pan-genome. We applied it to 185 deep sequencing and 90 assembled Han Chinese genomes and detected 29.5?Mb novel genomic sequences and at least 188 novel protein-coding genes missing in the human reference genome (GRCh38). It can be an important resource for the human genome-related biomedical studies, such as cancer genome analysis. HUPAN is freely available at http://cgm.sjtu.edu.cn/hupan/ and https://github.com/SJTU-CGM/HUPAN .

SUBMITTER: Duan Z 

PROVIDER: S-EPMC6670167 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications


The human reference genome is still incomplete, especially for those population-specific or individual-specific regions, which may have important functions. Here, we developed a HUman Pan-genome ANalysis (HUPAN) system to build the human pan-genome. We applied it to 185 deep sequencing and 90 assembled Han Chinese genomes and detected 29.5 Mb novel genomic sequences and at least 188 novel protein-coding genes missing in the human reference genome (GRCh38). It can be an important resource for the  ...[more]

Similar Datasets

| S-EPMC3268234 | biostudies-literature
| S-EPMC5780747 | biostudies-literature
| S-EPMC4829868 | biostudies-literature
| S-EPMC4363492 | biostudies-literature
| S-EPMC9290807 | biostudies-literature
| S-EPMC6964052 | biostudies-literature
| EGAS00001003657 | EGA
| S-EPMC9850592 | biostudies-literature
| S-EPMC7025898 | biostudies-literature
2020-02-05 | E-MTAB-5200 | biostudies-arrayexpress