Unknown

Dataset Information

0

KRGDB: the large-scale variant database of 1722 Koreans based on whole genome sequencing.


ABSTRACT: Since 2012, the Center for Genome Science of the Korea National Institute of Health (KNIH) has been sequencing complete genomes of 1722 Korean individuals. As a result, more than 32 million variant sites have been identified, and a large proportion of the variant sites have been detected for the first time. In this article, we describe the Korean Reference Genome Database (KRGDB) and its genome browser. The current version of our database contains both single nucleotide and short insertion/deletion variants. The DNA samples were obtained from four different origins and sequenced in different sequencing depths (10× coverage of 63 individuals, 20× coverage of 194 individuals, combined 10× and 20× coverage of 135 individuals, 30× coverage of 230 individuals and 30× coverage of 1100 individuals). The major features of the KRGDB are that it contains information on the Korean genomic variant frequency, frequency difference between the Korean and other populations and the variant functional annotation (such as regulatory elements in ENCODE regions and coding variant functions) of the variant sites. Additionally, we performed the genome-wide association study (GWAS) between Korean genome variant sites for the 30×230 individuals and three major common diseases (diabetes, hypertension and metabolic syndrome). The association results are displayed on our browser. The KRGDB uses the MySQL database and Apache-Tomcat web server adopted with Java Server Page (JSP) and is freely available at http://coda.nih.go.kr/coda/KRGDB/index.jsp. Availability: http://coda.nih.go.kr/coda/KRGDB/index.jsp.

SUBMITTER: Jung KS 

PROVIDER: S-EPMC7056612 | biostudies-literature | 2020 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

KRGDB: the large-scale variant database of 1722 Koreans based on whole genome sequencing.

Jung Kwang Su KS   Hong Kyung-Won KW   Jo Hyun Youn HY   Choi Jongpill J   Ban Hyo-Jeong HJ   Cho Seong Beom SB   Chung Myungguen M  

Database : the journal of biological databases and curation 20200101


Since 2012, the Center for Genome Science of the Korea National Institute of Health (KNIH) has been sequencing complete genomes of 1722 Korean individuals. As a result, more than 32 million variant sites have been identified, and a large proportion of the variant sites have been detected for the first time. In this article, we describe the Korean Reference Genome Database (KRGDB) and its genome browser. The current version of our database contains both single nucleotide and short insertion/delet  ...[more]

Similar Datasets

| S-EPMC4785574 | biostudies-literature
| S-EPMC2760884 | biostudies-other
| S-EPMC6370902 | biostudies-other