Dataset Information

Korean Variant Archive (KOVA): a reference database of genetic variations in the Korean population.


ABSTRACT: Despite efforts to interrogate human genome variation through large-scale databases, systematic preference toward populations of Caucasian descendants has resulted in unintended reduction of power in studying non-Caucasians. Here we report a compilation of coding variants from 1,055 healthy Korean individuals (KOVA; Korean Variant Archive). The samples were sequenced to a mean depth of 75x, yielding 101 singleton variants per individual. Population genetics analysis demonstrates that the Korean population is a distinct ethnic group comparable to other discrete ethnic groups in Africa and Europe, providing a rationale for such independent genomic datasets. Indeed, KOVA conferred 22.8% increased variant filtering power in addition to the Exome Aggregation Consortium (ExAC) when used on Korean exomes. Functional assessment of nonsynonymous variants supported the presence of purifying selection in Koreans. Analysis of copy number variants detected 5.2 deletions and 10.3 amplifications per individual with an increased fraction of novel variants among smaller and rarer copy number variable segments. We also report a list of germline variants that are associated with increased tumor susceptibility. This catalog can function as a critical addition to the pre-existing variant databases in pursuing genetic studies of Korean individuals.
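
The "variant filtering power" mentioned in the abstract can be illustrated with a small sketch. The abstract does not define how the 22.8% figure was computed, so the code below assumes one plausible reading: the relative reduction in candidate variants left in a sample after removing those already catalogued in a second reference set on top of a first one. The function names, the variant tuple layout, and the toy variants are all hypothetical and are not taken from KOVA or ExAC.

```python
from typing import Set, Tuple

# A variant is keyed as (chromosome, position, ref allele, alt allele).
# This layout is an assumption made for illustration only.
Variant = Tuple[str, int, str, str]


def remaining_after_filter(sample: Set[Variant], *databases: Set[Variant]) -> Set[Variant]:
    """Return the sample variants that appear in none of the reference sets."""
    known: Set[Variant] = set().union(*databases)
    return sample - known


def relative_reduction(sample: Set[Variant], base_db: Set[Variant], extra_db: Set[Variant]) -> float:
    """Fraction of candidates left by base_db alone that extra_db additionally removes.

    This definition is an assumed reading of "increased filtering power",
    not the metric confirmed by the source.
    """
    left_base = len(remaining_after_filter(sample, base_db))
    left_both = len(remaining_after_filter(sample, base_db, extra_db))
    if left_base == 0:
        raise ValueError("base_db already removes every variant in the sample")
    return (left_base - left_both) / left_base


# Toy data (hypothetical variants, not real database content).
sample = {
    ("1", 100, "A", "G"),
    ("1", 200, "C", "T"),
    ("2", 300, "G", "A"),
    ("3", 400, "T", "C"),
    ("4", 500, "A", "T"),
}
exac_like = {("1", 100, "A", "G")}                        # stands in for a global reference set
kova_like = {("1", 100, "A", "G"), ("2", 300, "G", "A")}  # stands in for a population-specific set

gain = relative_reduction(sample, exac_like, kova_like)
print(f"Additional reduction in candidate variants: {gain:.1%}")
```

In this toy case the global set alone leaves four candidates and adding the population-specific set leaves three, so the additional reduction is one quarter. Variants present in both reference sets contribute nothing extra, which is why a population-matched catalog matters most for variants that are common locally but absent from global databases.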

SUBMITTER: Lee S 

PROVIDER: S-EPMC5487339 | biostudies-literature | 2017 Jun

REPOSITORIES: biostudies-literature

Publications

Korean Variant Archive (KOVA): a reference database of genetic variations in the Korean population.

Lee Sangmoon, Seo Jihae, Park Jinman, Nam Jae-Yong, Choi Ahyoung, Ignatius Jason S, Bjornson Robert D, Chae Jong-Hee, Jang In-Jin, Lee Sanghyuk, Park Woong-Yang, Baek Daehyun, Choi Murim

Scientific Reports, 2017-06-27 (issue 1)

Similar Datasets

| S-EPMC4931044 | biostudies-literature
| S-EPMC8319390 | biostudies-literature
| S-EPMC7599103 | biostudies-literature
| S-EPMC11220213 | biostudies-literature
2015-05-01 | GSE58431 | GEO
| S-EPMC7288905 | biostudies-literature
2015-05-01 | E-GEOD-58431 | biostudies-arrayexpress
| S-EPMC3081314 | biostudies-literature
| S-EPMC6406302 | biostudies-literature
| S-EPMC2892595 | biostudies-literature