Unknown

Dataset Information

0

SnpHub: an easy-to-set-up web server framework for exploring large-scale genomic variation data in the post-genomic era with applications in wheat.


ABSTRACT: BACKGROUND:The cost of high-throughput sequencing is rapidly decreasing, allowing researchers to investigate genomic variations across hundreds or even thousands of samples in the post-genomic era. The management and exploration of these large-scale genomic variation data require programming skills. The public genotype querying databases of many species are usually centralized and implemented independently, making them difficult to update with new data over time. Currently, there is a lack of a widely used framework for setting up user-friendly web servers to explore new genomic variation data in diverse species. RESULTS:Here, we present SnpHub, a Shiny/R-based server framework for retrieving, analysing, and visualizing large-scale genomic variation data that can be easily set up on any Linux server. After a pre-building process based on the provided VCF files and genome annotation files, the local server allows users to interactively access single-nucleotide polymorphisms and small insertions/deletions with annotation information by locus or gene and to define sample sets through a web page. Users can freely analyse and visualize genomic variations in heatmaps, phylogenetic trees, haplotype networks, or geographical maps. Sample-specific sequences can be accessed as replaced by detected sequence variations. CONCLUSIONS:SnpHub can be applied to any species, and we build up a SnpHub portal website for wheat and its progenitors based on published data in recent studies. SnpHub and its tutorial are available at http://guoweilong.github.io/SnpHub/. The wheat-SnpHub-portal website can be accessed at http://wheat.cau.edu.cn/Wheat_SnpHub_Portal/.

SUBMITTER: Wang W 

PROVIDER: S-EPMC7274028 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

SnpHub: an easy-to-set-up web server framework for exploring large-scale genomic variation data in the post-genomic era with applications in wheat.

Wang Wenxi W   Wang Zihao Z   Li Xintong X   Ni Zhongfu Z   Hu Zhaorong Z   Xin Mingming M   Peng Huiru H   Yao Yingyin Y   Sun Qixin Q   Guo Weilong W  

GigaScience 20200601 6


<h4>Background</h4>The cost of high-throughput sequencing is rapidly decreasing, allowing researchers to investigate genomic variations across hundreds or even thousands of samples in the post-genomic era. The management and exploration of these large-scale genomic variation data require programming skills. The public genotype querying databases of many species are usually centralized and implemented independently, making them difficult to update with new data over time. Currently, there is a la  ...[more]

Similar Datasets

| S-EPMC8170118 | biostudies-literature
| S-EPMC4987924 | biostudies-literature
| S-EPMC9252808 | biostudies-literature
| S-EPMC5700660 | biostudies-literature
| S-EPMC310868 | biostudies-literature
| S-EPMC9252728 | biostudies-literature
| S-EPMC4086104 | biostudies-literature
| S-EPMC8262705 | biostudies-literature
| S-EPMC9252824 | biostudies-literature
| S-EPMC8262711 | biostudies-literature