Unknown

Dataset Information

0

Genomic Variants Revealed by Invariably Missing Genotypes in Nelore Cattle.


ABSTRACT: High density genotyping panels have been used in a wide range of applications. From population genetics to genome-wide association studies, this technology still offers the lowest cost and the most consistent solution for generating SNP data. However, in spite of the application, part of the generated data is always discarded from final datasets based on quality control criteria used to remove unreliable markers. Some discarded data consists of markers that failed to generate genotypes, labeled as missing genotypes. A subset of missing genotypes that occur in the whole population under study may be caused by technical issues but can also be explained by the presence of genomic variations that are in the vicinity of the assayed SNP and that prevent genotyping probes from annealing. The latter case may contain relevant information because these missing genotypes might be used to identify population-specific genomic variants. In order to assess which case is more prevalent, we used Illumina HD Bovine chip genotypes from 1,709 Nelore (Bos indicus) samples. We found 3,200 missing genotypes among the whole population. NGS re-sequencing data from 8 sires were used to verify the presence of genomic variations within their flanking regions in 81.56% of these missing genotypes. Furthermore, we discovered 3,300 novel SNPs/Indels, 31% of which are located in genes that may affect traits of importance for the genetic improvement of cattle production.

SUBMITTER: da Silva JM 

PROVIDER: S-EPMC4549312 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications


High density genotyping panels have been used in a wide range of applications. From population genetics to genome-wide association studies, this technology still offers the lowest cost and the most consistent solution for generating SNP data. However, in spite of the application, part of the generated data is always discarded from final datasets based on quality control criteria used to remove unreliable markers. Some discarded data consists of markers that failed to generate genotypes, labeled  ...[more]

Similar Datasets

| S-EPMC4198703 | biostudies-other
| S-EPMC5804041 | biostudies-literature
| S-EPMC4791965 | biostudies-literature
| S-EPMC6728309 | biostudies-literature
| S-EPMC3531262 | biostudies-literature
| S-EPMC6591902 | biostudies-literature
| S-EPMC5261778 | biostudies-literature
| S-EPMC4988672 | biostudies-literature
| S-EPMC4922624 | biostudies-literature
| S-EPMC5930444 | biostudies-literature