Dataset Information

Finding alternative expression quantitative trait loci by exploring sparse model space.

ABSTRACT: Sparse modeling, a feature selection method widely used in the machine-learning community, has recently been applied to identify associations in genetic studies, including expression quantitative trait locus (eQTL) mapping. These genetic studies usually involve high-dimensional data in which the number of features is much larger than the number of samples. This high dimensionality introduces the problem that optimizing a sparse model can yield multiple solutions. In such situations, a single optimization result provides only an incomplete view of the data and lacks the power to find alternative features associated with the same trait. In this article, we propose a novel method aimed at detecting alternative eQTLs, where two genetic variants have an alternative relationship with respect to their associations with the expression of a particular gene. Our method accomplishes this goal by exploring multiple solutions sampled from the solution space. We justified our method theoretically and demonstrated its use on simulated data. We then applied our method to a real eQTL dataset and identified a set of alternative eQTLs with potential biological insights. Additionally, these alternative eQTLs suggest a network view of gene regulation.
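The non-uniqueness mentioned in the abstract can be illustrated with a toy example. The sketch below is not the authors' method: it assumes an ordinary lasso objective, a hand-written coordinate-descent solver, and a small synthetic genotype matrix in which snp1 is an exact copy of snp0 (perfect linkage disequilibrium). All names (lasso_cd, snp0, expr, lam, and so on) are hypothetical. Visiting the two duplicated variants in a different order is used here only as a simple way to reach different optimal solutions.

```python
def soft_threshold(z, t):
    """Soft-thresholding operator used by the lasso coordinate update."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


def center(values):
    """Subtract the mean so that no intercept term is needed."""
    m = sum(values) / len(values)
    return [v - m for v in values]


def lasso_cd(cols, y, lam, order, n_sweeps=100):
    """Minimise (1/2n)*||y - Xw||^2 + lam*||w||_1 by cyclic coordinate
    descent, visiting the features in the given order."""
    n, p = len(y), len(cols)
    w = [0.0] * p
    resid = list(y)  # residual y - Xw at the starting point w = 0
    col_sq = [sum(v * v for v in c) / n for c in cols]
    for _ in range(n_sweeps):
        for j in order:
            x = cols[j]
            # correlation of feature j with the partial residual
            rho = sum(x[i] * resid[i] for i in range(n)) / n + col_sq[j] * w[j]
            new_w = soft_threshold(rho, lam) / col_sq[j]
            delta = new_w - w[j]
            for i in range(n):
                resid[i] -= x[i] * delta
            w[j] = new_w
    return w


def objective(cols, y, w, lam):
    """Value of the lasso objective at w."""
    n = len(y)
    fitted = [sum(cols[j][i] * w[j] for j in range(len(w))) for i in range(n)]
    rss = sum((y[i] - fitted[i]) ** 2 for i in range(n))
    return rss / (2 * n) + lam * sum(abs(v) for v in w)


def selected(w, tol=1e-6):
    """Indices of features with a non-negligible coefficient."""
    return {j for j, v in enumerate(w) if abs(v) > tol}


# Synthetic data (assumption): 27 samples, 4 variants coded 0/1/2.
n = 27
snp0 = [i % 3 for i in range(n)]
snp1 = list(snp0)                       # exact copy of snp0
snp2 = [(i // 3) % 3 for i in range(n)]
snp3 = [(i // 9) % 3 for i in range(n)]
noise = [0.05 * ((2 * i) % 5 - 2) for i in range(n)]  # small deterministic noise
expr = [1.0 * snp0[i] + noise[i] for i in range(n)]   # expression driven by snp0

cols = [center(c) for c in (snp0, snp1, snp2, snp3)]
y = center(expr)
lam = 0.1

sol_a = lasso_cd(cols, y, lam, order=[0, 1, 2, 3])
sol_b = lasso_cd(cols, y, lam, order=[1, 0, 2, 3])
sol_mix = [(a + b) / 2 for a, b in zip(sol_a, sol_b)]  # average of the two

for name, w in (("snp0 first", sol_a), ("snp1 first", sol_b), ("average", sol_mix)):
    print(name, sorted(selected(w)), round(objective(cols, y, w, lam), 6))
```

In this toy setting the two runs select different variants yet reach the same objective value, and so does their average, so a single fit would report only one of two interchangeable candidates. Real eQTL data rarely contain exact duplicates, and the sampling scheme in the article may differ substantially from this reordering trick.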

SUBMITTER: Wang Z

PROVIDER: S-EPMC4010169 | biostudies-literature | 2014 May

REPOSITORIES: biostudies-literature

Publications

Finding alternative expression quantitative trait loci by exploring sparse model space.

Zhiyong Wang, Jinbo Xu, Xinghua Shi

Journal of Computational Biology: a journal of computational molecular cell biology, 2014-04-01, issue 5

PMID: 24689773

Similar Datasets

Project description: Genetic dissection of the S rat genome has provided strong evidence for the presence of two interacting blood pressure (BP) quantitative trait loci (QTLs), termed QTL1 and QTL2, on rat chromosome 5. However, the identities of the underlying interacting genetic factors remain unknown. Further experiments targeted to identify the interacting genetic factors by the substitution mapping approach alone are difficult because of the interdependency of natural recombinations to occur at the two QTLs. We hypothesized that the interacting genetic factors underlying these two QTLs may interact at the level of gene transcription and thereby represent expression QTLs (eQTLs). To detect these interacting eQTLs, a custom QTL chip containing the annotated genes within QTL1 and QTL2 was developed and used to conduct a transcriptional profiling study of S and two congenic strains that retain either one or both the QTLs. The results uncovered an interaction between two transcription factors, DMRTA2 and NFIA. Further, the "biological signature" elicited by these two transcription factors was differential between the congenic strain that retained LEW alleles at both QTL1 and 2 compared to the congenic strain that retained LEW alleles at QTL1 alone. A network of transcription factors potentially affecting BP could be traced, lending support to our hypothesis. Pairs of Cy5 and Cy3 labeled targets were co-hybridized onto either a custom long oligonucleotide microarray for the interrogation of 231 genes encompassed by QTL1 and QTL2, or a TIGR rat cDNA array consisting of 26,401 probe elements representing 20,465 unique non-QTL genes. A "flip-dye" or "balanced block" design was used as the experimental method of choice to account for potential dye-bias labeling effects.
Six "balanced block" normalized files are submitted for the long oligonucleotide array interrogating the hearts from S versus S.LEW(5)x6x9 animals, fourteen "flip-dye" hybridizations are submitted for the cDNA array interrogating the hearts from S versus S.LEW(5)x6x9 animals, and twelve "flip-dye" hybridizations are submitted for the cDNA array interrogating the hearts from S versus S.LEW(5)x6x11 animals. GPR and MEV files cannot be located.
