Unknown

Dataset Information

0

Applying family analyses to electronic health records to facilitate genetic research.


ABSTRACT: Motivation:Pedigree analysis is a longstanding and powerful approach to gain insight into the underlying genetic factors in human health, but identifying, recruiting and genotyping families can be difficult, time consuming and costly. Development of high throughput methods to identify families and foster downstream analyses are necessary. Results:This paper describes simple methods that allowed us to identify 173?368 family pedigrees with high probability using basic demographic data available in most electronic health records (EHRs). We further developed and validate a novel statistical method that uses EHR data to identify families more likely to have a major genetic component to their diseases risk. Lastly, we showed that incorporating EHR-linked family data into genetic association testing may provide added power for genetic mapping without additional recruitment or genotyping. The totality of these results suggests that EHR-linked families can enable classical genetic analyses in a high-throughput manner. Availability and implementation:Pseudocode is provided as supplementary information. Contact:HEBBRING.SCOTT@marshfieldresearch.org. Supplementary information:Supplementary data are available at Bioinformatics online.

SUBMITTER: Huang X 

PROVIDER: S-EPMC5860602 | biostudies-literature | 2018 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Applying family analyses to electronic health records to facilitate genetic research.

Huang Xiayuan X   Elston Robert C RC   Rosa Guilherme J GJ   Mayer John J   Ye Zhan Z   Kitchner Terrie T   Brilliant Murray H MH   Page David D   Hebbring Scott J SJ  

Bioinformatics (Oxford, England) 20180201 4


<h4>Motivation</h4>Pedigree analysis is a longstanding and powerful approach to gain insight into the underlying genetic factors in human health, but identifying, recruiting and genotyping families can be difficult, time consuming and costly. Development of high throughput methods to identify families and foster downstream analyses are necessary.<h4>Results</h4>This paper describes simple methods that allowed us to identify 173 368 family pedigrees with high probability using basic demographic d  ...[more]

Similar Datasets

| PRJNA683675 | ENA
| S-EPMC8195357 | biostudies-literature
| S-EPMC7473254 | biostudies-literature
| S-EPMC7329072 | biostudies-literature
| S-EPMC7878193 | biostudies-literature
| S-EPMC6390331 | biostudies-literature
| S-EPMC5891135 | biostudies-literature
| S-EPMC3129641 | biostudies-literature
| S-EPMC8214390 | biostudies-literature
| S-EPMC4416392 | biostudies-literature