Dataset Information

Correlation Analysis of Variables From the Atherosclerosis Risk in Communities Study.

ABSTRACT: The need to test chemicals in a timely and cost-effective manner has driven the development of new alternative methods (NAMs) that utilize in silico and in vitro approaches for toxicity prediction. There is a wealth of existing data from human studies that can aid in understanding the ability of NAMs to support chemical safety assessment. This study aims to streamline the integration of data from existing human cohorts by programmatically identifying related variables within each study. Study variables from the Atherosclerosis Risk in Communities (ARIC) study were clustered based on their correlation within the study. The quality of the clusters was evaluated via a combination of manual review and natural language processing (NLP). We identified 391 clusters including 3,285 variables. Manual review of the clusters containing more than one variable determined that human reviewers considered 95% of the clusters related to some degree. To evaluate potential bias in the human reviewers, clusters were also scored via NLP, which showed a high concordance with the human classification. Clusters were further consolidated into cluster groups using the Louvain community finding algorithm. Manual review of the cluster groups confirmed that clusters within a group were more related than clusters from different groups. Our data-driven approach can facilitate data harmonization and curation efforts by providing human annotators with groups of related variables reflecting the themes present in the data. Reviewing groups of related variables should increase efficiency of the human review, and the number of variables reviewed can be reduced by focusing curator attention on variable groups whose theme is relevant for the topic being studied.

SUBMITTER: Mandal M

PROVIDER: S-EPMC9310100 | biostudies-literature |

REPOSITORIES: biostudies-literature

ACCESS DATA

Similar Datasets

Project description:The Atherosclerosis Risk in Communities (ARIC) Study, sponsored by the National Heart, Lung and Blood Institute (NHLBI), is a prospective epidemiologic study conducted in four U.S. communities. The four communities are Forsyth County, NC; Jackson, MS; the northwest suburbs of Minneapolis, MN; and Washington County, MD. ARIC is designed to investigate the etiology and natural history of atherosclerosis, the etiology of clinical atherosclerotic diseases, and variation in cardiovascular risk factors, medical care and disease by race, gender, location, and date. ARIC includes two parts: the Cohort Component and the Community Surveillance Component. The Cohort Component began in 1987, and each ARIC field center randomly selected and recruited a cohort sample of approximately 4,000 individuals aged 45-64 from a defined population in their community. A total of 15,792 participants received an extensive examination, including medical, social, and demographic data. These participants were reexamined every three years with the first screen (baseline) occurring in 1987-89, the second in 1990-92, the third in 1993-95, and the fourth and last exam was in 1996-98. Follow-up occurs yearly by telephone to maintain contact with participants and to assess health status of the cohort. In the Community Surveillance Component, currently ongoing, these four communities are investigated to determine the community-wide occurrence of hospitalized myocardial infarction and coronary heart disease deaths in men and women aged 35-84 years. Hospitalized stroke is investigated in cohort participants only. Starting in 2006, the study conducts community surveillance of inpatient (ages 55 years and older) and outpatient heart failure (ages 65 years and older) for heart failure events beginning in 2005. ARIC is currently funded through January 31, 2012. This study is part of the Gene Environment Association Studies initiative (GENEVA, <a href="http://www.genevastudy.org" target="_blank">http://www.genevastudy.org</a>) funded by the trans-NIH Genes, Environment, and Health Initiative (GEI). The overarching goal is to identify novel genetic factors that contribute to atherosclerosis and cardiovascular disease through large-scale genome-wide association studies of well-characterized cohorts of adults in four defined populations. Genotyping was performed at the Broad Institute of MIT and Harvard, a GENEVA genotyping center. Data cleaning and harmonization were done at the GEI-funded GENEVA Coordinating Center at the University of Washington.

Project description:ObjectiveTo describe the association between midlife carotid atherosclerosis and late-life hearing loss among participants in the Atherosclerosis Risk in Communities (ARIC) study.Design, setting, and participantsFor this cross-sectional study and temporal analysis of a cohort within the ongoing ARIC prospective cohort study, participants were recruited from 4 communities in the US. The analysis evaluated information on mean carotid intima-media thickness (cIMT), from visit 1 (1987-1989) to visit 4 (1994-1996), carotid plaque presence at visit 4, and audiometric data from visit 6 (2016-2017). The cIMT measures were calculated from ultrasonography recordings by trained readers at the ARIC Ultrasound Reading Center. At each visit, cIMT was computed as the average of 3 segments: the distal common carotid, the carotid artery bifurcation, and the proximal internal carotid arteries. Presence of carotid plaque was determined based on an abnormal wall thickness, shape, or wall texture. Audiometric 4-frequency pure tone average (PTA) was measured and calculated for the better-hearing ear and modeled as a continuous variable. Linear regression estimated the association between cIMT and carotid plaque with hearing, adjusting for age, sex, race and study center, education level, body mass index (calculated as weight in kilograms divided by height in meters squared), smoking status, hypertension, cholesterol levels, diabetes, and exposure to occupational noise. Missing data (exposure and covariates) were imputed with multiple imputation by chained equations. Data analyses were performed from April 6 to July 13, 2022.Main outcomes and measuresHearing loss assessed using 4-frequency (0.5, 1.0, 2.0, and 4.0 kilohertz) PTA for both ears and carotid plaque at visit 4 and mean cIMT from visit 1 to visit 4.ResultsAmong a total of 3594 participants (mean [SD] age at visit 4, 59.4 [4.6] years; 2146 [59.7%] female; 819 [22.8%] Black and 2775 [77.2%] White individuals), fully adjusted models indicated that an additional 0.1 mm higher mean cIMT was associated with 0.59 dB (95% CI, 0.17 to 1.02 dB) higher PTA. Compared with participants without carotid plaque, plaque presence was associated with 0.63 dB (95% CI, -0.57 to 1.84 dB) higher PTA.Conclusion and relevanceThe findings of this cross-sectional study with temporal analyses of a cohort with the ongoing ARIC study found that subclinical atherosclerosis in midlife was associated with worse hearing in older adulthood. Prevention and control of carotid atherosclerosis during middle age may positively affect the hearing health of older adults.

Project description:BACKGROUND:Previous studies suggest that heart failure (HF) is an independent risk factor for cognitive decline. A better understanding of the relationship between HF, cognitive status, and cognitive decline in a community-based sample may help clinicians understand disease risk. OBJECTIVE:To examine whether persons with HF have a higher prevalence of cognitive impairment and whether persons developing HF have more rapid cognitive decline. DESIGN:This observational cohort study of American adults in the Atherosclerosis Risk in Communities (ARIC) study has two components: cross-sectional analysis examining the association between prevalent HF and cognition using multinomial logistic regression, and change over time analysis detailing the association between incident HF and change in cognition over 15 years. PARTICIPANTS:Among visit 5 (2011-2013) participants (median age 75 years), 6495 had neurocognitive information available for cross-sectional analysis. Change over time analysis examined the 5414 participants who had cognitive scores and no prevalent HF at visit 4 (1996-1998). MEASUREMENTS:The primary outcome was cognitive status, classified as normal, mild cognitive impairment [MCI], and dementia on the basis of standardized cognitive tests (delayed word recall, word fluency, and digit symbol substitution). Cognitive change was examined over a 15-year period. Control variables included socio-demographic, vascular, and smoking/drinking measures. RESULTS:At visit 5, participants with HF had a higher prevalence of dementia (adjusted relative risk ratio [RRR] = 1.60 [95% CI 1.13, 2.25]) and MCI (RRR = 1.36 [1.12, 1.64]) than those without HF. A decline in cognition between visits 4 and 5 was - 0.07 standard deviation units [- 0.13, - 0.01] greater among persons who developed HF compared to those who did not. Results did not differ by ejection fraction. CONCLUSION:HF is associated with neurocognitive dysfunction and decline independent of other co-morbid conditions. Further study is needed to determine the underlying pathophysiology.

Project description:BackgroundCardiovascular disease (CVD) is associated with a greater frailty risk, but it remains unknown if pathways that contribute to CVD are associated with the frailty risk. Thus, we aimed to investigate whether elevations in high-sensitivity cardiac troponin T (hs-cTnT) and N-terminal pro-B-type natriuretic peptide (NT-proBNP) for those without known CVD at baseline are associated with a higher frailty risk.MethodsThis study used data from the Atherosclerosis Risk in Communities study. Cardiac biomarkers were measured from stored plasma samples collected at Visit 2 (1991-1993). Frailty was recorded at Visit 5 (2011-2013). Cox regression models were used to determine the association of cardiac biomarkers with frailty risk.ResultsOverall, 360/5199 (6.9%) participants aged 55.1 ± 5.1 years developed frailty during a median follow-up of 21.7 years. The incidence of frailty was significantly higher in participants with hs-cTnT ≥14 ng/L (vs. < 14 ng/L: 17.9% vs. 6.7%) or NT-proBNP ≥300 pg/ml (vs. < 300 pg/ml: 19.7% vs. 6.8%) (all P < 0.001). Comparing higher vs. lower cut-off levels of either hs-cTnT (14 ng/l) or NT-proBNP (300 pg/ml) demonstrated a greater than two-fold higher frailty risk, with hazard ratios (HRs) of 2.13 (95% confidence interval (CI): 1.130-4.01, P = 0.020) and 2.61 (95% CI: 1.28-5.33, P = 0.008), respectively. Individuals with both elevated hs-cTnT and NT-proBNP had a higher frailty risk than those without it (HR: 4.15; 95% CI: 1.50-11.48, P = 0.006).ConclusionsHigh hs-cTnT and NT-proBNP levels are strongly associated with incident frailty in the community-dwelling population without known CVD. Subclinical cardiac damage (hs-cTnT) and/or wall strain (NT-proBNP) may be the key pathway of CVD patients developing frailty. Detection of hs-cTnT and NT-proBNP may help for early screening of high-risk frailty and providing individualised intervention.Trial registrationURL: https://www.Clinicaltrialsgov ; Unique identifier: NCT00005131 .

Dataset Information

Correlation Analysis of Variables From the Atherosclerosis Risk in Communities Study.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets