Dataset Information

Estimation of breast percent density in raw and processed full field digital mammography images via adaptive fuzzy c-means clustering and support vector machine segmentation.

ABSTRACT: The amount of fibroglandular tissue content in the breast as estimated mammographically, commonly referred to as breast percent density (PD%), is one of the most significant risk factors for developing breast cancer. Approaches to quantify breast density commonly focus on either semiautomated methods or visual assessment, both of which are highly subjective. Furthermore, most studies published to date investigating computer-aided assessment of breast PD% have been performed using digitized screen-film mammograms, while digital mammography is increasingly replacing screen-film mammography in breast cancer screening protocols. Digital mammography imaging generates two types of images for analysis, raw (i.e., "FOR PROCESSING") and vendor postprocessed (i.e., "FOR PRESENTATION"), of which postprocessed images are commonly used in clinical practice. Development of an algorithm which effectively estimates breast PD% in both raw and postprocessed digital mammography images would be beneficial in terms of direct clinical application and retrospective analysis.

This work proposes a new algorithm for fully automated quantification of breast PD% based on adaptive multiclass fuzzy c-means (FCM) clustering and support vector machine (SVM) classification, optimized for the imaging characteristics of both raw and processed digital mammography images as well as for individual patient and image characteristics. Our algorithm first delineates the breast region within the mammogram via an automated thresholding scheme to identify background air followed by a straight line Hough transform to extract the pectoral muscle region. The algorithm then applies adaptive FCM clustering based on an optimal number of clusters derived from image properties of the specific mammogram to subdivide the breast into regions of similar gray-level intensity. Finally, an SVM classifier is trained to identify which clusters within the breast tissue are likely fibroglandular, which are then aggregated into a final dense tissue segmentation that is used to compute breast PD%. Our method is validated on a group of 81 women for whom bilateral, mediolateral oblique, raw and processed screening digital mammograms were available, and agreement is assessed with both continuous and categorical density estimates made by a trained breast-imaging radiologist.

Strong association between algorithm-estimated and radiologist-provided breast PD% was detected for both raw (r = 0.82, p < 0.001) and processed (r = 0.85, p < 0.001) digital mammograms on a per-breast basis. Stronger agreement was found when overall breast density was assessed on a per-woman basis for both raw (r = 0.85, p < 0.001) and processed (r = 0.89, p < 0.001) mammograms. Strong agreement between categorical density estimates was also seen (weighted Cohen's κ ≥ 0.79). Repeated measures analysis of variance demonstrated no statistically significant differences between the PD% estimates (p > 0.1) due to either presentation of the image (raw vs processed) or method of PD% assessment (radiologist vs algorithm).

The proposed fully automated algorithm was successful in estimating breast percent density from both raw and processed digital mammographic images. Accurate assessment of a woman's breast density is critical in order for the estimate to be incorporated into risk assessment models. These results show promise for the clinical application of the algorithm in quantifying breast density in a repeatable manner, both at time of imaging as well as in retrospective studies.
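The clustering and PD% steps described in the abstract can be illustrated with a minimal sketch. It uses textbook fuzzy c-means on 1-D gray-level values with a fixed number of clusters, not the adaptive scheme of the paper, and it does not include the breast delineation, Hough transform, or SVM stages. The function names `fuzzy_c_means` and `percent_density`, the fuzzifier `m = 2`, and the synthetic gray levels are assumptions made for illustration. Treating the brightest cluster as dense is a stand-in for the trained SVM classifier.

```python
import numpy as np

def fuzzy_c_means(values, n_clusters, m=2.0, n_iter=100, tol=1e-5, seed=0):
    """Textbook fuzzy c-means on 1-D gray-level values (illustrative only)."""
    x = np.asarray(values, dtype=float).ravel()
    rng = np.random.default_rng(seed)
    # Random initial memberships; each pixel's memberships sum to 1.
    u = rng.random((n_clusters, x.size))
    u /= u.sum(axis=0, keepdims=True)
    for _ in range(n_iter):
        um = u ** m
        # Cluster centers are membership-weighted means of the gray levels.
        centers = um @ x / um.sum(axis=1)
        # Distance of every pixel to every center (epsilon avoids division by zero).
        dist = np.abs(x[None, :] - centers[:, None]) + 1e-12
        inv = dist ** (-2.0 / (m - 1.0))
        new_u = inv / inv.sum(axis=0, keepdims=True)
        converged = np.max(np.abs(new_u - u)) < tol
        u = new_u
        if converged:
            break
    return centers, u

def percent_density(labels, dense_clusters):
    """PD% = 100 * (pixels in dense clusters) / (all breast pixels)."""
    labels = np.asarray(labels)
    dense = np.isin(labels, list(dense_clusters))
    return 100.0 * dense.sum() / labels.size

# Synthetic breast-region gray levels: 300 darker and 100 brighter pixels.
rng = np.random.default_rng(1)
pixels = np.concatenate([rng.normal(0.2, 0.02, 300), rng.normal(0.8, 0.02, 100)])
centers, memberships = fuzzy_c_means(pixels, n_clusters=2)
labels = memberships.argmax(axis=0)  # hard label = cluster of highest membership
# Assumption: the brightest cluster is dense; the paper uses an SVM for this step.
pd_estimate = percent_density(labels, {int(np.argmax(centers))})
```

In this sketch the input is assumed to contain only pixels inside the breast region, so the denominator of PD% is the breast area rather than the full image.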

SUBMITTER: Keller BM

PROVIDER: S-EPMC3416877 | biostudies-other | 2012 Aug

REPOSITORIES: biostudies-other

Similar Datasets

Project description:

Introduction: Mammographic density has been established as a strong risk factor for breast cancer, primarily using digitized film mammograms. Full-field digital mammography (FFDM) is replacing film mammography, has different properties than film, and provides both raw and processed clinical display representation images. We evaluated and compared FFDM raw and processed breast density measures and their associations with breast cancer.

Methods: A case-control study of 180 cases and 180 controls matched by age, postmenopausal hormone use, and screening history was conducted. Mammograms were acquired from a General Electric Senographe 2000D FFDM unit. Percent density (PD) was assessed for each FFDM representation using the operator-assisted Cumulus method. Reproducibility within image type (n = 80) was assessed using Lin's concordance correlation coefficient (rc). Correlation of PD between image representations (n = 360) was evaluated using Pearson's correlation coefficient (r) on the continuous measures and the weighted kappa statistic (κ) for quartiles. Conditional logistic regression was used to estimate odds ratios (ORs) for the PD and breast cancer associations for both image representations with 95% confidence intervals. The area under the receiver operating characteristic curve (AUC) was used to assess the discriminatory accuracy.

Results: Percent density from the two representations provided similar intra-reader reproducibility (rc = 0.92 for raw and rc = 0.87 for processed images) and was correlated (r = 0.82 and κ = 0.64). When controlling for body mass index, the associations of quartiles of PD with breast cancer and discriminatory accuracy were similar for the raw (OR: 1.0 (ref.), 2.6 (1.2 to 5.4), 3.1 (1.4 to 6.8), 4.7 (2.1 to 10.6); AUC = 0.63) and processed representations (OR: 1.0 (ref.), 2.2 (1.1 to 4.1), 2.2 (1.1 to 4.4), 3.1 (1.5 to 6.6); AUC = 0.64).

Conclusions: Percent density measured with an operator-assisted method from raw and processed FFDM images is reproducible and correlated. Both percent density measures provide similar associations with breast cancer.
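The two agreement statistics named in the methods, Pearson's r and Lin's concordance correlation coefficient, differ in how they treat systematic offsets between readings. The sketch below shows the standard definitions. The function names and the toy PD values are assumptions for illustration and are not data from the study.

```python
import numpy as np

def pearson_r(x, y):
    """Pearson's correlation coefficient between two sets of readings."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return float(np.corrcoef(x, y)[0, 1])

def lin_ccc(x, y):
    """Lin's concordance correlation coefficient (population moments)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    mean_x, mean_y = x.mean(), y.mean()
    covariance = np.mean((x - mean_x) * (y - mean_y))
    # The squared mean difference penalizes a constant offset between readings.
    return float(2.0 * covariance / (x.var() + y.var() + (mean_x - mean_y) ** 2))

# Hypothetical PD readings: the second set is the first shifted by 5 points.
first_read = np.array([10.0, 20.0, 30.0, 40.0])
second_read = first_read + 5.0
r = pearson_r(first_read, second_read)    # unaffected by the constant shift
rc = lin_ccc(first_read, second_read)     # reduced by the constant shift
```

Pearson's r measures only linear association, while Lin's coefficient also falls when the two readings disagree in location or scale, which is why it suits a reproducibility check.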
