Dataset Information

A Data Set and Deep Learning Algorithm for the Detection of Masses and Architectural Distortions in Digital Breast Tomosynthesis Images.

ABSTRACT:

Importance

Breast cancer screening is among the most common radiological tasks, with more than 39 million examinations performed each year. While it has been among the most studied medical imaging applications of artificial intelligence, the development and evaluation of algorithms are hindered by the lack of well-annotated, large-scale publicly available data sets.

Objectives

To curate, annotate, and make publicly available a large-scale data set of digital breast tomosynthesis (DBT) images to facilitate the development and evaluation of artificial intelligence algorithms for breast cancer screening; to develop a baseline deep learning model for breast cancer detection; and to test this model using the data set to serve as a baseline for future research.

Design, setting, and participants

In this diagnostic study, 16 802 DBT examinations with at least 1 reconstruction view available, performed between August 26, 2014, and January 29, 2018, were obtained from Duke Health System and analyzed. From the initial cohort, examinations were divided into 4 groups and split into training and test sets for the development and evaluation of a deep learning model. Images with foreign objects or spot compression views were excluded. Data analysis was conducted from January 2018 to October 2020.

Exposures

Screening DBT.

Main outcomes and measures

The detection algorithm was evaluated with breast-based free-response receiver operating characteristic curve and sensitivity at 2 false positives per volume.

Results

The curated data set contained 22 032 reconstructed DBT volumes that belonged to 5610 studies from 5060 patients with a mean (SD) age of 55 (11) years and 5059 (100.0%) women. This included 4 groups of studies: (1) 5129 (91.4%) normal studies; (2) 280 (5.0%) actionable studies, for which where additional imaging was needed but no biopsy was performed; (3) 112 (2.0%) benign biopsied studies; and (4) 89 studies (1.6%) with cancer. Our data set included masses and architectural distortions that were annotated by 2 experienced radiologists. Our deep learning model reached breast-based sensitivity of 65% (39 of 60; 95% CI, 56%-74%) at 2 false positives per DBT volume on a test set of 460 examinations from 418 patients.

Conclusions and relevance

The large, diverse, and curated data set presented in this study could facilitate the development and evaluation of artificial intelligence algorithms for breast cancer screening by providing data for training as well as a common set of cases for model validation. The performance of the model developed in this study showed that the task remains challenging; its performance could serve as a baseline for future model development.

SUBMITTER: Buda M

PROVIDER: S-EPMC8369362 | biostudies-literature | 2021 Aug

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

A Data Set and Deep Learning Algorithm for the Detection of Masses and Architectural Distortions in Digital Breast Tomosynthesis Images.

Buda Mateusz M Saha Ashirbani A Walsh Ruth R Ghate Sujata S Li Nianyi N Swiecicki Albert A Lo Joseph Y JY Mazurowski Maciej A MA

JAMA network open 20210802 8

<h4>Importance</h4>Breast cancer screening is among the most common radiological tasks, with more than 39 million examinations performed each year. While it has been among the most studied medical imaging applications of artificial intelligence, the development and evaluation of algorithms are hindered by the lack of well-annotated, large-scale publicly available data sets.<h4>Objectives</h4>To curate, annotate, and make publicly available a large-scale data set of digital breast tomosynthesis ( ...[more]

PMID: 34398205

Similar Datasets

Project description:Digital breast tomosynthesis (DBT) offers poor image quality along the depth direction. This paper presents a new method that improves the image quality of DBT considerably through the a priori information from automated ultrasound (AUS) images.DBT and AUS images of a complex breast-mimicking phantom are acquired by a DBT/AUS dual-modality system. The AUS images are taken in the same geometry as the DBT images and the gradient information of the in-slice AUS images is adopted into the new loss functional during the DBT reconstruction process. The additional data allow for new iterative equations through solving the optimization problem utilizing the gradient descent method. Both visual comparison and quantitative analysis are employed to evaluate the improvement on DBT images. Normalized line profiles of lesions are obtained to compare the edges of the DBT and AUS-corrected DBT images. Additionally, image quality metrics such as signal difference to noise ratio (SDNR) and artifact spread function (ASF) are calculated to quantify the effectiveness of the proposed method.In traditional DBT image reconstructions, serious artifacts can be found along the depth direction (Z direction), resulting in the blurring of lesion edges in the off-focus planes parallel to the detector. However, by applying the proposed method, the quality of the reconstructed DBT images is greatly improved. Visually, the AUS-corrected DBT images have much clearer borders in both in-focus and off-focus planes, fewer Z direction artifacts and reduced overlapping effect compared to the conventional DBT images. Quantitatively, the corrected DBT images have better ASF, indicating a great reduction in Z direction artifacts as well as better Z resolution. The sharper line profiles along the Y direction show enhancement on the edges. Besides, noise is also reduced, evidenced by the obviously improved SDNR values.The proposed method provides great improvement on the quality of DBT images. This improvement makes it easier to locate and to distinguish a lesion, which may help improve the accuracy of the diagnosis using DBT imaging.

Project description:Study designRetrospective diagnostic study.ObjectiveTo automatically detect osteolytic bone metastasis lesions in the thoracolumbar region using conventional computed tomography (CT) scans, we developed a new deep learning (DL)-based computer-aided detection model.Summary of background dataRadiographic detection of bone metastasis is often difficult, even for orthopedic surgeons and diagnostic radiologists, with a consequent risk for pathologic fracture or spinal cord injury. If we can improve detection rates, we will be able to prevent the deterioration of patients' quality of life at the end stage of cancer.Materials and methodsThis study included CT scans acquired at Tokyo Medical and Dental University (TMDU) Hospital between 2016 and 2022. A total of 263 positive CT scans that included at least one osteolytic bone metastasis lesion in the thoracolumbar spine and 172 negative CT scans without bone metastasis were collected for the datasets to train and validate the DL algorithm. As a test data set, 20 positive and 20 negative CT scans were separately collected from the training and validation datasets. To evaluate the performance of the established artificial intelligence (AI) model, sensitivity, precision, F1-score, and specificity were calculated. The clinical utility of our AI model was also evaluated through observer studies involving six orthopaedic surgeons and six radiologists.ResultsOur AI model showed a sensitivity, precision, and F1-score of 0.78, 0.68, and 0.72 (per slice) and 0.75, 0.36, and 0.48 (per lesion), respectively. The observer studies revealed that our AI model had comparable sensitivity to orthopaedic or radiology experts and improved the sensitivity and F1-score of residents.ConclusionWe developed a novel DL-based AI model for detecting osteolytic bone metastases in the thoracolumbar spine. Although further improvement in accuracy is needed, the current AI model may be applied to current clinical practice.Level of evidenceLevel III.

Dataset Information

A Data Set and Deep Learning Algorithm for the Detection of Masses and Architectural Distortions in Digital Breast Tomosynthesis Images.

Importance

Objectives

Design, setting, and participants

Exposures

Main outcomes and measures

Results

Conclusions and relevance

Publications

A Data Set and Deep Learning Algorithm for the Detection of Masses and Architectural Distortions in Digital Breast Tomosynthesis Images.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets