AI Detects More Breast Cancers with Fewer False Positives

Recall rate and radiologist reading workload decreased significantly in AI-screened group


Andreas Lauritzen PhD
Lauritzen

Using AI, breast radiologists in Denmark have improved breast cancer screening performance and reduced the rate of false-positive findings. Results of the study were published in Radiology

Mammography successfully reduces breast cancer mortality, but also carries the risk of false-positive findings. In recent years, researchers have studied the use of AI systems in screening.

“We believe AI has the potential to improve screening performance,” said Andreas D. Lauritzen, PhD, a post-doctoral student at the University of Copenhagen and researcher at Gentofte Hospital in Denmark.

When used to triage likely normal screening results or assist with decision support, AI also can substantially reduce radiologist workload.

“Population-based screening with mammography reduces breast cancer mortality, but it places a substantial workload on radiologists who must read a large number of mammograms, the majority of which don’t warrant a recall of the patient,” Dr. Lauritzen said. “The reading workload is further compounded when screening programs employ double reading to improve cancer detection and decrease false-positive recalls.”

Dr. Lauritzen discusses his research on using AI to improve breast cancer screening performance and reducing the rate of false-positive findings.


Dr. Lauritzen and colleagues set out to compare workload and screening performance in two cohorts of women between the ages of 50 and 69 who underwent screening before and after AI implementation.

In the first group, two radiologists read the mammograms of women screened between October 2020 and November 2021 before the implementation of AI. The screening mammograms of the second group of women performed between November 2021 and October 2022 were initially analyzed by AI.

Mammograms deemed likely to be normal by AI were then read by one of 19 specialized full-time breast radiologists. The remaining mammograms were read by two radiologists with AI-assisted decision support.

The commercially available AI system used for screening was trained by deep learning models to highlight and rate suspicious lesions and calcifications within mammograms. All women who underwent mammographic screening were followed for at least 180 days. Invasive cancers and ductal carcinoma in situ (DCIS) detected through screening were confirmed through needle biopsy or surgical specimens.

Lauritzen RY Fig 4 Left mediolateral oblique full-field digital mammographic view in a 67-year-old woman with BIRADS 1

Left mediolateral oblique full-field digital mammographic view in a 67-year-old woman with a Breast Imaging Reporting and Data System density of 1 who underwent screening with the artificial intelligence (AI) system. (A) Image shows AI-provided marking (square). The screening received a high AI examination score of 10, based on this area with arterial calcifications being given a score of 85 out of 100 by the AI system. (B) Same image as in A, but with findings by the radiologists. Because of the high AI examination score, the screening was double read by two radiologists, who determined that the arterial calcifications (circle) did not yield suspicions for breast cancer. The woman was not recalled for diagnostic assessment.

https://doi.org/10.1148/radiol.232479 © RSNA 2024

Screening Performance Indicators Improved

In total, 60,751 women were screened without AI, and 58,246 women were screened with the AI system. In the AI implementation group, 66.9% (38,977) of the screenings were single-read, and 33.1% (19,269) were double-read with AI assistance.

Compared to screening without AI, screening with the AI system detected significantly more breast cancers (0.82% versus 0.70%) and had a lower false-positive rate (1.63% versus 2.39%).

“In the AI-screened group, the recall rate decreased by 20.5 percent, and the radiologists’ reading workload was lowered by 33.4 percent,” Dr. Lauritzen said.

The positive predictive value of AI screening was also greater than that of screening without AI (33.5% versus 22.5%). In the AI group, a higher proportion of invasive cancers detected were 1 centimeter or less in size (44.93% vs. 36.60%).

“All screening performance indicators improved except for the node-negative rate which showed no evidence of change,” Dr. Lauritzen said.

Dr. Lauritzen said more research is needed to evaluate long-term outcomes and ensure overdiagnosis does not increase.

“Radiologists typically have access to the women’s previous screening mammograms, but the AI system does not,” he said. “That’s something we’d like to work on in the future.”

It is also important to note that not all countries follow the same breast cancer screening protocols and intervals. U.S. breast cancer screening protocols differ from protocols used in Denmark.

For More Information 

Access the Radiology study, “Early Indicators of the Impact of Using AI in Mammography Screening for Breast Cancer.”

Read previous RSNA News stories about AI in medical imaging: