Large Mammography Study Suggests AI is Equivalent to Radiologists for Double Reading of Exams

September 11, 2023

News

Article

In a prospective study of over 55,000 women who had screening mammography, researchers found that double-reading by a radiologist and artificial intelligence (AI) was non-inferior to double-reading by two radiologists in detecting breast cancer.

Can artificial intelligence (AI) provide a viable alternative for second reading of screening mammography?

In a new prospective study, recently published in Lancet Digital Health, researchers reviewed data from 55,581 women who had mammography screening from April 1, 2021 to June 9, 2022 at a Sweden hospital in order to assess the capabilities of AI (Lunit Insight MMG version 1.1.6, Lunit) as a second reader for mammography exams. The study authors compared double reading by two radiologists to double reading by a radiologist and AI, single AI reading and triple reading by two radiologists and AI.

According to the study, mammography findings were deemed as abnormal for 6,002 women with 1,716 women recalled for further investigation after consensus discussions. Of the 269 cases of diagnosed breast cancer, the researchers noted that 200 patients had invasive breast cancer and 63 women had ductal carcinoma in situ.

The study authors found that double reading by two radiologists diagnosed 250 cases of breast cancer in comparison to 261 cases of breast cancer detected by double reading with one radiologist and AI. Single AI reading diagnosed 246 of the breast cancer cases and was deemed non-inferior to double radiologist reading. Triple reading (with two radiologists and AI) detected breast cancer in 269 cases and was deemed superior by the researchers to the double reading by two radiologists.

The use of AI in double mammography reading led to a 21 percent increase in abnormal findings, according to the study authors. However, they pointed out that subsequent consensus discussions, which took medical history into account with review of mammography and AI findings, reduced the recall rate by 4 percent in comparison to double reading by two radiologists.

“Thus, the consensus discussion was effective in ensuring that the higher abnormal interpretation rate for AI plus one radiologist did not translate into an increased recall rate. … In a screening population of 100,000 women, replacing one radiologist with AI would save 100,000 radiologist reads while increasing consensus discussions by 1,562. Even if the consensus discussions would take five times longer than an independent read, the workload reduction would be considerable,” wrote lead study author Karin Dembrower, M.D., the head physician in the Department of Breast Radiology at Capio Sankt Gorans Hospital in Stockholm, Sweden, and colleagues.

For Related Content

• “Can AI Match Radiologist Assessment of Screening Mammography Exams?”

• “Combining AI Lesion Detection, Mammographic Texture Model Improves Breast Cancer Risk Assessment”

• “Large Mammography Study Shows Significant Benefits with AI-Aided Screening

The researchers also noted that the 11 radiologists who participated in the study had a median experience of 17 years.

While the triple reading approach had a slightly higher detection rate for breast cancer in comparison to double reading with radiologists, the study authors said there were corresponding increases of 50 percent more consensus discussions and a 5 percent higher recall rate.

“The additional cost in terms of workload for radiologists and worry for women must be weighed against the incremental increase in cancer detection,” cautioned Dembrower and colleagues.

In regard to study limitations, the researchers conceded that basing the threshold for AI abnormality detection on data from retrospective studies may not be optimal and that subsequent calibration may be necessary to obtain a viable abnormality threshold in clinical practice. The single-arm paired design in the study prevented comparison of interval breast cancer rates between the different reader strategies assessed in the study, according to Dembrower and colleagues.

Related Content

Emerging AI Algorithm Shows Promise for Abbreviated Breast MRI in Multicenter Study

Jeff Hall

April 25th 2025

Article

An artificial intelligence algorithm for dynamic contrast-enhanced breast MRI offered a 93.9 percent AUC for breast cancer detection, and a 92.3 percent sensitivity in BI-RADS 3 cases, according to new research presented at the Society for Breast Imaging (SBI) conference.

The Reading Room Podcast: Emerging Concepts in Breast Cancer Screening and Health Equity Implications, Part 3

Jeff Hall

September 1st 2023

Podcast

In the third episode of a three-part podcast, Anand Narayan, M.D., Ph.D., and Amy Patel, M.D., discuss the challenges of expanded breast cancer screening amid a backdrop of radiologist shortages and ever-increasing volume on radiology worklists.

Could AI-Powered Abbreviated MRI Reinvent Detection for Structural Abnormalities of the Knee?

Jeff Hall

April 24th 2025

Article

Employing deep learning image reconstruction, parallel imaging and multi-slice acceleration in a sub-five-minute 3T knee MRI, researchers noted 100 percent sensitivity and 99 percent specificity for anterior cruciate ligament (ACL) tears.

The Reading Room Podcast: Emerging Concepts in Breast Cancer Screening and Health Equity Implications, Part 2

Jeff Hall

August 23rd 2023

Podcast

In the second episode of a three-part podcast, Anand Narayan, M.D., Ph.D., and Amy Patel, M.D., discuss recent studies published by the Journal of the American Medical Association (JAMA) that suggested moving to more of a risk-adapted model for mammography screening.

Can Deep Learning Enhance Low-Field MRI for Multiple Sclerosis Assessment?

Jeff Hall

April 22nd 2025

Article

In comparison to native 64-mT MRI, the deep learning generative model LowGAN offered enhanced white matter lesion conspicuity and image quality in a study involving patients with multiple sclerosis.

What is the Best Use of AI in CT Lung Cancer Screening?

Jeff Hall

April 18th 2025

Article

In comparison to radiologist assessment, the use of AI to pre-screen patients with low-dose CT lung cancer screening provided a 12 percent reduction in mean interpretation time with a slight increase in specificity and a slight decrease in the recall rate, according to new research.