Mammography Study Compares False Positives Between AI and Radiologists in DBT Screening

May 8, 2025

News

Article

For DBT breast cancer screening, 47 percent of radiologist-only flagged false positives involved mass presentations whereas 40 percent of AI-only flagged false positive cases involved benign calcifications, according to research presented at the recent American Roentgen Ray Society (ARRS) conference.

While a recent study revealed a 10 percent false positive rate for AI software and unassisted radiologist assessment for digital breast tomosynthesis (DBT), there were significant differences with the nature of the false positive findings, according to a poster presentation at the American Roentgen Ray Society (ARRS) conference.

For the retrospective study, researchers reviewed data from the use of the AI software (Transpara v1.7.1, ScreenPoint Medical) for 3,183 DBT screening exams to compare false positive findings between the AI software and radiologists. The study authors acknowledged differences between the AI false positive and radiology false positive cohorts with respect to mean patient age (60 vs. 53).

For the 304 false positive cases flagged only by the AI software, 40 percent involved benign calcifications with 13 percent of cases focusing on asymmetries and 12 percent of findings representing benign post-surgical changes.¹

Mammography Study Compares False Positives Between AI and Radiologists in DBT Screening

Benign calcifications accounted for 40 percent of the findings flagged only by AI software in a recent study comparing false positives of radiologists and AI in digital breast tomosynthesis (DBT) screening. Examples of benign calcifications only flagged by the AI software (shown above) include dystrophic, round calcifications (A), prominent skin calcifications (B) and very prominent vascular calcifications (C). (Images courtesy of ARRS.)

Of the 308 false positive findings flagged only by radiologists, the study authors noted that masses were involved in 47 percent of cases, followed by asymmetries (19 percent) and indeterminate calcifications (15 percent).¹

“ … AI was more likely to flag benign calcifications, asymmetries and benign post-surgical changes, and these findings (occurred) more than 50 percent of the time … compared to the radiologists who tended to flag masses, asymmetries and indeterminate calcifications more often,” noted lead study author Tara Shahrvini, an MD/MBA candidate at the David Geffen School of Medicine at the University of California-Los Angeles (UCLA), and colleagues.

For Related Content

• “What New Research Reveals About the Impact of AI and DBT Screening: An Interview with Manisha Bahl, MD”

• “Can AI Bolster Breast Cancer Detection in DBT Screening?”

• “Mammography Study Shows Merits of AI for Improving Breast Cancer Detection and Effectiveness of Recalls”

The researchers noted higher percentages of false positives in the AI cohort with Asian (16 percent vs. 9 percent) and African American women (14 percent vs. 8 percent) in comparison to false positives with unassisted radiologists.¹

Reviewing radiologists also had higher percentages of false positives in women with dense breasts with the study authors citing a 37 percent false positive rate in BI-RADS category C cases (vs. 22 percent for the AI software) and a 14 percent false positive rate in BI-RADS category D cases (vs. 5 percent for AI).¹

In cases that were flagged by AI and unassisted radiologists, the researchers pointed out a 39 percent rate of biopsy recommendations and pathology-confirmed high-risk lesions in 44 percent of those cases. However, they also noted that overlapping findings between AI and unassisted radiologist interpretation only occurred in 1.4 percent of the larger DBT screening cohort.¹

“Given the minimal overlap between AI and radiologist FPs, these findings suggest the potential for a synergistic interpretation by both AI and radiologists to decrease the recall rate in real-world practice,” maintained Shahrvini and colleagues.

Reference

1. Shahrvini T, Wood EJ, Joines MM, et al. Radiologist versus artificial intelligence false positives in digital breast tomosynthesis. Presented at the American Roentgen Ray Society (ARRS) conference April 27-May 1, 2025, San Diego. Available at: https://www2.arrs.org/am25/ . Accessed May 7, 2025.

Related Content

Considering Breast- and Lesion-Level Assessments with Mammography AI: What New Research Reveals

Jeff Hall

June 27th 2025

Article

While there was a decline of AUC for mammography AI software from breast-level assessments to lesion-level evaluation, the authors of a new study, involving 1,200 women, found that AI offered over a seven percent higher AUC for lesion-level interpretation in comparison to unassisted expert readers.

The Reading Room Podcast: Current Insights on Recent Research About Radiation-Induced Cancers with CT Scans, Part 2

Jeff Hall

May 5th 2025

Podcast

In a second part of a new podcast episode on recently published research on projected radiation-induced cancers from computed tomography (CT) scans, Mahadevappa Mahesh, MS, Ph.D., and Joseph Cavallo, M.D., offer current perspectives on cardiac CT dosing, AI advances and the importance of teamwork in ensuring appropriate dosing for CT.

New Study Examines Key Factors with False Negatives on AI Mammography Analysis

Jeff Hall

June 25th 2025

Article

Artificial intelligence (AI) software had a 14 percent false negative rate in a new study involving over 1,082 women with invasive breast cancer.

The Reading Room Podcast: Emerging Concepts in Breast Cancer Screening and Health Equity Implications, Part 3

Jeff Hall

September 1st 2023

Podcast

In the third episode of a three-part podcast, Anand Narayan, M.D., Ph.D., and Amy Patel, M.D., discuss the challenges of expanded breast cancer screening amid a backdrop of radiologist shortages and ever-increasing volume on radiology worklists.

SNMMI: Can 18F-Fluciclovine PET/CT Bolster Detection of PCa Recurrence in the Prostate Bed?

Jeff Hall

June 24th 2025

Article

In an ongoing prospective study of patients with biochemical recurrence of PCa and an initial negative PSMA PET/CT, preliminary findings revealed positive 18F-fluciclovine PET/CT scans in over 54 percent of the cohort, according to a recent poster presentation at the SNMMI conference.

Could an Emerging PET Tracer be a Game Changer for Detecting Hepatocellular Carcinoma?

Jeff Hall

June 23rd 2025

Article

In addition to over 90 percent sensitivity in detecting hepatocellular carcinoma (HCC), the glypican-3 (GPC3) targeted PET tracer 68Ga-aGPC3-scFv appeared to be advantageous in identifying HCC tumors smaller than one centimeter, according to pilot study findings presented at the SNMMI conference.