News
Media
Conferences
DI Executive
Resources
Event Calendar
Subscribe

News|Videos|April 14, 2026

Large Language Models and Clinical Reasoning: What New Research Reveals

Author(s)Jeff Hall

In a recent interview, Marc Succi, MD, discussed findings from a new study examining the clinical reasoning capabilities of 21 large language models (LLMs), including GPT-5, Grok 4 and Claude 4.5 Opus.

Are large language models (LLMs) capable of reliable clinical reasoning?

In an attempt to answer this questions, researchers performed a cross-sectional study to assess 21 LLMs (including GPT-5, Gemini 3.0 Flash, Grok 4) for clinical reasoning. For the research, recently published in JAMA Network Open, the study authors developed and utilized the Proportional Index of Medical Evaluation for LLMs (PrIME-LLM) score, which evaluated LLM response to 29 clinical vignettes for five clinical reasoning domains that ranged from differential diagnosis and diagnostic testing to final diagnosis and management.

The researchers found that all of the reviewed LLM models had higher than 80 percent failure rates for differential diagnosis but less than 40 percent failure rates for final diagnosis.

In a recent interview with Diagnostic Imaging, Marc Succi, MD, a co-author of the study, posited that while LLMs can be effective “when it’s an open book test with all the data,” the models struggle with decision-making when there is uncertain and disorganized data.

“I think it hits at a really important issue in why we did the study the way we did. That differential for us is really the art of medicine and coming up with a proper differential really sets the tone for the rest of the visit. If you have the wrong differential, but still get to the right answer, that also isn't okay, because that means you may have done 20 extra tests to go through the wrong differential and delayed care, extra costs, etc.,” explained Dr. Succi, an associate professor at Harvard Medical School and executive director of the MESH (Medically Engineered Solutions in Healthcare) Incubator at Mass General Brigham.

For Related Content

• “Cybersecurity Risks with Large Language Models: What Radiologists Should. Know”

• “A Closer Look at Automated LLM Protocoling for Abdominal and Pelvic CT”

• “Clinical Applications of LLMs in Radiology: Key Takeaways from RSNA 2025”

While noting that LLMs can offer high feasibility and low risk for ambient documentation and radiology worklist triage, Dr. Succi maintained that LLMs currently can’t go beyond possible adjunctive use in clinical workflows.

“… It's really not whether the models can sometimes or most of the time get the answer right. It's whether it reasons reliably in an uncertain environment and with uncertain data. For me, medicine is an environment with a lot of uncertainty and a lot of high stakes. … These LLMs as they're presented, as studied, are not ready for clinical integration in a meaningful way without extensive human involvement or oversight,” emphasized Dr. Succi.

Newsletter

Stay at the forefront of radiology with the Diagnostic Imaging newsletter, delivering the latest news, clinical insights, and imaging advancements for today’s radiologists.

Related Content

FDA Expands Approval of Pluvicto in Combination with ARPI for PSMA-Positive mAPMN/S Prostate Cancer

FDA Expands Approval of Pluvicto in Combination with ARPI for PSMA-Positive mAPMN/S Prostate Cancer

August 1st 2026

FDA Clears AI-Powered Breast Ultrasound Software from DeepHealth

FDA Clears AI-Powered Breast Ultrasound Software from DeepHealth

July 31st 2026

FDA Issues 510(k) Clearance for Emerging Enterprise Imaging Platform

FDA Issues 510(k) Clearance for Emerging Enterprise Imaging Platform

July 30th 2026

Top Five Radiology Content in July 2026

Top Five Radiology Content in July 2026

ByDiagnostic Imaging Staff

July 30th 2026

FDA Clears Ultrasound Guidance Software on Vascular Imaging for DVT Evaluation

FDA Clears Ultrasound Guidance Software on Vascular Imaging for DVT Evaluation

July 30th 2026

Latest CME

BURST CME™ Resource Center: Integrating Novel PSMA-Directed Radioligand Approaches for Diagnosis and Management of Prostate Cancer

BURST CME™ Resource Center: Integrating Novel PSMA-Directed Radioligand Approaches for Diagnosis and Management of Prostate Cancer

Jeremie Calais, MD, PhD; Tanya B. Dorff, MD; Nerina McDonald, MSPAS, PA-C; Scott T. Tagawa, MD, MS, FACP, FASCO

Ready for Radioligand Therapy? Patient Selection and Sequencing Simplified

Ready for Radioligand Therapy? Patient Selection and Sequencing Simplified

Jeremie Calais, MD, PhD; Tanya B. Dorff, MD; Scott T. Tagawa, MD, MS, FACP, FASCO

Working Together: Overcoming Barriers to Optimize Outcomes in Patients Treated With Radioligand Therapy Through Multidisciplinary Care

Working Together: Overcoming Barriers to Optimize Outcomes in Patients Treated With Radioligand Therapy Through Multidisciplinary Care

Jeremie Calais, MD, PhD; Tanya B. Dorff, MD; Nerina McDonald, MSPAS, PA-C; Scott T. Tagawa, MD, MS, FACP, FASCO

Radioligand Therapy 101: The Science Behind the Strategy

Radioligand Therapy 101: The Science Behind the Strategy

Jeremie Calais, MD, PhD; Tanya B. Dorff, MD; Scott T. Tagawa, MD, MS, FACP, FASCO

Community Practice Connections™: Beyond the Basics— Revolutionizing Advanced Prostate Cancer Management With PSMA-Targeted Therapies

Community Practice Connections™: Beyond the Basics— Revolutionizing Advanced Prostate Cancer Management With PSMA-Targeted Therapies

Jeremie Calais, MD, PhD; Scott T. Tagawa, MD, MS, FACP, FASCO

Satellite Symposia at the Annual Radiation Oncology Meeting

In-Person + Virtual Event

Satellite Symposia at the Annual Radiation Oncology Meeting

September 26-30, 2026

26th Annual International Lung Cancer Congress

26th Annual International Lung Cancer Congress

Roy S. Herbst, MD, PhD; Sandip Patel, MD, FASCO; Heather A. Wakelee, MD, FASCO

9th Annual School of Nursing Oncology™

9th Annual School of Nursing Oncology™

Beth Faiman, PhD, MSN, APN-BC, BMTCN, AOCN, FAAN, FAPO; Beth Sandy, MSN, CRNP, FAPO; Lindsay Adkins, MSN, FNP-BC, BMTCN; Jeneth Aquino, DNP, FNP-BC; Casey Gormley, MSN, FNP-C, AOCNP; Heather J. Jackson, PhD, FNP-BC; Kelsey Martin, AG-ACNP-BC, AOCNP; Nerina T. McDonald, PA-C; Lauren Verity Moore, DNP, MSN, AGACNP-BC; Faith A. Mutale, DNP, CRNP; Tiffany Richards, PhD, ANP-BC, AOCNP; Emily Skotte, DNP, MSN, ACNP-BC; Leslie Smith, DNP, RN, APRN-CNS, AOCNS, BMTCN; Saneese Stephen, PA-C, MPAS; Sara M. Tinsley-Vance, PhD, APRN, AOCN

Trending on Diagnostic Imaging

FDA Expands Approval of Pluvicto in Combination with ARPI for PSMA-Positive mAPMN/S Prostate Cancer

New Study Shows Substantial Decline in CT Radiation Dosing for Adults Over the Past Decade

FDA Clears AI-Powered Breast Ultrasound Software from DeepHealth

FDA Grants De Novo Authorization for AI-Based Coronary Inflammation Quantification from CCTA Exams

FDA Issues 510(k) Clearance for Emerging Enterprise Imaging Platform