Ambient speech capabilities in emerging voice recognition products and software updates can convert conversational clinical speech into structured data for radiology reports.
Pixel-based artificial intelligence (AI) has dominated market attention in radiology over the past few years. However, a more familiar and less heralded technology has been evolving for at least two decades, fueled most recently by the growth of cloud computing.
What is the technology? Artificial intelligence-powered voice recognition.
To put it in a more colloquial way, today’s radiology voice recognition solutions are not your parents’ speech recognition technology. In fact, they have far surpassed the ones you may have been using just a few years ago.
Voice recognition is so embedded in clinical workflows that many radiologists and other clinicians take it for granted. Indeed, there may only be a peripheral awareness of how much the technology has advanced. Developments in deep learning and natural language processing, based on massive amounts of voice data, have vastly improved the speed and accuracy of voice recognition engines. The rapid expansion of cloud-hosted AI has further fueled the growth and evolution of speech technology.
The early software required users to train the speech recognition engine by reciting prepared training text. Users also had to be careful to review and correct recognition errors. Accuracy depended on the quality of the input device, background noise, and other factors. Accents and special vocabularies were often problematic. Fortunately, capabilities steadily increased as machine learning technology evolved, and developers continually improved the software based on user feedback.
The widespread deployment of cloud computing over the past five years has accelerated neural network and deep learning techniques. Continuously training speech recognition technology with securely anonymized speech data makes the engine “smarter” as more users interact with it. The latest generation of voice recognition technology from Nuance Communications extracts information from thousands of terabytes of voice data while concurrently predicting what the user may say next. The technology anticipates and prepares to render what is spoken based on context, user patterns, and speech characteristics such as accent. The cloud-based radiology reporting system from Nuance Communications is hosted in Microsoft Azure and enables users to benefit immediately from this continuous learning process in ways never before possible.
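The idea of anticipating what a user will say next can be illustrated with a deliberately simplified sketch. A production engine like the one described above relies on deep neural language models trained on enormous anonymized speech corpora; the toy bigram counter below (all names hypothetical) only shows the basic concept of predicting a likely next word from prior dictations.

```python
from collections import Counter, defaultdict

class BigramPredictor:
    """Toy next-word predictor; illustrative only, not a production engine."""

    def __init__(self):
        # Maps each word to a counter of the words that followed it.
        self.counts = defaultdict(Counter)

    def train(self, sentence: str) -> None:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            self.counts[prev][nxt] += 1

    def predict(self, prev_word: str):
        """Return the most frequent follower of prev_word, or None."""
        followers = self.counts.get(prev_word.lower())
        return followers.most_common(1)[0][0] if followers else None

model = BigramPredictor()
for phrase in [
    "no acute intracranial hemorrhage",
    "no acute fracture",
    "no evidence of pulmonary embolism",
]:
    model.train(phrase)

print(model.predict("no"))  # "acute" follows "no" most often in this tiny corpus
```

Real systems condition on far richer context (full sentence history, user patterns, accent models), but the underlying principle is the same: the more dictations the model sees, the better its predictions become.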
Voice recognition is becoming the new user experience (UX) for radiologists. In fact, ambient speech is the current state-of-the-art voice technology used in solutions such as Nuance Dragon Ambient eXperience (DAX) and PowerScribe. The ambient capabilities recognize and understand the relevant clinical context of conversational speech and convert it into structured, organized output for radiology reports and other applications.
Advances in natural language understanding automatically turn free-form dictation into structured data. Structured data supports the American College of Radiology’s Common Data Elements initiative, aimed at creating a common ontological framework that standardizes meaning from the point of read to the point of care. In PowerScribe One, it helps to create organized, consistent reports from spoken narrative, and provides real-time clinical decision support and evidence-based follow-up recommendations. Structured data also expands interoperability with other systems including PACS, viewers, and EHRs with bidirectional, real-time data exchange.
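To make the notion of "structured data from free-form dictation" concrete, here is a minimal, hypothetical sketch. It uses a hand-written pattern to pull a finding, size, and location out of narrative text; actual natural language understanding engines use trained models, and the function and field names below are illustrative assumptions, not part of any Nuance product.

```python
import re

# Simplified pattern: "<size> <unit> <finding> in the <location>"
MEASUREMENT = re.compile(
    r"(?P<size>\d+(?:\.\d+)?)\s*(?P<unit>mm|cm)\s+(?P<finding>[a-z]+)"
    r"\s+in\s+the\s+(?P<location>[a-z ]+?)(?:\.|,|$)",
    re.IGNORECASE,
)

def extract_findings(dictation: str) -> list:
    """Return structured finding records parsed from narrative dictation."""
    findings = []
    for m in MEASUREMENT.finditer(dictation):
        # Normalize all sizes to millimeters for consistent downstream use.
        size_mm = float(m.group("size")) * (10 if m.group("unit").lower() == "cm" else 1)
        findings.append({
            "finding": m.group("finding").lower(),
            "size_mm": size_mm,
            "location": m.group("location").strip().lower(),
        })
    return findings

report = "There is a 6 mm nodule in the right upper lobe. A 1.2 cm cyst in the left kidney."
print(extract_findings(report))
```

Once findings exist as discrete fields rather than prose, they can map onto common data elements and flow to PACS, viewers, and EHRs in the bidirectional exchanges the article describes.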
While pixel-based AI models and other technologies often capture the headlines, cloud-hosted, AI-driven voice recognition is quietly and effectively powering a new generation of radiology reporting. Today, instead of wondering about voice recognition accuracy, users are seeing improvements in everyday radiology workflows and new ways of applying the technology to enhance efficiency and improve patient outcomes.
Dr. Agarwal is the chief medical information officer for Diagnostic Imaging and AI at Nuance Communications.