
In a comparison of image-to-text large language models (LLMs), ChatGPT 4.0 offered a 95 percent sensitivity rate and an 83 percent AUC that were comparable to that of two senior radiologists and one junior radiologist interacting with LLM to differentiate between malignant and benign thyroid nodules on ultrasound.






















































