Study finds wide variation in chest X-ray AI performance
A head-to-head study of 7 commercial AI devices found sensitivity ranged from 20.8% to 77.8% for lung cancer detection on chest X-rays.

Commercial AI devices varied widely in lung cancer detection on chest X-rays, according to a head-to-head study published May 19 in Radiology.
Ahmed Maiter, MB BChir, of Sheffield Teaching Hospitals NHS Foundation Trust, and colleagues tested 7 AI devices on 5,235 posteroanterior chest radiographs from 5,235 adult patients at a single U.K. center. The radiographs were acquired between July 2020 and February 2021.
Confirmed lung cancer with a visible tumor was present in 1.4% of patients. Median patient age was 60 years, 53.4% were female, and 79.4% were White.
Participating manufacturers included Annalise.ai, now Harrison.ai, Gleamer, InferVision, Milvue, Oxipit, Qure.ai, and Rayscape. Device outputs were later anonymized in the analysis.
Performance varied across key metrics. Area under the receiver operating characteristic curve ranged from 0.80 to 0.94, sensitivity from 20.8% to 77.8%, specificity from 58.9% to 98.4%, and positive predictive value from 1.5% to 28.4%.
Pairwise comparisons showed significant differences in 39 of 44 classification-result comparisons. Device classification agreement was minimal, with a Fleiss κ of 0.24.
Compared with radiologist reports, 3 devices detected more tumors and 4 detected fewer tumors. Additional false-positive results ranged from 10 to 2,039, depending on the device.
False-positive failure analysis found that erroneous detection of other pathologies was the most common cause of false-positive results, with a median of 71%.
Study authors concluded that commercial AI tools for this use case do not perform equally. They said future work should examine the impact of different devices on radiologist accuracy, reporting behavior, patient outcomes, and healthcare service delivery.
About the author
RadiologySignal.com writersEditorial Team
Radiology Signal Staff covers developments across medical imaging, radiology AI, imaging informatics, clinical research, and radiology business. The team monitors primary sources, peer-reviewed studies, company announcements, society updates, and healthcare industry news to deliver concise reporting for imaging professionals.
More from this section

Pediatric ED imaging gaps persist by race and insurance status
A cohort study of 857,034 pediatric ED visits found that stronger hospital pediatric capability was not associated with narrower imaging-use differences by race, ethnicity, or insurance status.

MRI radiomics model predicts colorectal cancer subtype
The multicenter study found an MRI-based machine-learning radiomics score predicted CMS4 colorectal cancer and stratified recurrent metastasis risk.

Contrast tied to higher AKI risk in neonatal ICU study
The retrospective study found higher acute kidney injury rates in neonates exposed to iodinated contrast media, although most cases were stage 1.