Researchers Publish Chest X-Ray Dataset to Train AI Models

By MedImaging International staff writers
Posted on 20 Feb 2019
Image: The CheXpert dataset of chest X-rays is designed for automated chest X-ray interpretation (Photo courtesy of Stanford University School of Medicine).
Researchers from the Stanford University School of Medicine (Stanford, CA, USA) have published CheXpert, a large chest X-ray dataset and accompanying competition for automated chest X-ray interpretation. The dataset features uncertainty labels and radiologist-labeled reference-standard evaluation sets. Automated chest radiograph interpretation at the level of practicing radiologists could provide substantial benefit in many medical settings, from improved workflow prioritization and clinical decision support to large-scale screening and global population health initiatives.

CheXpert consists of 224,316 chest radiographs of 65,240 patients, along with their associated radiology reports. The studies were performed at Stanford Hospital between October 2002 and July 2017 in both inpatient and outpatient centers. The dataset was co-released with MIMIC-CXR, a large dataset of 371,920 chest X-rays associated with 227,943 imaging studies sourced from the Beth Israel Deaconess Medical Center between 2011 and 2016.

One of the main obstacles in the development of chest radiograph interpretation models has been the lack of datasets with strong radiologist-annotated ground truth and expert scores against which researchers can compare their models. CheXpert is expected to address that gap, making it easier to track the progress of models over time on a clinically important task.

The researchers have also developed and open-sourced the CheXpert labeler, an automated rule-based labeler that extracts observations from free-text radiology reports for use as structured labels for the images. This is expected to help other institutions extract structured labels from their reports and release other large repositories of data that will allow for cross-institutional testing of medical imaging models. The dataset is expected to help in the development and validation of chest radiograph interpretation models towards improving healthcare access and delivery worldwide.
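To give a rough sense of what rule-based label extraction from report text can look like, here is a minimal Python sketch. It is not the CheXpert labeler and does not reproduce its rules. The phrase lists, the cue words, the function name `label_report`, the numeric encoding (1 for positive, 0 for negative, -1 for uncertain), and the rule for merging repeated mentions are all assumptions made for illustration.

```python
import re

# Hypothetical, tiny phrase lists for illustration only; they are NOT the
# rules used by the actual CheXpert labeler.
OBSERVATIONS = {
    "Pneumothorax": ["pneumothorax"],
    "Edema": ["edema"],
    "Pleural Effusion": ["pleural effusion"],
}
NEGATION = re.compile(r"\b(no|without|negative for)\b")
UNCERTAINTY = re.compile(r"\b(possible|may represent|cannot exclude)\b")

# Assumed label encoding for this sketch.
POSITIVE, NEGATIVE, UNCERTAIN = 1, 0, -1
# Assumed merge rule when an observation is mentioned more than once:
# positive outranks uncertain, which outranks negative.
PRIORITY = {POSITIVE: 2, UNCERTAIN: 1, NEGATIVE: 0}


def label_report(report):
    """Return {observation: label} for observations mentioned in the report."""
    labels = {}
    # Split into rough sentences so a cue only affects its own sentence.
    for sentence in re.split(r"[.;\n]+", report.lower()):
        for name, phrases in OBSERVATIONS.items():
            hit = next((p for p in phrases if p in sentence), None)
            if hit is None:
                continue
            # Only cue words that precede the mention are considered.
            prefix = sentence[: sentence.index(hit)]
            if NEGATION.search(prefix):
                value = NEGATIVE
            elif UNCERTAINTY.search(prefix):
                value = UNCERTAIN
            else:
                value = POSITIVE
            if name not in labels or PRIORITY[value] > PRIORITY[labels[name]]:
                labels[name] = value
    return labels


if __name__ == "__main__":
    print(label_report(
        "No pneumothorax. Possible mild edema. Small left pleural effusion."
    ))
```

Observations that a report never mentions are simply absent from the returned dictionary, which keeps "not mentioned" distinct from "explicitly negative". A production labeler would need far richer phrase matching and negation handling than this sketch provides.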

Related Links:
Stanford University School of Medicine
Copyright © 2000-2025 Globetech Media. All rights reserved.