ChestX-ray14 ds

From DIR
Jump to: navigation, search
Type declaration dcat:Distribution, dctypes:Dataset
Title ChestX-ray14
Description Background & Motivation: Chest X-ray exam is one of the most frequent and cost-effective medical imaging examination. However clinical diagnosis of chest X-ray can be challenging, and sometimes believed to be harder than diagnosis via chest CT imaging. Even some promising work have been reported in the past, and especially in recent deep learning work on Tuberculosis (TB) classification. To achieve clinically relevant computer-aided detection and diagnosis (CAD) in real world medical sites on all data settings of chest X-rays is still very difficult, if not impossible when only several thousands of images are employed for study. This is evident from where the performance deep neural networks for thorax disease recognition is severely limited by the availability of only 4143 frontal view images (Openi is the previous largest publicly available chest X-ray dataset to date).In this database, we provide an enhanced version (with 6 more disease categories and more images as well) of the dataset used in the recent work which is approximately 27 times of the number of frontal chest x-ray images . Our dataset is extracted from the clinical PACS database at National Institutes of Health Clinical Center and consists of ~60% of all frontal chest x-rays in the hospital. Therefore we expect this dataset is significantly more representative to the real patient population distributions and realistic clinical diagnosis challenges, than any previous chest x-ray datasets. Of course, the size of our dataset, in terms of the total numbers of images and thorax disease frequencies, would better facilitate deep neural network training . Refer to on the details of how the dataset is extracted and image labels are mined through natural language processing (NLP).
Creators https://www.cc.nih.gov/drd/summers.html
Publisher https://www.nih.gov/
License
File format CSV, Images, PDF
Date of issue 2017-12-15
HTML page https://nihcc.app.box.com/v/ChestXray-NIHCC
Language http://lexvo.org/id/iso639-3/eng
Version identifier ChestX-ray14
File URL https://nihcc.app.box.com/v/ChestXray-NIHCC
Byte size 45000000000
Item listing Atelectasis, Cardiomegaly, Consolidation, Edema, Effusion, Emphysema, Fibrosis, Hernia, Infiltration, Mass, Nodule, Pleural_Thickening, Pneumonia, Pneumothorax
File directory https://nihcc.app.box.com/v/ChestXray-NIHCC
Documentation https://nihcc.app.box.com/v/ChestXray-NIHCC
Data elements Atelectasis, Cardiomegaly, Consolidation, Edema, Effusion, Emphysema, Fibrosis, Hernia, Infiltration, Mass, Nodule, Pleural_Thickening, Pneumonia, Pneumothorax
Facts about "ChestX-ray14 ds"
Data elementsAtelectasis +, Cardiomegaly +, Consolidation +, Edema +, Effusion +, Emphysema +, Fibrosis +, Hernia +, Infiltration +, Mass +, Nodule +, Pleural_Thickening +, Pneumonia + and Pneumothorax +
Dcat:accessURLhttps://nihcc.app.box.com/v/ChestXray-NIHCC +
Dcat:byteSize45,000,000,000 +
Dcat:downloadURLhttps://nihcc.app.box.com/v/ChestXray-NIHCC +
Dcat:landingPagehttps://nihcc.app.box.com/v/ChestXray-NIHCC +
Dct:creatorhttps://www.cc.nih.gov/drd/summers.html +
Dct:descriptionBackground & Motivation: Chest X-ray exam
Background & Motivation: Chest X-ray exam is one of the most frequent and cost-effective medical imaging examination. However clinical diagnosis of chest X-ray can be challenging, and sometimes believed to be harder than diagnosis via chest CT imaging. Even some promising work have been reported in the past, and especially in recent deep learning work on Tuberculosis (TB) classification. To achieve clinically relevant computer-aided detection and diagnosis (CAD) in real world medical sites on all data settings of chest X-rays is still very difficult, if not impossible when only several thousands of images are employed for study. This is evident from where the performance deep neural networks for thorax disease recognition is severely limited by the availability of only 4143 frontal view images (Openi is the previous largest publicly available chest X-ray dataset to date).In this database, we provide an enhanced version (with 6 more disease categories and more images as well) of the dataset used in the recent work which is approximately 27 times of the number of frontal chest x-ray images . Our dataset is extracted from the clinical PACS database at National Institutes of Health Clinical Center and consists of ~60% of all frontal chest x-rays in the hospital. Therefore we expect this dataset is significantly more representative to the real patient population distributions and realistic clinical diagnosis challenges, than any previous chest x-ray datasets. Of course, the size of our dataset, in terms of the total numbers of images and thorax disease frequencies, would better facilitate deep neural network training . Refer to on the details of how the dataset is extracted and image labels are mined through natural language processing (NLP).
through natural language processing (NLP). +
Dct:formatCSV +, Images + and PDF +
Dct:issued15 December 2017 +
Dct:languagehttp://lexvo.org/id/iso639-3/eng +
Dct:publisherhttps://www.nih.gov/ +
Dct:titleChestX-ray14 +
Foaf:pagehttps://nihcc.app.box.com/v/ChestXray-NIHCC +
Pav:versionChestX-ray14 +
Rdf:typedcat:Distribution + and dctypes:Dataset +
Sio:has-data-itemAtelectasis +, Cardiomegaly +, Consolidation +, Edema +, Effusion +, Emphysema +, Fibrosis +, Hernia +, Infiltration +, Mass +, Nodule +, Pleural_Thickening +, Pneumonia + and Pneumothorax +