We are continuously adding new COVID CT images and we would like to invite the community to contribute COVID CTs as well.
The COVID-CT-Dataset has 349 CT images containing clinical findings of COVID-19 from 216 patients. They are in ./Images-processed/CT_COVID.zip
Non-COVID CT scans are in ./Images-processed/CT_NonCOVID.zip
We provide a data split in ./Data-split
.
Data split information see README for DenseNet_predict.md
The meta information (e.g., patient ID, DOI, image caption) is in COVID-CT-MetaInfo.xlsx
The images are collected from COVID19-related papers from medRxiv, bioRxiv, NEJM, JAMA, Lancet, etc. CTs containing COVID-19 abnormalities are selected by reading the figure captions in the papers. All copyrights of the data belong to the authors and publishers of these papers.
The dataset details are described in this preprint: COVID-CT-Dataset: A CT Scan Dataset about COVID-19
If you find this dataset and code useful, please cite:
@article{zhao2020COVID-CT-Dataset,
title={COVID-CT-Dataset: a CT scan dataset about COVID-19},
author={Zhao, Jinyu and Zhang, Yichen and He, Xuehai and Xie, Pengtao},
journal={arXiv preprint arXiv:2003.13865},
year={2020}
}
We developed a baseline method for the community to benchmark with.
The details are in README for DenseNet_predict.md
- To contribute to our project, please email your data to [email protected] with the corresponding meta information (Patient ID, DOI and Captions).
- We recommend you also extract images from publications or preprints. Make sure the original papers you crawled have different DOIs from those listed in
COVID-CT-MetaInfo.xlsx
. - In
COVID-CT-MetaInfo.xlsx
, images with the form of2020.mm.dd.xxxx
are crawled from bioRxiv or medRxiv. The DOIs for these preprints are10.1101/2020.mm.dd.xxxx
.