Concepedia

TLDR

COVID‑19 has infected over 1.3 million people worldwide, causing more than 106 000 deaths, and while deep‑learning CT‑based diagnosis methods are being developed, their reproducibility is hampered by non‑public data and the need for large training sets. The study aims to solve the data scarcity and reproducibility challenges in CT‑based COVID‑19 diagnosis. The authors constructed a publicly available dataset of hundreds of COVID‑19 CT scans and introduced the Self‑Trans method, which combines contrastive self‑supervised learning with transfer learning to learn robust, unbiased feature representations that mitigate overfitting. Experiments show that Self‑Trans achieves an F1 of 0.85 and an AUC of 0.94, outperforming several state‑of‑the‑art baselines despite using only a few hundred training CTs.

Abstract

Abstract Coronavirus disease 2019 (COVID-19) has infected more than 1.3 million individuals all over the world and caused more than 106,000 deaths. One major hurdle in controlling the spreading of this disease is the inefficiency and shortage of medical tests. There have been increasing efforts on developing deep learning methods to diagnose COVID-19 based on CT scans. However, these works are difficult to reproduce and adopt since the CT data used in their studies are not publicly available. Besides, these works require a large number of CTs to train accurate diagnosis models, which are difficult to obtain. In this paper, we aim to address these two problems. We build a publicly-available dataset containing hundreds of CT scans positive for COVID-19 and develop sample-efficient deep learning methods that can achieve high diagnosis accuracy of COVID-19 from CT scans even when the number of training CT images are limited. Specifically, we propose a Self-Trans approach, which synergistically integrates contrastive self-supervised learning with transfer learning to learn powerful and unbiased feature representations for reducing the risk of overfitting. Extensive experiments demonstrate the superior performance of our proposed Self-Trans approach compared with several state-of-the-art baselines. Our approach achieves an F1 of 0.85 and an AUC of 0.94 in diagnosing COVID-19 from CT scans, even though the number of training CTs is just a few hundred.

References

YearCitations

Page 1