A robust model training strategy using hard negative mining in a weakly labeled dataset for lymphatic invasion in gastric cancer

  • Jonghyun Lee
  • , Sangjeong Ahn
  • , Hyun Soo Kim
  • , Jungsuk An
  • , Jongmin Sim

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Gastric cancer is a significant public health concern, emphasizing the need for accurate evaluation of lymphatic invasion (LI) for determining prognosis and treatment options. However, this task is time-consuming, labor-intensive, and prone to intra- and interobserver variability. Furthermore, the scarcity of annotated data presents a challenge, particularly in the field of digital pathology. Therefore, there is a demand for an accurate and objective method to detect LI using a small dataset, benefiting pathologists. In this study, we trained convolutional neural networks to classify LI using a four-step training process: (1) weak model training, (2) identification of false positives, (3) hard negative mining in a weakly labeled dataset, and (4) strong model training. To overcome the lack of annotated datasets, we applied a hard negative mining approach in a weakly labeled dataset, which contained only final diagnostic information, resembling the typical data found in hospital databases, and improved classification performance. Ablation studies were performed to simulate the lack of datasets and severely unbalanced datasets, further confirming the effectiveness of our proposed approach. Notably, our results demonstrated that, despite the small number of annotated datasets, efficient training was achievable, with the potential to extend to other image classification approaches used in medicine.

Original languageEnglish
Article numbere355
JournalJournal of Pathology: Clinical Research
Volume10
Issue number1
DOIs
StatePublished - Jan 2024

Keywords

  • artificial intelligence
  • computational pathology
  • gastric cancer
  • hard negative mining
  • lymphatic invasion

Fingerprint

Dive into the research topics of 'A robust model training strategy using hard negative mining in a weakly labeled dataset for lymphatic invasion in gastric cancer'. Together they form a unique fingerprint.

Cite this