Automatic delineation of laryngeal squamous cell carcinoma during endoscopy

Azam, Muhammad Adeel; Sampieri, Claudio; Ioppi, Alessandro; Azam, Muhammad Ashir; Baldini, Chiara; Shunlei, Li; Moccia, Sara; Peretti, Giorgio; Mattos, Leonardo S.

doi:10.1016/j.bspc.2023.105666

White Light (WL) and Narrow Band Imaging (NBI) endoscopy are widely used to assess the superficial spreading of laryngeal squamous cell carcinoma (LSCC). However, the analysis of images requires a high level of attention and extensive clinical expertise, leading to inter-clinician variability on the assessment of tumor margins. Computer-aided segmentation can automate the identification of LSCC margins, supporting clinicians in this challenging task. In this paper, we present SegMENT-Plus, a Deep Learning segmentation convolutional network specifically developed and optimized for accurate delineation of LSCC. SegMENT-Plus uses EfficienstNetB5 as encoder with a new modified Atrous Spatial Pyramid Pooling (m-ASPP) block that integrates Channel Block Attention Module (CBAM) and Squeeze Excitation (SE). In this new architecture, CBAM extracts local and global LSCC features from the encoder, while the SE block refines cancer segmentation on each dilated convolution output. SegMENT-Plus was trained and evaluated on a multi-center dataset including clinical data from three different hospitals. A total of 4289 annotated laryngeal images from 766 patients were included in this study. The experiments showed that SegMENT-Plus achieved a Dice Similarity Coefficient (DSC) between 81.4% and 84.9% and an Intersection over Union (IOU) between 81.8% and 85.7% on the data from the different hospitals, attesting its high performance and generalization capability. The proposed segmentation architecture also demonstrated statistically significant improvement in DSC and IoU compared to other state of the art archi-tectures, showing that this work is a concrete foundation towards a clinical system for the automatic delineation of LSCC margins in endoscopic images.