Purpose To distinguish functioning from failed filtration blebs (FBs) implementing a deep learning (DL) model on slit-lamp images.Methods Retrospective, cross-sectional, multicenter study for development and validation of an artificial intelligence classification algorithm. The dataset consisted of 119 post-trabeculectomy FB images of whom we were aware of the surgical outcome. The ground truth labels were annotated and images splitted into three outcome classes: complete (C) or qualified success (Q), and failure (F). Images were prepared implementing various data cleaning and data transformations techniques. A set of DL models were trained using different ResNet architectures as the backbone. Transfer and ensemble learning were then applied to obtain a final combined model. Accuracy, sensitivity, specificity, area under the ROC curve, and area under the precision-recall curve were calculated to evaluate the final model. Kappa coefficient and P value on the accuracy measure were used to prove the statistical significance level.Results The DL approach reached good results in unraveling FB functionality. Overall, the model accuracy reached a score of 74%, with a sensitivity of 74% and a specificity of 87%. The area under the ROC curve was 0.8, whereas the area under the precision-recall curve was 0.74. The P value was equal to 0.00307, and the Kappa coefficient was 0.58.Conclusions All considered metrics supported that the final DL model was able to discriminate functioning from failed FBs, with good accuracy. This approach could support clinicians in the patients' management after glaucoma surgery in absence of adjunctive clinical data.
A deep learning approach to investigate the filtration bleb functionality after glaucoma surgery: a preliminary study
Mastropasqua, LeonardoCo-primo
;Agnifili, Luca
Co-primo
;Sensi, Stefano L;Mastropasqua, RodolfoUltimo
2024-01-01
Abstract
Purpose To distinguish functioning from failed filtration blebs (FBs) implementing a deep learning (DL) model on slit-lamp images.Methods Retrospective, cross-sectional, multicenter study for development and validation of an artificial intelligence classification algorithm. The dataset consisted of 119 post-trabeculectomy FB images of whom we were aware of the surgical outcome. The ground truth labels were annotated and images splitted into three outcome classes: complete (C) or qualified success (Q), and failure (F). Images were prepared implementing various data cleaning and data transformations techniques. A set of DL models were trained using different ResNet architectures as the backbone. Transfer and ensemble learning were then applied to obtain a final combined model. Accuracy, sensitivity, specificity, area under the ROC curve, and area under the precision-recall curve were calculated to evaluate the final model. Kappa coefficient and P value on the accuracy measure were used to prove the statistical significance level.Results The DL approach reached good results in unraveling FB functionality. Overall, the model accuracy reached a score of 74%, with a sensitivity of 74% and a specificity of 87%. The area under the ROC curve was 0.8, whereas the area under the precision-recall curve was 0.74. The P value was equal to 0.00307, and the Kappa coefficient was 0.58.Conclusions All considered metrics supported that the final DL model was able to discriminate functioning from failed FBs, with good accuracy. This approach could support clinicians in the patients' management after glaucoma surgery in absence of adjunctive clinical data.File | Dimensione | Formato | |
---|---|---|---|
s00417-023-06170-6.pdf
accesso aperto
Tipologia:
PDF editoriale
Dimensione
1.87 MB
Formato
Adobe PDF
|
1.87 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.