Self-supervised learning of 3D structure from 2D OCT slices for retinal disease diagnosis on UK biobank scans

Nazlı, Muhammet Serdar; Turkan, Yasemin; Tek, Faik Boray

Self-supervised learning of 3D structure from 2D OCT slices for retinal disease diagnosis on UK biobank scans

Dosyalar

Self_Supervised_Learning_of_3D_Structure_from_2D_OCT_Slices_for_Retinal_Disease_Diagnosis_on_UK_Biobank_Scans.pdf (2.71 MB)

Tarih

2025-09-21

Yazarlar

Nazlı, Muhammet Serdar

Turkan, Yasemin

Tek, Faik Boray

Yayıncı

Institute of Electrical and Electronics Engineers Inc.

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

This study presents a self-supervised learning framework for retinal disease classification using Optical Coherence Tomography (OCT) scans. To balance the contextual richness of 3D volumes with the computational efficiency of 2D architectures, we introduce a quasi-3D input generation strategy. Each input is constructed by stacking three OCT slices, sampled from channel-specific Gaussian distributions centered on the volume midplane, and arranged in a standard three-channel 2D format compatible with existing pre-trained models. These quasi-3D images are used to pre-train a Vision Transformer (ViT-Base) via a Masked Autoencoder (MAE) with a shared masking pattern, encouraging the model to reconstruct masked regions by encoding anatomical continuity across slices. Pre-training is conducted on 10,000 unlabeled OCT volumes from the UK Biobank. The encoder is then fine-tuned on the OCTA-500 dataset for three-class and four-class retinal disease classification tasks, including macular degeneration and diabetic retinopathy. The model achieves 92.57% accuracy on the three-class task, matching the performance of RETFound while using over 150 times less pre-training data and a smaller backbone.

Açıklama

UK Biobank data handling and experiments was performed by Yasemin Turkan as part of her PhD thesis, using the UK Biobank Resource under Application Number 82266. Serdar Nazli developed the model and conducted experiments on the OCTA-500 dataset. Computational resources were provided by the Turkish National High-Performance Computing Center (UHEM) under Grant Number 1017802024. This study was also supported by the Scientific and Technological Research Council of Turkey (T\u00FC BI?TAK) under Grant Number 122E509.

Anahtar Kelimeler

Masked autoencoder, Medical image analysis, Optical coherence tomography, Retinal disease, Self-supervised learning, Vision transformer, Classification (of information), Computer aided diagnosis, Diseases, Eye protection, Learning algorithms, Learning systems, Medical image processing, Ophthalmology, Optical tomography, Supervised learning, Auto encoders, Biobanks, Coherence tomography, Disease classification, Optical-, Quasi-3D, Retinal disease, Computational efficiency

Kaynak

International Conference on Computer Science and Engineering, UBMK

Scopus Q Değeri

N/A

Sayı

2025

Künye

Nazlı, M. S., Turkan, Y. & Tek, F. B. (2025). Self-supervised learning of 3D structure from 2D OCT slices for retinal disease diagnosis on UK biobank scans. Paper presented at the International Conference on Computer Science and Engineering, UBMK, 930-934. doi:https://doi.org/10.1109/UBMK67458.2025.11206892

Bağlantı

https://hdl.handle.net/11729/7106
https://doi.org/10.1109/UBMK67458.2025.11206892

Koleksiyon

Öğrenci Yayınları Bildiri Koleksiyonu
Lisansüstü Eğitim Enstitüsü Diğer Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

Detaylı Öğe Kaydı

Self-supervised learning of 3D structure from 2D OCT slices for retinal disease diagnosis on UK biobank scans

Dosyalar

Tarih

Yazarlar

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Erişim Hakkı

Araştırma projeleri

Organizasyon Birimleri

Dergi sayısı

Özet

Açıklama

Anahtar Kelimeler

Kaynak

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye

Bağlantı

Koleksiyon