Retinal disease classification from bimodal OCT and OCTA using a CNN-ViT hybrid architecture

Aydın, Ömer Faruk; Tek, Faik Boray; Turkan, Yasemin

Retinal disease classification from bimodal OCT and OCTA using a CNN-ViT hybrid architecture

dc.authorid	0009-0000-3453-1502
dc.authorid	0000-0002-8649-6013
dc.authorid	0000-0001-6309-4524
dc.contributor.author	Aydın, Ömer Faruk	en_US
dc.contributor.author	Tek, Faik Boray	en_US
dc.contributor.author	Turkan, Yasemin	en_US
dc.date.accessioned	2026-03-06T11:02:23Z
dc.date.available	2026-03-06T11:02:23Z
dc.date.issued	2025-09-21
dc.department	Işık Üniversitesi, Lisansüstü Eğitim Enstitüsü, Bilgisayar Mühendisliği Yüksek Lisans Programı	en_US
dc.department	Işık University, School of Graduate Studies, Master’s Program in Computer Engineering	en_US
dc.description	This study was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) under Grant Number 122E509.	en_US
dc.description.abstract	Retinal diseases are the leading cause of vision impairment and blindness worldwide. Early and accurate diagnosis is critical for effective treatment, and recent advances in imaging technologies such as Optical Coherence Tomography (OCT) and OCT Angiography (OCTA), have enabled detailed visualization of the retinal structure and vasculature. By leveraging these modalities, this study proposes an advanced deep learning architecture called MultiModalNet for automated multi-class retinal disease classification. MultiModalNet employs a dual-branch design, where OCTA projection maps are processed through a ResNet101 encoder, and cross-sectional slices from the OCT volume (B-scans) are analyzed using a Vision Transformer (ViT-Large). The extracted features from both branches were fused and passed through the fully connected layers for the final classification. Evaluated on the 3-class OCTA-500 dataset, which includes Age-related Macular Degeneration (AMD), Diabetic Retinopathy (DR), and Normal cases, the proposed model achieved state-of-the-art classification accuracy of 94.59 percent, significantly o utperforming single-modality baselines. This result highlights the effectiveness of integrating vascular and structural information to improve the diagnostic performance. The findings suggest that hybrid multi-modal deep learning approaches can play a transformative role in computer-aided ophthalmology, enhancing both clinical decision-making and screening workflows.	en_US
dc.description.sponsorship	Türkiye Bilimsel ve Teknolojik Araştırma Kurumu	en_US
dc.description.version	Publisher's Version	en_US
dc.identifier.citation	Aydın, Ö. F., Tek, F. B. & Turkan, Y. (2025). Retinal disease classification from bimodal OCT and OCTA using a CNN-ViT hybrid architecture. Paper presented at the International Conference on Computer Science and Engineering, UBMK, 2025, 260-264. doi:https://doi.org/10.1109/UBMK67458.2025.11206835	en_US
dc.identifier.doi	10.1109/UBMK67458.2025.11206835
dc.identifier.endpage	264
dc.identifier.isbn	2521-1641
dc.identifier.isbn	9798331599751
dc.identifier.issue	2025
dc.identifier.scopus	2-s2.0-105030845081
dc.identifier.scopusquality	N/A
dc.identifier.startpage	260
dc.identifier.uri	https://hdl.handle.net/11729/7105
dc.identifier.uri	https://doi.org/10.1109/UBMK67458.2025.11206835
dc.indekslendigikaynak	Scopus	en_US
dc.institutionauthor	Turkan, Yasemin	en_US
dc.institutionauthorid	0000-0001-6309-4524
dc.language.iso	en	en_US
dc.peerreviewed	Yes	en_US
dc.publicationstatus	Published	en_US
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	en_US
dc.relation.ispartof	International Conference on Computer Science and Engineering, UBMK	en_US
dc.relation.publicationcategory	Konferans Öğesi - Uluslararası - Öğrenci	en_US
dc.rights	info:eu-repo/semantics/closedAccess	en_US
dc.subject	Convolutional Neural Networks (CNN)	en_US
dc.subject	Deep learning	en_US
dc.subject	Multi-modal	en_US
dc.subject	Optical Coherence Tomography Angiography (OCTA)	en_US
dc.subject	Retinal disease classification	en_US
dc.subject	Vision Transformer (ViT)	en_US
dc.subject	Architecture	en_US
dc.subject	Classification (of information)	en_US
dc.subject	Computer aided diagnosis	en_US
dc.subject	Convolutional neural networks	en_US
dc.subject	Decision making	en_US
dc.subject	Deep neural networks	en_US
dc.subject	Eye protection	en_US
dc.subject	Learning systems	en_US
dc.subject	Ophthalmology	en_US
dc.subject	Coherence tomography	en_US
dc.subject	Convolutional neural network	en_US
dc.subject	Disease classification	en_US
dc.subject	Retinal disease	en_US
dc.subject	Angiography	en_US
dc.title	Retinal disease classification from bimodal OCT and OCTA using a CNN-ViT hybrid architecture	en_US
dc.type	Conference Object	en_US
dspace.entity.type	Publication	en_US

Dosyalar

Orijinal paket

Listeleniyor 1 - 1 / 1

İsim:: Retinal_Disease_Classification_from_Bimodal_OCT_and_OCTA_Using_a_CNN_ViT_Hybrid_Architecture.pdf
Boyut:: 1.93 MB
Biçim:: Adobe Portable Document Format

İndir

Lisans paketi

Listeleniyor 1 - 1 / 1

İsim:: license.txt
Boyut:: 1.17 KB
Biçim:: Item-specific license agreed upon to submission
Açıklama:

İndir

Koleksiyon

Öğrenci Yayınları Bildiri Koleksiyonu
Lisansüstü Eğitim Enstitüsü Diğer Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu