Arama Sonuçları

Listeleniyor 1 - 4 / 4
  • Yayın
    Adaptive locally connected recurrent unit (ALCRU)
    (Springer Science and Business Media Deutschland GmbH, 2025-07-03) Özçelik, Şuayb Talha; Tek, Faik Boray
    Research has shown that adaptive locally connected neurons outperform their fully connected (dense) counterparts, motivating this study on the development of the Adaptive Locally Connected Recurrent Unit (ALCRU). ALCRU modifies the Simple Recurrent Neuron Model (SimpleRNN) by incorporating spatial coordinate spaces for input and hidden state vectors, facilitating the learning of parametric local receptive fields. These modifications add four trainable parameters per neuron, resulting in a minor increase in computational complexity. ALCRU is implemented using standard frameworks and trained with back-propagation-based optimizers. We evaluate the performance of ALCRU using diverse benchmark datasets, including IMDb for sentiment analysis, AdditionRNN for sequence modelling, and the Weather dataset for time-series forecasting. Results show that ALCRU achieves accuracy and loss metrics comparable to GRU and LSTM while consistently outperforming SimpleRNN. In particular, experiments with longer sequence lengths on AdditionRNN and increased input dimensions on IMDb highlight ALCRU’s superior scalability and efficiency in processing complex data sequences. In terms of computational efficiency, ALCRU demonstrates a considerable speed advantage over gated models like LSTM and GRU, though it is slower than SimpleRNN. These findings suggest that adaptive local connectivity enhances both the accuracy and efficiency of recurrent neural networks, offering a promising alternative to standard architectures.
  • Yayın
    TURSpider: a Turkish Text-to-SQL dataset and LLM-based study
    (Institute of Electrical and Electronics Engineers Inc., 2024-11-25) Kanburoğlu, Ali Buğra; Tek, Faik Boray
    This paper introduces TURSpider, a novel Turkish Text-to-SQL dataset developed through human translation of the widely used Spider dataset, aimed at addressing the current lack of complex, cross-domain SQL datasets for the Turkish language. TURSpider incorporates a wide range of query difficulties, including nested queries, to create a comprehensive benchmark for Turkish Text-to-SQL tasks. The dataset enables cross-language comparison and significantly enhances the training and evaluation of large language models (LLMs) in generating SQL queries from Turkish natural language inputs. We fine-tuned several Turkish-supported LLMs on TURSpider and evaluated their performance in comparison to state-of-the-art models like GPT-3.5 Turbo and GPT-4. Our results show that fine-tuned Turkish LLMs demonstrate competitive performance, with one model even surpassing GPT-based models on execution accuracy. We also apply the Chain-of-Feedback (CoF) methodology to further improve model performance, demonstrating its effectiveness across multiple LLMs. This work provides a valuable resource for Turkish NLP and addresses specific challenges in developing accurate Text-to-SQL models for low-resource languages.
  • Yayın
    Retinal disease diagnosis in OCT scans using a foundational model
    (Springer Science and Business Media Deutschland GmbH, 2025) Nazlı, Muhammet Serdar; Turkan, Yasemin; Tek, Faik Boray; Toslak, Devrim; Bulut, Mehmet; Arpacı, Fatih; Öcal, Mevlüt Celal
    This study examines the feasibility and performance of using single OCT slices from the OCTA-500 dataset to classify DR (Diabetic Retinopathy) and AMD (Age-Related Macular Degeneration) with a pre-trained transformer-based model (RETFound). The experiments revealed the effective adaptation capability of the pretrained model to the retinal disease classification problem. We further explored the impact of using different slices from the OCT volume, assessing the sensitivity of the results to the choice of a single slice (e.g., “middle slice”) and whether analyzing both horizontal and vertical cross-sectional slices could improve outcomes. However, deep neural networks are complex systems that do not indicate directly whether they have learned and generalized the disease appearance as human experts do. The original dataset lacked disease localization annotations. Therefore, we collected new disease classification and localization annotations from independent experts for a subset of OCTA-500 images. We compared RETFound’s explainability-based localization outputs with these newly collected annotations and found that the region attributions aligned well with the expert annotations. Additionally, we assessed the agreement and variability between experts and RETFound in classifying disease conditions. The Kappa values, ranging from 0.35 to 0.69, indicated moderate agreement among experts and between the experts and the model. The transformer-based RETFound model using single or multiple OCT slices, is an efficient approach to diagnosing AMD and DR.
  • Yayın
    TURSpider veri kümesinde Temsilcilerin Karışımı Tabanlı Text-to-SQL çalışması
    (IEEE, 2025) Kanburoğlu, Ali Buğra; Tek, Faik Boray
    Bu çalışma, Türkçe Text-to-SQL için geliştirilen TURSpider veri kümesi üzerindeki deneyleri ele almaktadır. TURSpider, çeşitli zorluk seviyelerine sahip SQL sorgularını içeren geniş kapsamlı bir Türkçe veri kümesidir ve bu alandaki araştırmalar için önemli bir kaynak niteliğindedir. Çalışmada, geri bildirim odaklı temsilcilerin karışımı yaklaşımının (ing. feedback driven Mixture-of-Agents - MoAF) başarımı incelenmiştir. MoAF yapısında, birden fazla büyük dil modeli (BDM) iş birligi içinde çalışarak SQL oluşturma başarımını artırmayı hedeflemektedir. Bu yapıda temsilci (ing. agent) işbirliği, modellerin birbirinden ögrenmesini ve geri bildirim mekanizmaları aracılığıyla hataların düzeltilmesini sağlamaktadır. Deney sonuçlarına göre, MoAF yaklaşımı ile %60.63 yürütme doğruluğuna ulaşılmış ve TURSpider veri kümesi üzerindeki en iyi sonuç elde edilmiştir.