Bildiri Koleksiyonu | Bilgisayar Mühendisliği Bölümü

Bu koleksiyon için kalıcı URI

Güncel Gönderiler

Listeleniyor 1 - 20 / 127
  • Yayın
    Implementing lightweight, dynamic hierarchical key assignment scheme for cloud computing
    (IEEE, 2024-03-25) Çelikbilek, İbrahim; Çeliktaş, Barış; Özdemir, Enver
    In this paper, we propose the implementation and adaptation of a hierarchical key assignment scheme (HKAS) previously developed in our research to improve access control in cloud computing environments. The secret keys generated and managed by this scheme can be utilized for various purposes within the cloud computing, including data encryption, integrity checks, secure communications, and accessing critical infrastructures or services. Our implementation performs dynamic update operations with minimal computational cost and storage demands, as users within the hierarchical structure do not store any key components. Through security analysis, the scheme demonstrates strong key indistinguishability security (S-KI-security), effectively safeguarding keys against various cryptographic attacks. The scheme's flexibility allows it to be tailored to specific organizational needs, whether for securing sensitive data, ensuring compliance with regulatory standards, or facilitating secure data sharing and collaboration in cloud environments. Thus, we advocate for the practical implementation of the HKAS in transitioning to cloud environments.
  • Yayın
    Leveraging transformer-based language models for enhanced service insight in tourism
    (IEEE, 2023-12-22) Er, Aleyna; Özçelik, Şuayb Talha; Yöndem, Meltem Turhan
    Customer feedback is a valuable resource for enhancing customer experience and identifying areas that require improvement. Utilizing user insights allows a tourism company to identify and address problematic points in its service delivery, provide feedback to partner companies regarding their product offerings, and even reconsider agreements by incorporating these opinions when curating their product portfolio. Setur implemented a systematic approach to collecting customer feedback by distributing "after-stay surveys'' to its customers via email following the completion of the agency services provided. Guest answers to open-ended questions that gather opinions about travel experience are analyzed by four tasks: user intention for answering, the sentiment of the review, subjects touched upon, and whom it concerned. For these tasks, transformer-based natural language processing (NLP) models BERT, DistilBERT, RoBERTa, and Electra are fine-Tuned to classify customer reviews. Based on the test results, it is observed that best practices could be gathered using Bert. In addition, we showed that different insights can be obtained from text comments made for two hotels in Aydin, Turkiye. Some users made complaints using neutral sentences. In some cases, people gave high scores to the numerical rating questions, but their open-ended questions could have a negative meaning.
  • Yayın
    Forecasting and analysis of energy consumption and waste generation in Antalya with SVR
    (IEEE, 2023-12-24) Özçelik, Şuayb Talha; Tek, Faik Boray; Şekerci, Erdal
    Antalya, a rapidly expanding coastal city in Türkiye, has experienced significant changes due to urbanization and increasing tourism activities. Comprehending tourism trends is crucial for the city's sustainable development and environmental management. Based on this perspective, this paper aims to present a comprehensive retrospective analysis of Antalya's energy consumption, domestic solid waste generation, wastewater generation, population growth, and tourist numbers over the years. Antalya faces significant challenges due to escalating trends in listed areas. Utilizing the Support Vector Regression, this study projects a need for an additional 1715 GWh of electricity production capacity, an expansion of wastewater capacity by 85639 thousand m3, and an increase in domestic solid waste disposal capacity by 597745 tons by 2028 to accommodate growing demands. We emphasize the importance of adopting effective policies and strategies to support energy efficiency, waste reduction, and wastewater management alongside sustainable urban planning and tourism management for Antalya's long-Term environmental sustainability and development. The findings presented in this study provide valuable insights for policymakers, urban planners, and stakeholders to make informed decisions, ensuring a balanced approach toward economic growth and environmental conservation.
  • Yayın
    Application of ChatGPT in the tourism domain: potential structures and challenges
    (IEEE, 2023-12-23) Kılıçlıoğlu, Orkun Mehmet; Özçelik, Şuayb Talha; Yöndem, Meltem Turhan
    The tourism industry stands out as a sector where effective customer communication significantly influences sales and customer satisfaction. The recent shift from traditional natural language processing methodologies to state-of-The-Art deep learning and transformer-based models has revolutionized the development of Conversational AI tools. These tools can provide comprehensive information about a company's product portfolio, enhancing customer engagement and decision-making. One potential Conversational AI application can be developed with ChatGPT. In this study, we explore the potential of using ChatGPT, a cutting-edge Conversational AI, in the context of Setur's products and services, focusing on two distinct scenarios: intention recognition and response generation. We incorporate Setur-specific data, including hotel information and annual catalogs. Our research aims to present potential structures and strategies for utilizing Language Model-based systems, particularly ChatGPT, in the tourism domain. We investigate the advantages and disadvantages of three different architectures and evaluate whether a restrictive or more independent model would be suitable for our application. Despite the impressive performance of Large Language Models (LLMs) in generating human-like dialogues, their end-To-end application faces limitations, such as system prompt constraints, fine-Tuning challenges, and model unavailability. Moreover, semantic search fails to deliver satisfactory performance when searching filters that require clear answers. To address these issues, we propose a hybrid approach that employs external interventions, the assignment of different GPT agents according to intent analysis, and traditional methods at specific junctures, which will facilitate the integration of domain knowledge into these systems.
  • Yayın
    AnlamVer: Semantic model evaluation dataset for Turkish - word similarity and relatedness
    (Association for Computational Linguistics (ACL), 2018-08-26) Ercan, Gökhan; Yıldız, Olcay Taner
    In this paper, we present AnlamVer, which is a semantic model evaluation dataset for Turkish designed to evaluate word similarity and word relatedness tasks while discriminating those two relations from each other. Our dataset consists of 500 word-pairs annotated by 12 human subjects, and each pair has two distinct scores for similarity and relatedness. Word-pairs are selected to enable the evaluation of distributional semantic models by multiple attributes of words and word-pair relations such as frequency, morphology, concreteness and relation types (e.g., synonymy, antonymy). Our aim is to provide insights to semantic model researchers by evaluating models in multiple attributes. We balance dataset word-pairs by their frequencies to evaluate the robustness of semantic models concerning out-of-vocabulary and rare words problems, which are caused by the rich derivational and inflectional morphology of the Turkish language.
  • Yayın
    TUR2SQL: A cross-domain Turkish dataset for Text-to-SQL
    (IEEE, 2023-09-15) Kanburoğlu, Ali Buğra; Tek, Faik Boray
    The field of converting natural language into corresponding SQL queries using deep learning techniques has attracted significant attention in recent years. While existing Text-to-SQL datasets primarily focus on English and other languages such as Chinese, there is a lack of resources for the Turkish language. In this study, we introduce the first publicly available cross-domain Turkish Text-to-SQL dataset, named TUR2SQL. This dataset consists of 10,809 pairs of natural language statements and their corresponding SQL queries. We conducted experiments using SQLNet and ChatGPT on the TUR2SQL dataset. The experimental results show that SQLNet has limited performance and ChatGPT has superior performance on the dataset. We believe that TUR2SQL provides a foundation for further exploration and advancements in Turkish language-based Text-to-SQL research.
  • Yayın
    Hotel sales forecasting with LSTM and N-BEATS
    (IEEE, 2023-09-15) Özçelik, Şuayb Talha; Tek, Faik Boray; Şekerci, Erdal
    Time series forecasting aims to model the change in data points over time. It is applicable in many areas, such as energy consumption, solid waste generation, economic indicators (inflation, currency), global warming (heat, water level), and hotel sales forecasting. This paper focuses on hotel sales forecasting with machine learning and deep learning solutions. A simple forecast solution is to repeat the last observation (Naive method) or the average of the past observations (Average method). More sophisticated solutions have been developed over the years, such as machine learning methods that have linear (Linear Regression, ARIMA) and nonlinear (Polynomial Regression and Support Vector Regression) methods. Different kinds of neural networks are developed and used in time series forecasting problems, and two of the successful ones are Recurrent Neural Networks and N-BEATS. This paper presents a forecasting analysis of hotel sales from Türkiye and Cyprus. We showed that N-BEATS is a solid choice against LSTM, especially in long sequences. Moreover, N-BEATS has slightly better inference time results in long sequences, but LSTM is faster in short sequences.
  • Yayın
    ISIKSumm at BioLaySumm task 1: BART-based summarization system enhanced with Bio-entity labels
    (Association for Computational Linguistics (ACL), 2023-07-13) Çolak, Çağla; Karadeniz, İlknur
    Communicating scientific research to the general public is an essential yet challenging task. Lay summaries, which provide a simplified version of research findings, can bridge the gap between scientific knowledge and public understanding. The BioLaySumm task (Goldsack et al., 2023) is a shared task that seeks to automate this process by generating lay summaries from biomedical articles. Two different datasets that have been created from curating two biomedical journals (PLOS and eLife) are provided by the task organizers. As a participant in this shared task, we developed a system to generate a lay summary from an article’s abstract and main text.
  • Yayın
    Auto Train Brain increases the variance of the gamma band sample entropy in the left hemisphere in dyslexia: a pilot study
    (Springer Science and Business Media Deutschland GmbH, 2023) Eroğlu, Günet
    Auto Train Brain is a mobile app that improves reading speed and reading comprehension in dyslexia. The efficacy of Auto Train Brain was proven with a clinical trial. We have analyzed the long-term training effects of the Auto Train Brain on dyslexic children. We have collected QEEG data from 14 channels from 21 dyslexic children for 100 sessions and calculated the sample entropy in the gamma bands for the left posterior brain (T7, P7, and O1). Although the gamma band values fluctuate and no permanent increase in the gamma band values is detected after Auto Train Brain training at T7, P7, and O1, the variance of gamma band sample entropy increases as the neurofeedback session number increases. We have concluded that the Auto Train Brain increases the flexibility of the left brain in dyslexia.
  • Yayın
    BOUN-ISIK participation: an unsupervised approach for the named entity normalization and relation extraction of Bacteria Biotopes
    (Association for Computational Linguistics (ACL), 2019-11-04) Karadeniz, İlknur; Tuna, Ömer Faruk; Özgu, Arzucan
    This paper presents our participation at the Bacteria Biotope Task of the BioNLP Shared Task 2019. Our participation includes two systems for the two subtasks of the Bacteria Biotope Task: the normalization of entities (BB-norm) and the identification of the relations between the entities given a biomedical text (BB-rel). For the normalization of entities, we utilized word embeddings and syntactic re-ranking. For the relation extraction task, pre-defined rules are used. Although both approaches are unsupervised, in the sense that they do not need any labeled data, they achieved promising results. Especially, for the BB-norm task, the results have shown that the proposed method performs as good as deep learning based methods, which require labeled data.
  • Yayın
    Convolutional neural network (CNN) algorithm based facial emotion recognition (FER) system for FER-2013 dataset
    (IEEE, 2022-11-18) Ezerceli, Özay; Eskil, Mustafa Taner
    Facial expression recognition (FER) is the key to understanding human emotions and feelings. It is an active area of research since human thoughts can be collected, processed, and used in customer satisfaction, politics, and medical domains. Automated FER systems had been developed and have been used to recognize humans’ emotions but it has been a quite challenging problem in machine learning due to the high intra-class variation. The first models were using known methods such as Support Vector Machines (SVM), Bayes classifier, Fuzzy Techniques, Feature Selection, Artificial Neural Networks (ANN) in their models but still, some limitations affect the accuracy critically such as subjectivity, occlusion, pose, low resolution, scale, illumination variation, etc. The ability of CNN boosts FER accuracy. Deep learning algorithms have emerged as the greatest way to produce the best results in FER in recent years. Various datasets were used to train, test, and validate the models. FER2013, CK+, JAFFE and FERG are some of the most popular datasets. To improve the accuracy of FER models, one dataset or a mix of datasets has been employed. Every dataset includes limitations and issues that have an impact on the model that is trained for it. As a solution to this problem, our state-of-the-art model based on deep learning architectures, particularly convolutional neural network architectures (CNN) with supportive techniques has been implemented. The proposed model achieved 93.7% accuracy with the combination of FER2013 and CK+ datasets for FER2013.
  • Yayın
    Analysis of single image super resolution models
    (IEEE, 2022-11-18) Köprülü, Mertali; Eskil, Mustafa Taner
    Image Super-Resolution (SR) is a set of image processing techniques which improve the resolution of images and videos. Deep learning approaches have made remarkable improvement in image super-resolution in recent years. This article aims and seeks to provide a comprehensive analysis on recent advances of models which has been used in image superresolution. This study has been investigated over other essential topics of current model problems, such as publicly accessible benchmark data-sets and performance evaluation measures. Finally, The study concluded these analysis by highlighting several weaknesses of existing base models as their feeding strategy and approved that the training technique which is Blind Feeding, which led several model to achieve state-of-the art.
  • Yayın
    Machine learning-based model categorization using textual and structural features
    (Springer Science and Business Media Deutschland GmbH, 2022-09-08) Khalilipour, Alireza; Bozyiğit, Fatma; Utku, Can; Challenger, Moharram
    Model Driven Engineering (MDE), where models are the core elements in the entire life cycle from the specification to maintenance phases, is one of the promising techniques to provide abstraction and automation. However, model management is another challenging issue due to the increasing number of models, their size, and their structural complexity. So that the available models should be organized by modelers to be reused and overcome the development of the new and more complex models with less cost and effort. In this direction, many studies are conducted to categorize models automatically. However, most of the studies focus either on the textual data or structural information in the intelligent model management, leading to less precision in the model management activities. Therefore, we utilized a model classification using baseline machine learning approaches on a dataset including 555 Ecore metamodels through hybrid feature vectors including both textual and structural information. In the proposed approach, first, the textual information of each model has been summarized in its elements through text processing as well as the ontology of synonyms within a specific domain. Then, the performances of machine learning classifiers were observed on two different variants of the datasets. The first variant includes only textual features (represented both in TF-IDF and word2vec representations), whereas the second variant consists of the determined structural features and textual features. It was finally concluded that each experimented machine learning algorithm gave more successful prediction performance on the variant containing structural features. The presented model yields promising results for the model classification task with a classification accuracy of 89.16%.
  • Yayın
    El yazısı rakam sınıflandırması için gözetimsiz benzerlik tabanlı evrişimler
    (Institute of Electrical and Electronics Engineers Inc., 2022) Erkoç, Tuğba; Eskil, Mustafa Taner
    Effective training of filters in Convolutional Neural Networks (CNN) ensures their success. In order to achieve good classification results in CNNs, filters must be carefully initialized, trained and fine-tuned. We propose an unsupervised method that allows the discovery of filters from the given dataset in a single epoch without specifying the number of filters hyper-parameter in convolutional layers. Our proposed method gradually builds the convolutional layers by a discovery routine that extracts a number of features that adequately represent the complexity of the input domain. The discovered filters represent the patterns in the domain, so they do not require any initialization method or backpropagation training for fine tuning purposes. Our method achieves 99.03% accuracy on MNIST dataset without applying any data augmentation techniques.
  • Yayın
    Comparison of choreography vs orchestration based Saga patterns in microservices
    (Institute of Electrical and Electronics Engineers Inc., 2022) Aydın, Şahin; Çebi, Cem Berke
    Microservice Architecture (MSA) is a design and architecture pattern created to deal with the challenges of conventional software programs in terms of stream processing, highly available flexibility, and infrastructural agility. Despite the many advantages of MSA, designing isolated services using the autonomous Databases per Services paradigm is difficult. We realized that because each microservice will have its repository, ensuring data coherence between databases becomes difficult, especially in reversals, where operations transcend different sites. Distributed networked transactions and rollbacks can be efficiently handled using two-phase commitment methods in hardware virtualization using RDBMS databases. However, these approaches can't be used in micro-services with segregated NoSQL servers. Three issues have been addressed in this study: (i) investigate the implementation of event choreography and orchestration methods for the Saga pattern execution in MSA, (ii) existing reality suggestions on the saga pattern adoption and implementation besides the use cases, and (iii) introduce the disbursed transaction records and rollbacks challenges in isolated No-SQL databases with reliant collections in MSA.
  • Yayın
    Unreasonable effectiveness of last hidden layer activations for adversarial robustness
    (Institute of Electrical and Electronics Engineers Inc., 2022) Tuna, Ömer Faruk; Çatak, Ferhat Özgür; Eskil, Mustafa Taner
    In standard Deep Neural Network (DNN) based classifiers, the general convention is to omit the activation function in the last (output) layer and directly apply the softmax function on the logits to get the probability scores of each class. In this type of architectures, the loss value of the classifier against any output class is directly proportional to the difference between the final probability score and the label value of the associated class. Standard White-box adversarial evasion attacks, whether targeted or untargeted, mainly try to exploit the gradient of the model loss function to craft adversarial samples and fool the model. In this study, we show both mathematically and experimentally that using some widely known activation functions in the output layer of the model with high temperature values has the effect of zeroing out the gradients for both targeted and untargeted attack cases, preventing attackers from exploiting the model's loss function to craft adversarial samples. We've experimentally verified the efficacy of our approach on MNIST (Digit), CIFAR10 datasets. Detailed experiments confirmed that our approach substantially improves robustness against gradient-based targeted and untargeted attack threats. And, we showed that the increased non-linearity at the output layer has some ad-ditional benefits against some other attack methods like Deepfool attack.
  • Yayın
    Categorization of the models based on structural information extraction and machine learning
    (Springer Science and Business Media Deutschland GmbH, 2022-07-21) Khalilipour, Alireza; Bozyiğit, Fatma; Utku, Can; Challenger, Moharram
    As various engineering fields increasingly use modelling techniques, the number of provided models, their size, and their structural complexity increase. This makes model management, including finding these models, with state of the art very expensive computationally, i.e., leads to non-tractable graph comparison algorithms. To handle this problem, modelers can organize available models to be reused and overcome the development of the new and more complex models with less cost and effort. Therefore, we utilized a model classification using baseline machine learning approaches on a dataset including 555 Ecore metamodels. In our proposed system, the structural information of each model was summarized in its elements through generating their simple labelled graphs. The proposed solution is to transform the complex attributed graphs of the models to simply labelled graphs so that graph analysis algorithms can be applied to them. The labelled graphs (models) were structurally compared using graph comparison techniques such as graph kernels, and the results were used as a set of features for similarity search. After generating feature vectors, the performance of six machine learning classifiers (Naïve Bayes (NB), k Nearest Neighbors (kNN), Support Vector Machine (SVM), Random Forest (RF), and Artificial Neural Network (ANN) were evaluated on the feature vectors. The presented model yields promising results for the model classification task with a classification accuracy over 87%.
  • Yayın
    Convolutional attention network for MRI-based Alzheimer's disease classification and its interpretability analysis
    (IEEE, 2021-09-17) Türkan, Yasemin; Tek, Faik Boray
    Neuroimaging techniques, such as Magnetic Resonance Imaging (MRI) and Positron Emission Tomography (PET), help to identify Alzheimer's disease (AD). These techniques generate large-scale, high-dimensional, multimodal neuroimaging data, which is time-consuming and difficult to interpret and classify. Therefore, interest in deep learning approaches for the classification of 3D structural MRI brain scans has grown rapidly. In this research study, we improved the 3D VGG model proposed by Korolev et al. [2]. We increased the filters in the 3D convolutional layers and then added an attention mechanism for better classification. We compared the performance of the proposed approaches for the classification of Alzheimer's disease versus mild cognitive impairments and normal cohorts on the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset. We observed that both the accuracy and area under curve results improved with the proposed models. However, deep neural networks are black boxes that produce predictions that require further explanation for medical usage. We compared the 3D-data interpretation capabilities of the proposed models using four different interpretability methods: Occlusion, 3D Ultrametric Contour Map, 3D Gradient-Weighted Class Activation Mapping, and SHapley Additive explanations (SHAP). We observed that explanation results differed in different network models and data classes.
  • Yayın
    Tweet sentiment analysis for cryptocurrencies
    (IEEE, 2021-10-13) Şaşmaz, Emre; Tek, Faik Boray
    Many traders believe in and use Twitter tweets to guide their daily cryptocurrency trading. In this project, we investigated the feasibility of automated sentiment analysis for cryptocurrencies. For the study, we targeted one cryptocurrency (NEO) altcoin and collected related data. The data collection and cleaning were essential components of the study. First, the last five years of daily tweets with NEO hashtags were obtained from Twitter. The collected tweets were then filtered to contain or mention only NEO. We manually tagged a subset of the tweets with positive, negative, and neutral sentiment labels. We trained and tested a Random Forest classifier on the labeled data where the test set accuracy reached 77%. In the second phase of the study, we investigated whether the daily sentiment of the tweets was correlated with the NEO price. We found positive correlations between the number of tweets and the daily prices, and between the prices of different crypto coins. We share the data publicly.
  • Yayın
    ISIKUN at the FinCausal 2020: Linguistically informed machine-learning approach for causality identification in financial documents
    (Association for Computational Linguistics (ACL), 2020) Özenir, Hüseyin Gökberk; Karadeniz, İlknur
    This paper presents our participation to the FinCausal-2020 Shared Task whose ultimate aim is to extract cause-effect relations from a given financial text. Our participation includes two systems for the two sub-tasks of the FinCausal-2020 Shared Task. The first sub-task (Task-1) consists of the binary classification of the given sentences as causal meaningful (1) or causal meaningless (0). Our approach for the Task-1 includes applying linear support vector machines after transforming the input sentences into vector representations using term frequency-inverse document frequency scheme with 3-grams. The second sub-task (Task-2) consists of the identification of the cause-effect relations in the sentences, which are detected as causal meaningful. Our approach for the Task-2 is a CRF-based model which uses linguistically informed features. For the Task-1, the obtained results show that there is a small difference between the proposed approach based on linear support vector machines (F-score 94%), which requires less time compared to the BERT-based baseline (F-score 95%). For the Task-2, although a minor modifications such as the learning algorithm type and the feature representations are made in the conditional random fields based baseline (F-score 52%), we have obtained better results (F-score 60%). The source codes for the both tasks are available online (https://github.com/ozenirgokberk/FinCausal2020.git/).