Learning filter scale and orientation in convolutional neural networks

Çam, İlker

dc.contributor.advisor	Tek, Faik Boray	en_US
dc.contributor.author	Çam, İlker	en_US
dc.contributor.other	Işık Üniversitesi, Fen Bilimleri Enstitüsü, Bilgisayar Mühendisliği Yüksek Lisans Programı	en_US
dc.date.accessioned	2019-04-09T01:16:52Z
dc.date.available	2019-04-09T01:16:52Z
dc.date.issued	2019-04-02
dc.identifier.citation	Çam, İ. (2019). Learning filter scale and orientation in convolutional neural networks. İstanbul: Işık Üniversitesi Fen Bilimleri Enstitüsü.	en_US
dc.identifier.uri	https://hdl.handle.net/11729/1545
dc.description	Text in English ; Abstract: English and Turkish	en_US
dc.description	Includes bibliographical references (leaves 38-40)	en_US
dc.description	xi, 40 leaves	en_US
dc.description.abstract	Convolutional neural networks have many hyper-parameters such as ?lter size, number of ?lters, and pooling size, which require manual tuning. Though deep stacked structures are able to create multi-scale and hierarchical representations, manually ?xed ?lter sizes limit the scale of representations that can be learned in a single convolutional layer. Can we adaptively learn to scale the ?lters on training time? Proposed adaptive ?lter model can learn the scale and orientation parameters of ?lters using backpropagation. Therefore, in a single convolution layer, we can create ?lters of di?erent scale and orientation that can adapt to small or large features and objects. The proposed model uses a relatively large base size (grid) for ?lters. In the grid, a di?erentiable function acts as an envelope for the ?lters. The envelope function guides e?ective ?lter scale and shape/orientation by masking the ?lter weights before the convolution. Therefore, only the weights in the envelope are updated during training. In this work, we employed a multivariate (2D) Gaussian as the envelope function and showed that it can grow, shrink, or rotate by updating its covariance matrix during backpropagation training. We tested the model with its basic settings to show the collaboration of weight matrix with envelope function is possible. A deeper architecture was used to show the performance on deeper and wider networks. We tested the new ?lter model on MNIST, MNIST-cluttered, and CIFAR-10 datasets. Compared the results with the networks that used conventional convolution layers. The results demonstrate that the new model can e?ectively learn and produce ?lters of di?erent scales and orientations in a single layer. Moreover, the experiments show that the adaptive convolution layers perform equally; or better, especially when data includes objects of varying scale and noisy backgrounds.	en_US
dc.description.abstract	Evrişimsel sinir ağlarında ?ltre boyutu, sayısı ve ortaklama boyutu elle seçilmektedir. Derin katmanlı sinir ağları hiyerarşik çok ölçekli temsiller öğrenebilmesine rağmen, sabit ?ltre boyutları farklı ölçekteki öğrenilebilecek ?ltre sayısını sınırlamaktadır. Aynı katmanda farklı ölçeklerde ?ltreleri eğitim aşamasında öğrenebilen bir mimari olabilir mi? Önerilen ?ltre modelimizde ?ltre ölçek ve oryantasyonları geriye yayılım ile öğrenilebilir. Bu şekilde, aynı evrişimsel katmanda farklı ölçek ve oryantasonlarla büyük ve küçük objeleri tanımlayabiliriz. Önerilen model, nispeten büyük ?ltre (ızgara) boyutlarına sahiptir. Türevi olan bir çevreleyici fonksiyon ile ?ltrelerin efektif ölçeklerini ve oryantasyonlarını, evrişim işlemine girmeden, katsayı matrislerini maskeleyebiliriz. Bu sayede, sadece çevreleyici fonksiyon içerisindeki katsayılar eğitilecektir. Bu çalışmamızda, çok değişkenli (2 Boyutlu) Gaussian fonksiyonunu çevreleyici fonksiyon olarak kullandık. Kovaryans matrisinin geriye yayılım yöntemiyle eğitilmesiyle, çevreyeliyici fonksiyonun büyüyüp, küçüldüğünü ve dönebildiğini gösterdik. Çevreliyici fonksiyonun eğitilebildiğini ve katsayılarla işbirliğini, modelin en basit haliyle deneyimledik. Derin katmanlardaki performansını, derin ve geniş mimariler üzerinde çalıştırdık ve performansını izledik. Önerilen modeli, MNIST, MNIST-cluttered ve CIFAR-10 veri kümelerinde çaıştırdık ve geleneksel evrişimsel sinir ağ mimarilerindeki çalışma performanslarıyla karşılaştırdık. Sonuçlar, önerdiğimiz modelin, farklı ölçek ve oryanyasyonlarda, aynı katmanda, ?ltreler öğrenebildiğini gösterdi. Ayrıca, deneylerimiz, adaptifevrişimsel katmanının aynı, özellikle veri kümesinde farklı ölçeklerde obje ve gürültülü arkaplan içeren veri kümelerinde daha iyi çalıştığını gösterdik.	en_US
dc.description.tableofcontents	Machine Learning	en_US
dc.description.tableofcontents	Classification	en_US
dc.description.tableofcontents	Neural Networks	en_US
dc.description.tableofcontents	Back-propagation	en_US
dc.description.tableofcontents	Auto-encoders	en_US
dc.description.tableofcontents	Convolutional Neural Networks	en_US
dc.description.tableofcontents	Convolutional Layer	en_US
dc.description.tableofcontents	Non-linearity Layer	en_US
dc.description.tableofcontents	Pooling Layer	en_US
dc.description.tableofcontents	Related Adaptive Neural Networks	en_US
dc.description.tableofcontents	Method	en_US
dc.description.tableofcontents	Envelope Function	en_US
dc.description.tableofcontents	Selecting an Envelope Function	en_US
dc.description.tableofcontents	An alternative envelope function	en_US
dc.description.tableofcontents	Initialization	en_US
dc.description.tableofcontents	Implementation	en_US
dc.description.tableofcontents	Experiments	en_US
dc.description.tableofcontents	Filter Guide Experiments	en_US
dc.description.tableofcontents	Learning a Gaussian filter	en_US
dc.description.tableofcontents	Learning a Gaussian and edge filter	en_US
dc.description.tableofcontents	Application on deeper and wider networks	en_US
dc.description.tableofcontents	Scale and orientation experiments	en_US
dc.description.tableofcontents	MNIST	en_US
dc.description.tableofcontents	MNIST Cluttered	en_US
dc.description.tableofcontents	CIFAR-10	en_US
dc.language.iso	en	en_US
dc.publisher	Işık Üniversitesi	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.rights	Attribution-NonCommercial-NoDerivs 3.0 United States	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/us/	*
dc.subject	Adaptive CNN	en_US
dc.subject	Filter learning	en_US
dc.subject	Filter scaling	en_US
dc.subject	Adaptif CNN	en_US
dc.subject	Filtre öğrenmesi	en_US
dc.subject	Ölçeklendirilebilir filtre	en_US
dc.subject.lcc	QA76.9.A3 C36 2019
dc.subject.lcsh	Adaptive computing systems.	en_US
dc.subject.lcsh	Convolutions (Mathematics)	en_US
dc.title	Learning filter scale and orientation in convolutional neural networks	en_US
dc.title.alternative	Evrişimsel sinir ağlarında filtre ölçeği ve oryantasyonunun öğrenilmesi	en_US
dc.type	Master Thesis	en_US
dc.department	Işık Üniversitesi, Fen Bilimleri Enstitüsü, Bilgisayar Mühendisliği Yüksek Lisans Programı	en_US
dc.relation.publicationcategory	Tez	en_US
dc.institutionauthor	Çam, İlker	en_US

Bu öğenin dosyaları:

Ad:: 1545.pdf
Boyut:: 1.818Mb
Biçim:: PDF
Açıklama:: MasterThesis

Göster/Aç

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

FBE - Tez Koleksiyonu | Bilgisayar Mühendisliği / Computer Engineering [73]
Bilgisayar Mühendisliği Yüksek Lisans programına ait tez koleksiyonunu içerir.

Basit öğe kaydını göster

Aksi belirtilmediği sürece bu öğenin lisansı: info:eu-repo/semantics/openAccess