Multivariate variational autoencoder

dc.authorid0000-0003-1677-9496
dc.contributor.authorYavuz, Mehmet Canen_US
dc.date.accessioned2026-05-05T13:15:04Z
dc.date.available2026-05-05T13:15:04Z
dc.date.issued2025-11-08
dc.departmentIşık Üniversitesi, Mühendislik ve Doğa Bilimleri Fakültesi, Bilgisayar Mühendisliği Bölümüen_US
dc.departmentIşık University, Faculty of Engineering and Natural Sciences, Department of Computer Engineeringen_US
dc.description.abstractLearning latent representations that are simultaneously expressive, geometrically well-structured, and reliably calibrated remains a central challenge for Variational Autoencoders (VAEs). Standard VAEs typically assume a diagonal Gaussian posterior, which simplifies optimization but rules out correlated uncertainty and often yields entangled or redundant latent dimensions. We introduce the Multivariate Variational Autoencoder (MVAE), a tractable full-covariance extension of the VAE that augments the encoder with sample-specific diagonal scales and a global coupling matrix. This induces a multivariate Gaussian posterior of the form N (µϕ(x), C diag(σ2ϕ(x))C⊤), enabling correlated latent factors while preserving a closedform KL divergence and a simple reparameterization path. Beyond likelihood, we propose a multi-criterion evaluation protocol that jointly assesses reconstruction quality (MSE, ELBO), downstream discrimination (linear probes), probabilistic calibration (NLL, Brier, ECE), and unsupervised structure (NMI, ARI). Across Larochelle-style MNIST variants, Fashion-MNIST, and CIFAR-10/100, MVAE consistently matches or outperforms diagonal-covariance VAEs of comparable capacity, with particularly notable gains in calibration and clustering metrics at both low and high latent dimensions. Qualitative analyses further show smoother, more semantically coherent latent traversals and sharper reconstructions. All code, dataset splits, and evaluation utilities are released to facilitate reproducible comparison and future extensions of multivariate posterior models.en_US
dc.identifier.citationYavuz, M. C. (2025). Multivariate variational autoencoder. Arxiv, 1-10. doi: https://doi.org/10.48550/arXiv.2511.07472en_US
dc.identifier.endpage10
dc.identifier.startpage1
dc.identifier.urihttps://hdl.handle.net/11729/7377
dc.identifier.urihttps://doi.org/10.48550/arXiv.2511.07472
dc.identifier.wosPPRN:161694573
dc.identifier.wosqualityN/A
dc.indekslendigikaynakWeb of Scienceen_US
dc.indekslendigikaynakPreprint Citation Indexen_US
dc.institutionauthorYavuz, Mehmet Canen_US
dc.institutionauthorid0000-0003-1677-9496
dc.language.isoenen_US
dc.publisherCornell Univen_US
dc.relation.ispartofArxiven_US
dc.relation.publicationcategoryÖn Baskı – Uluslararası – Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectMultivariateen_US
dc.subjectVariational autoencoderen_US
dc.subjectFullCovariance posterioren_US
dc.subjectLatent correlation modelingen_US
dc.subjectRepresentation learningen_US
dc.titleMultivariate variational autoencoderen_US
dc.typePreprinten_US
dspace.entity.typePublicationen_US

Dosyalar

Orijinal paket
Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
Multivariate_Variational_Autoencoder.pdf
Boyut:
7.95 MB
Biçim:
Adobe Portable Document Format
Lisans paketi
Listeleniyor 1 - 1 / 1
Küçük Resim Yok
İsim:
license.txt
Boyut:
1.17 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: