Multivariate variational autoencoder

Yavuz, Mehmet Can

dc.authorid	0000-0003-1677-9496
dc.contributor.author	Yavuz, Mehmet Can	en_US
dc.date.accessioned	2026-05-05T13:15:04Z
dc.date.available	2026-05-05T13:15:04Z
dc.date.issued	2025-11-08
dc.department	Işık Üniversitesi, Mühendislik ve Doğa Bilimleri Fakültesi, Bilgisayar Mühendisliği Bölümü	en_US
dc.department	Işık University, Faculty of Engineering and Natural Sciences, Department of Computer Engineering	en_US
dc.description.abstract	Learning latent representations that are simultaneously expressive, geometrically well-structured, and reliably calibrated remains a central challenge for Variational Autoencoders (VAEs). Standard VAEs typically assume a diagonal Gaussian posterior, which simplifies optimization but rules out correlated uncertainty and often yields entangled or redundant latent dimensions. We introduce the Multivariate Variational Autoencoder (MVAE), a tractable full-covariance extension of the VAE that augments the encoder with sample-specific diagonal scales and a global coupling matrix. This induces a multivariate Gaussian posterior of the form $\mathcal{N}\big(\mu_\phi(x),\, C\,\mathrm{diag}(\sigma_\phi^2(x))\,C^\top\big)$, enabling correlated latent factors while preserving a closed-form KL divergence and a simple reparameterization path. Beyond likelihood, we propose a multi-criterion evaluation protocol that jointly assesses reconstruction quality (MSE, ELBO), downstream discrimination (linear probes), probabilistic calibration (NLL, Brier, ECE), and unsupervised structure (NMI, ARI). Across Larochelle-style MNIST variants, Fashion-MNIST, and CIFAR-10/100, MVAE consistently matches or outperforms diagonal-covariance VAEs of comparable capacity, with particularly notable gains in calibration and clustering metrics at both low and high latent dimensions. Qualitative analyses further show smoother, more semantically coherent latent traversals and sharper reconstructions. All code, dataset splits, and evaluation utilities are released to facilitate reproducible comparison and future extensions of multivariate posterior models.	en_US
dc.identifier.citation	Yavuz, M. C. (2025). Multivariate variational autoencoder. Arxiv, 1-10. doi: https://doi.org/10.48550/arXiv.2511.07472	en_US
dc.identifier.endpage	10
dc.identifier.startpage	1
dc.identifier.uri	https://hdl.handle.net/11729/7377
dc.identifier.uri	https://doi.org/10.48550/arXiv.2511.07472
dc.identifier.wos	PPRN:161694573
dc.identifier.wosquality	N/A
dc.indekslendigikaynak	Web of Science	en_US
dc.indekslendigikaynak	Preprint Citation Index	en_US
dc.institutionauthor	Yavuz, Mehmet Can	en_US
dc.institutionauthorid	0000-0003-1677-9496
dc.language.iso	en	en_US
dc.publisher	Cornell Univ	en_US
dc.relation.ispartof	Arxiv	en_US
dc.relation.publicationcategory	Preprint – International – Institutional Faculty Member	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Multivariate	en_US
dc.subject	Variational autoencoder	en_US
dc.subject	Full-covariance posterior	en_US
dc.subject	Latent correlation modeling	en_US
dc.subject	Representation learning	en_US
dc.title	Multivariate variational autoencoder	en_US
dc.type	Preprint	en_US
dspace.entity.type	Publication	en_US
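
The posterior form given in the abstract, with mean µϕ(x), per-sample scales σϕ(x), and a global coupling matrix C, can be illustrated with a minimal NumPy sketch. It is not taken from the released code. The function names `sample_posterior` and `kl_to_standard_normal` are hypothetical, and the sketch assumes a standard normal prior N(0, I) and an invertible C, neither of which is stated in this record.

```python
import numpy as np

def sample_posterior(mu, sigma, C, rng):
    """Reparameterized draw z = mu + C (sigma * eps), eps ~ N(0, I).

    The resulting covariance is C diag(sigma^2) C^T.
    mu: (batch, d); sigma: (batch, d) or (d,); C: (d, d).
    """
    eps = rng.standard_normal(mu.shape)
    # Row-vector convention: (sigma * eps) @ C.T equals C @ (sigma * eps) per sample.
    return mu + (sigma * eps) @ C.T

def kl_to_standard_normal(mu, sigma, C):
    """KL( N(mu, C diag(sigma^2) C^T) || N(0, I) ) per sample.

    Assumes a standard normal prior and an invertible C (assumptions of this sketch).
    """
    d = mu.shape[-1]
    # trace(C diag(sigma^2) C^T) = sum_j sigma_j^2 * sum_i C_ij^2
    trace = (sigma ** 2) @ (C ** 2).sum(axis=0)
    # log det(C diag(sigma^2) C^T) = 2 log|det C| + 2 sum_j log sigma_j
    _, logabsdet_C = np.linalg.slogdet(C)
    logdet = 2.0 * logabsdet_C + 2.0 * np.log(sigma).sum(axis=-1)
    return 0.5 * (trace + (mu ** 2).sum(axis=-1) - d - logdet)

# Usage with illustrative values only.
rng = np.random.default_rng(0)
C = np.array([[1.0, 0.5, 0.0],
              [0.0, 1.0, 0.3],
              [0.0, 0.0, 1.0]])
mu = np.array([[0.2, -0.1, 0.4]])
sigma = np.array([[1.0, 0.5, 2.0]])
z = sample_posterior(mu, sigma, C, rng)
kl = kl_to_standard_normal(mu, sigma, C)
```

With C set to the identity matrix, the KL term reduces to the familiar diagonal-Gaussian expression, which is one way to sanity-check such an implementation.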

Files

Original bundle

Name: Multivariate_Variational_Autoencoder.pdf
Size: 7.95 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.17 KB
Format: Item-specific license agreed upon to submission

Collection

Article Collection | Department of Computer Engineering
WoS Indexed Publications Collection