An analysis on dimensionality and architecture on generative models
Date
2023
Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2023.
Abstract
Deep generative models are a powerful class of machine learning models. However, training them requires a significant amount of computing power and technical knowledge, and even hyperparameter search carries a high computational cost. Moreover, methods for evaluating generative models are still an active area of research, and owing to the lack of a robust and consistent metric, comparisons between generative model architectures and algorithms remain limited. In this study, we compare two types of generative model architectures, Generative Adversarial Networks (GANs) and Real-valued Non-Volume-Preserving (NVP) flows, on synthetic datasets as well as the well-known MNIST image dataset. We evaluate their ability to capture the data with respect to data dimensionality and variability. We propose a Minimum Description Length (MDL) based metric to examine the effect of model complexity, measured as the model's parameter count, and report estimated Kullback-Leibler (KL) divergence results alongside the proposed MDL-based metric. Our findings indicate that NVP models can encode more data variability while utilizing fewer parameters than GANs on lower-dimensional datasets. The proposed MDL-based metric facilitates selecting a suitable architecture, in terms of model complexity, for a given dataset considering its variability and dimensionality.

Keywords: Generative Models, Generative Adversarial Networks, RealNVP, Deep Learning.
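The abstract does not spell out the exact form of the proposed metric, but a generic two-part MDL score combines a data-fit term with a model-complexity term. As an illustrative sketch only (the function name, the use of negative log-likelihood in nats, and the BIC-style 0.5·k·log(n) penalty are assumptions, not the thesis's actual formulation):

```python
import math

def mdl_style_score(nll_nats: float, num_params: int, num_samples: int) -> float:
    """Hedged sketch of a two-part MDL-style score.

    Total description length = L(data | model) + L(model), where:
      - L(data | model) is approximated by the model's negative
        log-likelihood of the dataset (in nats), and
      - L(model) is approximated by a BIC-style parameter penalty,
        0.5 * num_params * log(num_samples).
    Lower scores indicate a better complexity/fit trade-off.
    """
    model_cost = 0.5 * num_params * math.log(num_samples)
    return nll_nats + model_cost

# Hypothetical comparison: a smaller model (e.g., an NVP flow) with a
# similar fit can win on total description length.
small_model = mdl_style_score(nll_nats=1000.0, num_params=10_000, num_samples=60_000)
large_model = mdl_style_score(nll_nats=980.0, num_params=500_000, num_samples=60_000)
```

Under this sketch, a model with far fewer parameters can achieve a lower total score even with a slightly worse likelihood, which mirrors the abstract's point about parameter-efficient architectures.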