Researchers are showing great interest in Generative Adversarial Networks (GANs), which use deep learning techniques to mimic the content of datasets and are particularly adept at data generation. Despite their impressive performance, there is uncertainty about how GANs precisely map latent space vectors to realistic images and how the chosen dimensionality of the latent space affects the quality of the generated images. In this paper, we explored the potential of generative models in generating animal face images. For this purpose, we used the Deep Convolutional Generative Adversarial Network (DCGAN) model as a reference. To analyze the impact of selected latent space vectors, we synthesized animal face images by training data representations in the DCGAN model with the well-known AFHQ dataset from the literature. We compared the quantitative evaluation of the produced images using Fréchet Inception Distance (FID) and Inception Score (IS). As a result, we demonstrated that generative models can produce images with latent sizes significantly smaller and larger than the standard size of 100.
Researchers have shown great interest in Generative Adversarial Networks (GANs), which utilize deep learning techniques to mimic the content of datasets and particularly excel in data generation. Despite their impressive performance, there remains uncertainty about how GANs precisely map latent space vectors to realistic images and how the chosen dimensionality of the latent space affects the quality of the generated images. In this study, we analyze the potential of learned data representations to generate different animal face images and examine the impact of the selected latent space dimension on the synthesized image quality using a Deep Convolutional GAN (DCGAN). For quantitative evaluation of the generated synthetic images, we employ metrics such as Fr´echet Inception Distance (FID) and Inception Score (IS). In addition to quantitative assessment results, we also utilize qualitative evaluation methods to assess whether overfitting is present and to form intuitive perception about data samples extracted from the model and the ability to disseminate. Finally, we compare and evaluate the results of generative outputs by training the DCGAN on well-known AFHQ Cat, AFHQ Dog, and AFHQ Wild Animals datasets, measuring the impact of latent space dimensions and image feature quality through a comparative analysis.
Birincil Dil | İngilizce |
---|---|
Konular | Görüntü İşleme, Derin Öğrenme |
Bölüm | Makaleler |
Yazarlar | |
Erken Görünüm Tarihi | 29 Mart 2024 |
Yayımlanma Tarihi | 29 Mart 2024 |
Gönderilme Tarihi | 21 Kasım 2023 |
Kabul Tarihi | 19 Mart 2024 |
Yayımlandığı Sayı | Yıl 2024 |