Why are generative adversarial networks so fascinating and annoying?

Files

hdl_130210.pdf (8.78 MB)
  (Accepted version)

Date

2020

Authors

Faria, F.A.
Carneiro, G.

Editors

Advisors

Journal Title

Journal ISSN

Volume Title

Type:

Conference paper

Citation

Brazilian Symposium of Computer Graphic and Image Processing, 2020, pp.1-8

Statement of Responsibility

Fabio Augusto Faria, Gustavo Carneiro

Conference Name

Conference on Computer Graphics and Image Processing (SIBGRAPI) (7 Nov 2020 - 10 Nov 2020 : Virtual online)

Abstract

This paper focuses on one of the most fascinating and successful, but challenging generative models in the literature: the Generative Adversarial Networks (GAN). Recently, GAN has attracted much attention by the scientific community and the entertainment industry due to its effectiveness in generating complex and high-dimension data, which makes it a superior model for producing new samples, compared with other types of generative models. The traditional GAN (referred to as the Vanilla GAN) is composed of two neural networks, a generator and a discriminator, which are modeled using a minimax optimization. The generator creates samples to fool the discriminator that in turn tries to distinguish between the original and created samples. This optimization aims to train a model that can generate samples from the training set distribution. In addition to defining and explaining the Vanilla GAN and its main variations (e.g., DCGAN, WGAN, and SAGAN), this paper will present several applications that make GAN an extremely exciting method for the entertainment industry (e.g., style-transfer and image-to-image translation). Finally, the following measures to assess the quality of generated images are presented: Inception Search (IS), and Frechet Inception Distance (FID).

School/Discipline

Dissertation Note

Provenance

Description

Access Status

Rights

©2020 IEEE

License

Grant ID

Call number

Persistent link to this record