Disentangling Image Features using Laplacian Pyramid in Hierarchical Vector Quantized-Variational Autoencoder Engineering Pohang University of Science and Technology
- Title
- Disentangling Image Features using Laplacian Pyramid in Hierarchical Vector Quantized-Variational Autoencoder Engineering Pohang University of Science and Technology
- Authors
- 김현성
- Date Issued
- 2024
- Publisher
- 포항공과대학교
- Abstract
- We present a novel approach for high-resolution image generation within the framework of Vector Quantized Variational Autoencoder (VQ-VAE). Building upon the advancements of VQ-VAE-2, which introduces a hierarchical architecture design to address challenges such as the need for larger codebooks and feature disentanglement, we extend this approach by incorporating Laplacian pyramid-based image decomposition. This extension allows for frequency-specific supervision, resulting in a more faithful representation of the underlying data and higher-quality generated images. Our experimental results confirm the superiority of our approach in terms of feature disentanglement and output fidelity, establishing it as a promising avenue for achieving high-quality image generation.
- URI
- http://postech.dcollection.net/common/orgView/200000736299
https://oasis.postech.ac.kr/handle/2014.oak/123305
- Article Type
- Thesis
- Files in This Item:
- There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.