Open Access System for Information Sharing

Login Library

 

Thesis
Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Disentangling Image Features using Laplacian Pyramid in Hierarchical Vector Quantized-Variational Autoencoder Engineering Pohang University of Science and Technology

Title
Disentangling Image Features using Laplacian Pyramid in Hierarchical Vector Quantized-Variational Autoencoder Engineering Pohang University of Science and Technology
Authors
김현성
Date Issued
2024
Publisher
포항공과대학교
Abstract
We present a novel approach for high-resolution image generation within the framework of Vector Quantized Variational Autoencoder (VQ-VAE). Building upon the advancements of VQ-VAE-2, which introduces a hierarchical architecture design to address challenges such as the need for larger codebooks and feature disentanglement, we extend this approach by incorporating Laplacian pyramid-based image decomposition. This extension allows for frequency-specific supervision, resulting in a more faithful representation of the underlying data and higher-quality generated images. Our experimental results confirm the superiority of our approach in terms of feature disentanglement and output fidelity, establishing it as a promising avenue for achieving high-quality image generation.
URI
http://postech.dcollection.net/common/orgView/200000736299
https://oasis.postech.ac.kr/handle/2014.oak/123305
Article Type
Thesis
Files in This Item:
There are no files associated with this item.

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Views & Downloads

Browse