Dual stage semantic information based generative adversarial network for image super-resolution

ORCID

Vivek Singh: 0000-0003-1728-1198

Abstract

Deep learning has revolutionized image super-resolution, yet challenges persist in preserving intricate details and avoiding overly smooth reconstructions. In this work, we introduce a novel architecture, the Residue and Semantic Feature-based Dual Subpixel Generative Adversarial Network (RSF-DSGAN), which emphasizes the critical role of semantic information in addressing these issues. The proposed generator architecture is designed with two sequential stages: the Premier Residual Stage and the Deuxième Residual Stage. These stages are concatenated to form a dual-stage upsampling process, substantially augmenting the model’s capacity for feature learning. A central innovation of our approach is the integration of semantic information directly into the generator. Specifically, feature maps derived from a pre-trained network are fused with the primary feature maps of the first stage, enriching the generator with high-level contextual cues. This semantic infusion enhances the fidelity and sharpness of reconstructed images, particularly in preserving object details and textures. Inter- and intra-residual connections are employed within these stages to maintain high-frequency details and fine textures. Additionally, spectral normalization is introduced in the discriminator to stabilize training. Comprehensive evaluations, including visual perception and mean opinion scores, demonstrate that RSF-DSGAN, with its emphasis on semantic information, outperforms current state-of-the-art super-resolution methods.

DOI Link

10.1016/j.cviu.2024.104226

Publication Date

2024-11-11

Publication Title

Computer Vision and Image Understanding

Volume

250

ISSN

1077-3142

Acceptance Date

2024-11-06

Deposit Date

2026-06-11

Embargo Period

2025-11-11

Additional Links

https://www.sciencedirect.com/science/article/pii/S1077314224003072, https://www.scopus.com/pages/publications/85208555778

Keywords

Super-resolution, Convolutional Neural Networks, Generative Adversarial Networks, Residual learning, Spectral normalization

Recommended Citation

Sharma, S., Dhall, A., Johri, S., Kumar, V., & Singh, V. (2024) 'Dual stage semantic information based generative adversarial network for image super-resolution', Computer Vision and Image Understanding, 250. Available at: 10.1016/j.cviu.2024.104226

School of Engineering, Computing and Mathematics

Dual stage semantic information based generative adversarial network for image super-resolution

ORCID

Abstract

DOI Link

Publication Date

Publication Title

Volume

ISSN

Acceptance Date

Deposit Date

Embargo Period

Additional Links

Keywords

Recommended Citation

Search

Browse

About

Links

School of Engineering, Computing and Mathematics

Dual stage semantic information based generative adversarial network for image super-resolution

Authors

ORCID

Abstract

DOI Link

Publication Date

Publication Title

Volume

ISSN

Acceptance Date

Deposit Date

Embargo Period

Additional Links

Keywords

Recommended Citation

Share

Search

Browse

About

Links