Repository logo
 

HFS-HNeRV: High-Frequency Spectrum Hybrid Neural Representation for Videos

aut.relation.conferenceMMAsia '24: ACM Multimedia Asia
aut.relation.endpage7
aut.relation.startpage1
dc.contributor.authorZhao, Jianhua
dc.contributor.authorLi, Xue Jun
dc.contributor.authorChong, Peter Han Joo
dc.contributor.editorWang, R
dc.contributor.editorWang, Z
dc.contributor.editorLiu, J
dc.contributor.editordel Bimbo, A
dc.contributor.editorZhou, J
dc.contributor.editorBasu, A
dc.contributor.editorXu, M
dc.date.accessioned2025-02-09T20:34:49Z
dc.date.available2025-02-09T20:34:49Z
dc.date.issued2024-12-28
dc.description.abstractImplicit neural representations have recently demonstrated considerable potential in various applications, including video compression and reconstruction, owing to their rapid decoding speed and high adaptability. Based on the most advanced neural representation for Videos (NeRV), Expedite Neural Representation for Videos (E-NeRV) and Hybrid Neural Representation for Videos (H-NeRV) primarily boost performance by enhancing and broadening the NeRV network’s embedded input, whereas the NeRV module—the central component involved in video reconstruction—has attracted less attention. With a focus on high-frequency data in the frequency domain, this paper proposes a High-frequency Spectrum Hybrid Network (HFS-HNeRV), which adopts effective high-frequency data from the frequency domain to generate image details. Its core, HFS-HNeRV block, is a novel NeRV module, which adds the high-frequency spectrum convolution module (HFSCM) to the original one. This module extracts and emphasizes high-frequency features through the frequency domain attention mechanism, which not only provides superior performance, but also enhances the local detail recovery in video images. As an upgrade of the NeRV module, it has exceptional performance in terms of adaptability and versatility. It can conveniently substitute in a variety of current NeRV designs without requiring significant alterations to attain enhanced performance. Furthermore, this paper also introduces the High-frequency Spectrum (HFS) loss function to further mitigate the blurriness issue caused by the loss of high-frequency information during image generation. In the video compression task, the proposed HFS-HNeRV network outperformed NeRV, E-NeRV and HNeRV with an improvement of +5.68 dB, +4.46 dB, and +0.98 dB in reconstruction quality (PSNR), respectively.
dc.identifier.citationJianhua Zhao, Xue Jun Li, and Peter Han Joo Chong. 2024. HFS-HNeRV: High-Frequency Spectrum Hybrid Neural Representation for Videos. In Proceedings of the 6th ACM International Conference on Multimedia in Asia (MMAsia '24). Association for Computing Machinery, New York, NY, USA, Article 88, 1–7. https://doi.org/10.1145/3696409.3700250
dc.identifier.doi10.1145/3696409.3700250
dc.identifier.isbn9798400712739
dc.identifier.urihttp://hdl.handle.net/10292/18618
dc.publisherACM
dc.relation.urihttps://dl.acm.org/doi/10.1145/3696409.3700250
dc.rightsCopyright © 2024 Copyright held by the owner/author(s). This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike International 4.0 License.
dc.rights.accessrightsOpenAccess
dc.rights.urihttps://creativecommons.org/licenses/by-nc-sa/4.0/
dc.subject46 Information and Computing Sciences
dc.subject40 Engineering
dc.subject4008 Electrical Engineering
dc.subject4603 Computer Vision and Multimedia Computation
dc.titleHFS-HNeRV: High-Frequency Spectrum Hybrid Neural Representation for Videos
dc.typeConference Contribution
pubs.elements-id584249

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
HFS-HNeRV High-frequency spectrum hybrid neural representation for videos.pdf
Size:
13.78 MB
Format:
Adobe Portable Document Format
Description:
Conference contribution