Repository logo
 

Video Compression Using Hybrid Neural Representation with High-Frequency Spectrum Analysis

aut.relation.endpage2574
aut.relation.issue13
aut.relation.journalElectronics
aut.relation.startpage2574
aut.relation.volume14
dc.contributor.authorZhao, Jian Hua
dc.contributor.authorLi, Xue Jun
dc.contributor.authorChong, Peter Han Joo
dc.date.accessioned2025-06-30T20:35:58Z
dc.date.available2025-06-30T20:35:58Z
dc.date.issued2025-06-26
dc.description.abstractRecent advancements in implicit neural representations have shown substantial promise in various domains, particularly in video compression and reconstruction, due to their rapid decoding speed and high adaptability. Building upon the state-of-the-art Neural Representations for Videos, the Expedite Neural Representation for Videos and Hybrid Neural Representation for Videos primarily enhance performance by optimizing and expanding the embedded input of the Neural Representations for Videos network. However, the core module in Neural Representations for Videos network, responsible for video reconstruction, has garnered comparatively less attention. This paper introduces a novel High-frequency Spectrum Hybrid Network, which leverages high-frequency information from the frequency domain to generate detailed image reconstructions. The central component of this approach is the High-frequency Spectrum Hybrid Network block, an innovative extension of the module in Neural Representations for Videos network, which integrates the High-frequency Spectrum Convolution Module into the original framework. The high-frequency spectrum convolution module emphasizes the extraction of high-frequency features through a frequency domain attention mechanism, significantly enhancing both performance and the recovery of local details in video images. As an enhanced module in the Neural Representations for Videos network, it demonstrates exceptional adaptability and versatility, enabling seamless integration into a wide range of existing Neural Representations for Videos network architectures without requiring substantial modifications to achieve improved results. In addition, this work introduces the High-frequency Spectrum loss function and the Multi-scale Feature Reuse Path to further mitigate the issue of blurriness caused by the loss of high-frequency details during image generation. Experimental evaluations confirm that the proposed High-frequency Spectrum Hybrid Network surpasses the performance of the Neural Representations for Videos, the Expedite Neural Representation for Videos, and the Hybrid Neural Representation for Videos, achieving improvements of +5.75 dB, +4.53 dB, and +1.05 dB in peak signal-to-noise ratio, respectively.
dc.identifier.citationElectronics, ISSN: 2079-9292 (Online), MDPI AG, 14(13), 2574-2574. doi: 10.3390/electronics14132574
dc.identifier.doi10.3390/electronics14132574
dc.identifier.issn2079-9292
dc.identifier.urihttp://hdl.handle.net/10292/19416
dc.languageen
dc.publisherMDPI AG
dc.relation.urihttps://www.mdpi.com/2079-9292/14/13/2574
dc.rights© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
dc.rights.accessrightsOpenAccess
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subject0906 Electrical and Electronic Engineering
dc.subject4009 Electronics, sensors and digital hardware
dc.titleVideo Compression Using Hybrid Neural Representation with High-Frequency Spectrum Analysis
dc.typeJournal Article
pubs.elements-id613471

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
electronics-14-02574-published-version.pdf
Size:
16.06 MB
Format:
Adobe Portable Document Format
Description:
Journal article