Repository logo
 

Sliding-Window Dissimilarity Cross-Attention for Near-Real-Time Building Change Detection

aut.relation.endpage135
aut.relation.issue1
aut.relation.journalRemote Sensing
aut.relation.startpage135
aut.relation.volume17
dc.contributor.authorLu, Wen
dc.contributor.authorNguyen, Minh
dc.date.accessioned2025-01-07T21:37:44Z
dc.date.available2025-01-07T21:37:44Z
dc.date.issued2025-01-02
dc.description.abstractA near-real-time change detection network can consistently identify unauthorized construction activities over a wide area, empowering authorities to enforce regulations efficiently. Furthermore, it can promptly assess building damage, enabling expedited rescue efforts. The extensive adoption of deep learning in change detection has prompted a predominant emphasis on enhancing detection performance, primarily through the expansion of the depth and width of networks, overlooking considerations regarding inference time and computational cost. To accurately represent the spatio-temporal semantic correlations between pre-change and post-change images, we create an innovative transformer attention mechanism named Sliding-Window Dissimilarity Cross-Attention (SWDCA), which detects spatio-temporal semantic discrepancies by explicitly modeling the dissimilarity of bi-temporal tokens, departing from the mono-temporal similarity attention typically used in conventional transformers. In order to fulfill the near-real-time requirement, SWDCA employs a sliding-window scheme to limit the range of the cross-attention mechanism within a predetermined window/dilated window size. This approach not only excludes distant and irrelevant information but also reduces computational cost. Furthermore, we develop a lightweight Siamese backbone for extracting building and environmental features. Subsequently, we integrate an SWDCA module into this backbone, forming an efficient change detection network. Quantitative evaluations and visual analyses of thorough experiments verify that our method achieves top-tier accuracy on two building change detection datasets of remote sensing imagery, while also achieving a real-time inference speed of 33.2 FPS on a mobile GPU.
dc.identifier.citationRemote Sensing, ISSN: 2072-4292 (Online), MDPI AG, 17(1), 135-135. doi: 10.3390/rs17010135
dc.identifier.doi10.3390/rs17010135
dc.identifier.issn2072-4292
dc.identifier.urihttp://hdl.handle.net/10292/18482
dc.languageen
dc.publisherMDPI AG
dc.relation.urihttps://www.mdpi.com/2072-4292/17/1/135
dc.rights© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
dc.rights.accessrightsOpenAccess
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subject0203 Classical Physics
dc.subject0406 Physical Geography and Environmental Geoscience
dc.subject0909 Geomatic Engineering
dc.subject3701 Atmospheric sciences
dc.subject3709 Physical geography and environmental geoscience
dc.subject4013 Geomatic engineering
dc.titleSliding-Window Dissimilarity Cross-Attention for Near-Real-Time Building Change Detection
dc.typeJournal Article
pubs.elements-id584236

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
remotesensing-17-00135-v2.pdf
Size:
7.33 MB
Format:
Adobe Portable Document Format
Description:
Journal article