Repository logo
 

Depth2Elevation: Scale Modulation With Depth Anything Model for Single-View Remote Sensing Image Height Estimation

aut.relation.endpage1
aut.relation.issue99
aut.relation.journalIEEE Transactions on Geoscience and Remote Sensing
aut.relation.startpage1
aut.relation.volumePP
dc.contributor.authorHong, Z
dc.contributor.authorWu, T
dc.contributor.authorXu, Z
dc.contributor.authorZhao, W
dc.date.accessioned2025-05-16T00:04:27Z
dc.date.available2025-05-16T00:04:27Z
dc.date.issued2025-04-28
dc.description.abstractAccurate terrain elevation estimation from remote sensing data is essential for a multitude of geographic applications. Specifically, image-based elevation estimation has garnered growing attention due to advancements in optical sensor development and automated analysis algorithms, such as machine learning. In this context, deep learning methods, including Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs), have recently enhanced the feature extraction ability and estimation accuracy of this task. Despite the distinct advantages afforded by each architectural paradigm, current methods are frequently impeded in their ability to discern subtle height variations within complex scenes and are ill-equipped to effectively tackle the extraction of features across both large and small scales. Although vision foundation models have shown significant advances in remote sensing analysis, their effectiveness for height estimation remains unexplored. In this study, we introduce the foundation model in the field of elevation estimation and propose a novel Depth to Elevation (Depth2Elevation) model, marking the first application of the Depth Anything Model (DAM) to height estimation in remote sensing images. First, we introduce the scale modulator for modulating partial encoders in the original DAM, which enables DAM to capture subtle representations of localized objects at different scales. Secondly, we further enhance the model’s representational capability by using a resolution-agnostic decoder architecture, which enables DAM to learn features at different spatial scales efficiently. We conducted comprehensive experiments on several benchmark datasets. Compared to strong baselines, our method achieves an average relative improvement of at most 42% on the latest large-scale benchmark dataset GAMUS and shows the best generalization ability across different scenarios.
dc.identifier.citationIEEE Transactions on Geoscience and Remote Sensing, ISSN: 0196-2892 (Print); 1558-0644 (Online), Institute of Electrical and Electronics Engineers (IEEE), PP(99), 1-1. doi: 10.1109/TGRS.2025.3564820
dc.identifier.doi10.1109/TGRS.2025.3564820
dc.identifier.issn0196-2892
dc.identifier.issn1558-0644
dc.identifier.urihttp://hdl.handle.net/10292/19207
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.relation.urihttps://ieeexplore.ieee.org/document/10978076
dc.rightsThis article has been accepted for publication in IEEE Transactions on Geoscience and Remote Sensing. This is the author's version which has not been fully edited and content may change prior to final publication. © 2025 IEEE. All rights reserved, including rights for text and data mining, and training of artificial intelligence and similar technologies. Personal use is permitted, but republication/redistribution requires IEEE permission. See https://www.ieee.org/publications/rights/index.html for more information
dc.rights.accessrightsOpenAccess
dc.subject37 Earth Sciences
dc.subject40 Engineering
dc.subjectNetworking and Information Technology R&D (NITRD)
dc.subjectMachine Learning and Artificial Intelligence
dc.subjectBioengineering
dc.subjectGeneric health relevance
dc.subject0404 Geophysics
dc.subject0906 Electrical and Electronic Engineering
dc.subject0909 Geomatic Engineering
dc.subjectGeological & Geomatics Engineering
dc.subject37 Earth sciences
dc.subject40 Engineering
dc.titleDepth2Elevation: Scale Modulation With Depth Anything Model for Single-View Remote Sensing Image Height Estimation
dc.typeJournal Article
pubs.elements-id604745

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Depth2Elevation_Scale_Modulation_with_Depth_Anything_Model_for_Single-view_Remote_Sensing_Image_Height_Estimation.pdf
Size:
15 MB
Format:
Adobe Portable Document Format
Description:
Author's Accepted Manuscript