Repository logo
 

Frequency-aware Spatio-temporal Topology Learning for Skeleton-based Human Activity Recognition

aut.relation.articlenumber113146
aut.relation.endpage113146
aut.relation.journalPattern Recognition
aut.relation.startpage113146
aut.relation.volume175
dc.contributor.authorXia, Y
dc.contributor.authorYongchareon, S
dc.contributor.authorLutui, R
dc.contributor.authorSheng, QZ
dc.date.accessioned2026-02-04T22:12:43Z
dc.date.available2026-02-04T22:12:43Z
dc.date.issued2026-07-01
dc.description.abstractSkeleton-based human activity recognition (HAR) has made significant progress through graph convolutional networks (GCNs) and Transformer architectures for spatiotemporal modeling. However, existing methods either employ predefined static graph topologies that cannot adapt to heterogeneous skeleton data or learn dynamic topologies based solely on local spatiotemporal features, thereby overlooking the global temporal frequency features of joint movements that are important for discovering semantically meaningful spatial relationships. We propose Frequency-Aware Topology Learning Graph Convolutional Network (FATL-GCN), a novel architecture that integrates frequency-aware temporal context to guide adaptive learning of spatial topology. Our approach leverages Time-to-Vector linear frequency encoding to capture both periodic and non-periodic motion patterns, employs frequency-guided topology learning to generate action-specific graphs through temporal-context-driven attention, and incorporates hierarchical multi-scale fusion for robust feature extraction across scales. Extensive experiments achieved top-1 accuracies of 93.8% (cross-subject) and 97.5% (cross-view) on NTU-60, 91.9% (cross-subject) and 93.1% (cross-setup) on NTU-120, and 51.7% on Kinetics-Skeleton. Ablation studies confirm the critical role of our components, with removing the dynamic graph topology causing a 3.5% accuracy drop and removing frequency-aware encoding causing a 2.1% drop.
dc.identifier.citationPattern Recognition, ISSN: 0031-3203 (Print), Elsevier BV, 175, 113146-113146. doi: 10.1016/j.patcog.2026.113146
dc.identifier.doi10.1016/j.patcog.2026.113146
dc.identifier.issn0031-3203
dc.identifier.urihttp://hdl.handle.net/10292/20590
dc.languageen
dc.publisherElsevier BV
dc.relation.urihttps://www.sciencedirect.com/science/article/pii/S0031320326001093?via%3Dihub
dc.rights© 2026 The Author(s). Published by Elsevier Ltd. Creative Commons. This is an open access article distributed under the terms of the Creative Commons CC-BY license, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. You are not required to obtain permission to reuse this article.
dc.rights.accessrightsOpenAccess
dc.subject46 Information and Computing Sciences
dc.subject4607 Graphics, Augmented Reality and Games
dc.subjectBioengineering
dc.subjectNetworking and Information Technology R&D (NITRD)
dc.subject0801 Artificial Intelligence and Image Processing
dc.subject0806 Information Systems
dc.subject0906 Electrical and Electronic Engineering
dc.subjectArtificial Intelligence & Image Processing
dc.subject4603 Computer vision and multimedia computation
dc.subject4605 Data management and data science
dc.subject4611 Machine learning
dc.titleFrequency-aware Spatio-temporal Topology Learning for Skeleton-based Human Activity Recognition
dc.typeJournal Article
pubs.elements-id752981

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Frequency-aware spatio-temporal topology learning for skeleton-based human activity recognition.pdf
Size:
4.55 MB
Format:
Adobe Portable Document Format
Description:
Journal article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.37 KB
Format:
Plain Text
Description: