Deep Learning Methods for Human Behavior Recognition

aut.embargoNoen_NZ
aut.thirdpc.containsNoen_NZ
dc.contributor.advisorYan, Wei Qi
dc.contributor.advisorNguyen, Minh
dc.contributor.authorLu, Jia
dc.date.accessioned2021-03-17T20:54:35Z
dc.date.available2021-03-17T20:54:35Z
dc.date.copyright2021
dc.date.issued2021
dc.date.updated2021-03-17T05:35:36Z
dc.description.abstractWith the decreased costs of security monitoring equipment such as cameras, video surveillance has been broadly applied to our communities and public places. However, at present most of the surveillance systems acquire anomalies and visual evidences only through video playback. Hence, it is necessary to develop the methods of real-time human behavior recognition so as to reduce security staff’s workload and improve work efficiency. The existing work needs feature extraction from the video frames to detect human body and achieve human behavior recognition. In this thesis, our focus is on the state-of-the-art methods for human behavior recognition based on deep learning. Since deep learning methods have been well investigated in the past decades, as an end-to-end computational method, it simplifies feature extraction as the operations in a black box. In this thesis, we explore and exploit the state-of-the-art methods, which are utilized for human behavior recognition. More importantly, in order to attain our goal, spatiotemporal information was collected and employed to the implementation of our research project. We firstly adopted ensemble learning with deep learning methods. We proposed Selective Kernel Network (SKNet) and ResNeXt with attention mechanism, which generate positive results to recognize human behaviours. The contributions of this thesis are: (1) The ResNeXt and SKNet with attention mechanism make the best accuracy of overall human behavior recognition at the rate up to 98.7% based on public datasets; (2) The YOLOv3 + LSTM network to reply on both spatiotemporal information with class score fusion is able to achieve 97.58% accuracy based on our dataset for sign language processing.en_NZ
dc.identifier.urihttps://hdl.handle.net/10292/14058
dc.language.isoenen_NZ
dc.publisherAuckland University of Technology
dc.rights.accessrightsOpenAccess
dc.subjectDeep learning (DL)en_NZ
dc.subjectConvolutional neural network (CNN)en_NZ
dc.subjectLong short-term memory (LSTM)en_NZ
dc.subjectYou Only Look Once (YOLO)en_NZ
dc.subjectEnsemble learningen_NZ
dc.subjectSelective kernel network (SKNet)en_NZ
dc.subjectAttention mechanismen_NZ
dc.titleDeep Learning Methods for Human Behavior Recognitionen_NZ
dc.typeThesisen_NZ
thesis.degree.grantorAuckland University of Technology
thesis.degree.levelDoctoral Theses
thesis.degree.nameDoctor of Philosophyen_NZ
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
JiaL2.pdf
Size:
3.5 MB
Format:
Adobe Portable Document Format
Description:
Thesis
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
889 B
Format:
Item-specific license agreed upon to submission
Description:
Collections