Deep Learning Methods for Human Behavior Recognition

Lu, Jia

Deep Learning Methods for Human Behavior Recognition

aut.embargo	No	en_NZ
aut.thirdpc.contains	No	en_NZ
dc.contributor.advisor	Yan, Wei Qi
dc.contributor.advisor	Nguyen, Minh
dc.contributor.author	Lu, Jia
dc.date.accessioned	2021-03-17T20:54:35Z
dc.date.available	2021-03-17T20:54:35Z
dc.date.copyright	2021
dc.date.issued	2021
dc.date.updated	2021-03-17T05:35:36Z
dc.description.abstract	With the decreased costs of security monitoring equipment such as cameras, video surveillance has been broadly applied to our communities and public places. However, at present most of the surveillance systems acquire anomalies and visual evidences only through video playback. Hence, it is necessary to develop the methods of real-time human behavior recognition so as to reduce security staff’s workload and improve work efficiency. The existing work needs feature extraction from the video frames to detect human body and achieve human behavior recognition. In this thesis, our focus is on the state-of-the-art methods for human behavior recognition based on deep learning. Since deep learning methods have been well investigated in the past decades, as an end-to-end computational method, it simplifies feature extraction as the operations in a black box. In this thesis, we explore and exploit the state-of-the-art methods, which are utilized for human behavior recognition. More importantly, in order to attain our goal, spatiotemporal information was collected and employed to the implementation of our research project. We firstly adopted ensemble learning with deep learning methods. We proposed Selective Kernel Network (SKNet) and ResNeXt with attention mechanism, which generate positive results to recognize human behaviours. The contributions of this thesis are: (1) The ResNeXt and SKNet with attention mechanism make the best accuracy of overall human behavior recognition at the rate up to 98.7% based on public datasets; (2) The YOLOv3 + LSTM network to reply on both spatiotemporal information with class score fusion is able to achieve 97.58% accuracy based on our dataset for sign language processing.	en_NZ
dc.identifier.uri	https://hdl.handle.net/10292/14058
dc.language.iso	en	en_NZ
dc.publisher	Auckland University of Technology
dc.rights.accessrights	OpenAccess
dc.subject	Deep learning (DL)	en_NZ
dc.subject	Convolutional neural network (CNN)	en_NZ
dc.subject	Long short-term memory (LSTM)	en_NZ
dc.subject	You Only Look Once (YOLO)	en_NZ
dc.subject	Ensemble learning	en_NZ
dc.subject	Selective kernel network (SKNet)	en_NZ
dc.subject	Attention mechanism	en_NZ
dc.title	Deep Learning Methods for Human Behavior Recognition	en_NZ
dc.type	Thesis	en_NZ
thesis.degree.grantor	Auckland University of Technology
thesis.degree.level	Doctoral Theses
thesis.degree.name	Doctor of Philosophy	en_NZ

Files

Original bundle

Now showing 1 - 1 of 1

Name:: JiaL2.pdf
Size:: 3.5 MB
Format:: Adobe Portable Document Format
Description:: Thesis

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 889 B
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Doctoral Theses