SARM: Synthetic Data Annotation for Enhancing the Experiences of Augmented Reality Application Based on Machine Learning

aut.author.twitter@https://www.linkedin.com/in/robertle1991/
aut.embargoNoen_NZ
aut.thirdpc.containsNoen_NZ
dc.contributor.advisorNguyen, Minh
dc.contributor.advisorYan, Wei Qi
dc.contributor.authorLe, Huy Khac Anh
dc.date.accessioned2022-06-02T01:27:24Z
dc.date.available2022-06-02T01:27:24Z
dc.date.copyright2022
dc.date.issued2022
dc.date.updated2022-06-01T08:20:38Z
dc.description.abstractBackground and Objective: Augmented Reality is one of the fastest-growing fields, increasing funding for the last few years, as people realise the potential benefits of rendering virtual information in the real world. As the equipment gets more commercialised, the cost would get lowered while the performance also goes up. However, most of today’s Augmented Reality marker-based applications would use local features detection and tracking techniques. The disadvantages of applying these techniques are that the markers must be modified to match the unique classified algorithms or suffer from lower detection accuracy. Machine learning is a perfect solution to overcome the current drawbacks of image processing in Augmented Reality applications. Methods: This thesis is split into two investigation directions. The first investigation is to implement new Augmented Reality markers with concealed information such as bar-code or quick response code while keeping most of the visual information of the original texture. The second investigation demonstrates the Augmented Reality marker without using any embedded codes and original texture modification required by immersing the machine learning technology into the marker detection process. The new approach incorporated Machine Learning using deep neural networks to detect and track the Augmented Reality application’s marker targets. The research implemented the auto-generated dataset tool, which uses for the Machine Learning dataset preparation step. The final iOS prototype application was developed to incorporate object detection, object tracking and Augmented Reality. The Machine Learning model was taught to recognise the differences between targets using YOLO’s most famous object detection methods. The model was trained by either Pytorch, and the final product uses a valuable toolkit for developing the Augmented Reality application called ARKit. Results: Several different experimental exercises have been conducted to qualify the proposed methods on technical performances. The experimental outcomes indicated that the object detection model could achieve over 80% precision, over 90% recall, and over 70% mean average precision using proposed synthetic datasets. The proposed method significantly improves object detection accuracy where it could achieve at least 18% higher than the real-world dataset. The iOS prototype can detect the target markers and display the augmented objects under different lighting conditions at an average rate of 50 frames per second.en_NZ
dc.identifier.urihttps://hdl.handle.net/10292/15183
dc.language.isoenen_NZ
dc.publisherAuckland University of Technology
dc.rights.accessrightsOpenAccess
dc.titleSARM: Synthetic Data Annotation for Enhancing the Experiences of Augmented Reality Application Based on Machine Learningen_NZ
dc.typeThesisen_NZ
thesis.degree.grantorAuckland University of Technology
thesis.degree.levelDoctoral Theses
thesis.degree.nameDoctor of Philosophyen_NZ
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
LeH.pdf
Size:
46.1 MB
Format:
Adobe Portable Document Format
Description:
Thesis
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
889 B
Format:
Item-specific license agreed upon to submission
Description:
Collections