Open Research

Permanent link for this community

https://hdl.handle.net/10292/549

About

Tuwhera Open Access Research Outputs provides free access to full texts of scholarly works from AUT's Schools, Research Institutes and Centres.

AUT's research is built on a foundation of innovation and excellence, with the aim that its discoveries and applications are shared in ways that enhance wellbeing and prosperity.

Adding your outputs

AUT staff research outputs are added to this collection via Research Elements. All items submitted to this collection are checked to ensure material does not breach publisher copyright and is suitable for archiving prior to being made open access.

Find out more about making your work open access

For help with Research Elements contact the Research and Innovation Office.

Browse

Now showing 1 - 20 of 32

A Metasurface-Based LTC Polarization Converter With S-shaped Split Ring Resonator Structure for Flexible Applications
(MDPI AG, 2023-07-10) Li, Erfeng; Li, Xue Jun; Seet, Boon-Chong; Ghaffar, Adnan; Aneja, Aayush
This paper presents a metasurface-based linear-to-circular polarization converter with a flexible structure for conformal and wearable applications. The converter consists of nested S- and C-shaped split ring resonators in the unit cell and can convert linearly polarized incident waves into left-handed circularly polarized ones at 12.4 GHz. Simulation results show that the proposed design has a high polarization conversion rate and efficiency at the operating frequency. Conformal tests are also conducted to evaluate the performance under curvature circumstances. A minor shift in the operating frequency is observed when the converter is applied on a sinusoidal wavy surface.
A Privacy-Preserving Word Embedding Text Classification Model Based on Privacy Boundary Constructed by Deep Belief Network
(Springer Science and Business Media LLC, 2023-09-15) Ma, Bo; Lai, Edmund; Yan, Wei Qi; Wu, Jinsong
To effectively extract and classify the information from reports or documents and protect the privacy of the extracted results, we propose a privacy classification named Word Embedding Combination Privacy-preserving Support Vector Machine (WECPPSVM) model to classify the text. In addition, this paper also proposes the Privacy-preserving Distribution and Independent Frequent Subsequence Extraction Algorithm (PPDIFSEA), which calculates the degree of independence of the training data input to the classification model by training the Deep Belief Network(DBN) in PPDIFSEA, then obtains the Privacy Boundary(PB). PB is an indispensable condition for both data sampling and privacy noise generation. And this model can protect privacy by injecting the privacy noise into the classification result, this method can interfere with the background knowledge-based privacy attack. Our quantitative analysis shows that the WECPPSVM proposed in this paper can approach mainstream text classification algorithms in terms of text classification accuracy while preserving privacy without increasing computational complexity. In addition, the fusion study and privacy threat evaluation also verify that the proposed PPDIFSEA method combined with WECPPSVM achieves an acceptable level of classification accuracy and privacy protection.
A Unified Efficient Deep Learning Architecture for Rapid Safety Objects Classification Using Normalized Quantization-Aware Learning
(MDPI, 2023-11-05) Okeke, Stephen; Nguyen, Minh
The efficient recognition and classification of personal protective equipment are essential for ensuring the safety of personnel in complex industrial settings. Using the existing methods, manually performing macro-level classification and identification of personnel in intricate spheres is tedious, time-consuming, and inefficient. The availability of several artificial intelligence models in recent times presents a new paradigm shift in object classification and tracking in complex settings. In this study, several compact and efficient deep learning model architectures are explored, and a new efficient model is constructed by fusing the learning capabilities of the individual, efficient models for better object feature learning and optimal inferencing. The proposed model ensures rapid identification of personnel in complex working environments for appropriate safety measures. The new model construct follows the contributory learning theory whereby each fussed model brings its learned features that are then combined to obtain a more accurate and rapid model using normalized quantization-aware learning. The major contribution of the work is the introduction of a normalized quantization-aware learning strategy to fuse the features learned by each of the contributing models. During the investigation, a separable convolutional driven model was constructed as a base model, and then the various efficient architectures were combined for the rapid identification and classification of the various hardhat classes used in complex industrial settings. A remarkable rapid classification and accuracy were recorded with the new resultant model.
An Adaptive Traffic-flow Management System with a Cooperative Transitional Maneuver for Vehicular Platoons
(MDPI AG, ) Hota, Lopamudra; Nayak, Biraja Prasad; Sahoo, Bibhudatta; Chong, Peter HJ; Kumar, Arun
Globally, the increases in vehicle numbers, traffic congestion, and road accidents are serious issues. Autonomous vehicles (AVs) traveling in platoons provide innovative solutions for efficient traffic flow management, especially for congestion mitigation, thus reducing accidents. In recent years, platoon-based driving, also known as vehicle platoon, has emerged as an extensive research area. Vehicle platooning reduces travel time and increases road capacity by reducing the safety distance between vehicles. For connected and automated vehicles, cooperative adaptive cruise control (CACC) systems and platoon management systems play a significant role. Platoon vehicles can maintain a closer safety distance due to CACC systems, which are based on vehicle status data obtained through vehicular communications. This paper proposes an adaptive traffic flow and collision avoidance approach for vehicular platoons based on CACC. The proposed approach considers the creation and evolution of platoons to govern the traffic flow during congestion and avoid collisions in uncertain situations. Different obstructing scenarios are identified during travel, and solutions to these challenging situations are proposed. The merge and join maneuvers are performed to help the platoon’s steady movement. The simulation results show a significant improvement in traffic flow due to the mitigation of congestion using platooning, minimizing travel time, and avoiding collisions.
Apple Ripeness Identification from Digital Images Using Transformers
(Springer Science and Business Media LLC, 2023-06-10) Xiao, Bingjie; Nguyen, Minh; Yan, Wei Qi
We describe a non-destructive test of apple ripeness using digital images of multiple types of apples. In this paper, fruit images are treated as data samples, artificial intelligence models are employed to implement the classification of fruits and the identification of maturity levels. In order to obtain the ripeness classifications of fruits, we make use of deep learning models to conduct our experiments; we evaluate the test results of our proposed models. In order to ensure the accuracy of our experimental results, we created our own dataset, and obtained the best accuracy of fruit classification by comparing Transformer model and YOLO model in deep learning, thereby attaining the best accuracy of fruit maturity recognition. At the same time, we also combined YOLO model with attention module and gave the fast object detection by using the improved YOLO model.
Assessing Gait & Balance in Adults with Mild Balance Impairment: G&B App Reliability and Validity
(MDPI, 2023-12-09) Shafi, Hina; Awan, Waqar Ahmed; Olsen, Sharon; Siddiqi, Furqan Ahmed; Tassadaq, Naureen; Rashid, Usman; Niazi, Imran Khan
Smartphone applications (apps) that utilize embedded inertial sensors have the potential to provide valid and reliable estimations of different balance and gait parameters in older adults with mild balance impairment. This study aimed to assess the reliability, validity, and sensitivity of the Gait&Balance smartphone application (G&B App) for measuring gait and balance in a sample of middle- to older-aged adults with mild balance impairment in Pakistan. Community-dwelling adults over 50 years of age (N = 83, 50 female, range 50–75 years) with a Berg Balance Scale (BBS) score between 46/56 and 54/56 were included in the study. Data collection involved securing a smartphone to the participant’s lumbosacral spine. Participants performed six standardized balance tasks, including four quiet stance tasks and two gait tasks (walking looking straight ahead and walking with head turns). The G&B App collected accelerometry data during these tasks, and the tasks were repeated twice to assess test-retest reliability. The tasks in quiet stance were also recorded with a force plate, a gold-standard technology for measuring postural sway. Additionally, participants completed three clinical measures, the BBS, the Functional Reach Test (FRT), and the Timed Up and Go Test (TUG). Test-retest reliability within the same session was determined using intraclass correlation coefficients (ICCs) and the standard error of measurement (SEM). Validity was evaluated by correlating the G&B App outcomes against both the force plate data and the clinical measures using Pearson’s product-moment correlation coefficients. To assess the G&B App’s sensitivity to differences in balance across tasks and repetitions, one-way repeated measures analyses of variance (ANOVAs) were conducted. During quiet stance, the app demonstrated moderate reliability for steadiness on firm (ICC = 0.72) and compliant surfaces (ICC = 0.75) with eyes closed. For gait tasks, the G&B App indicated moderate to excellent reliability when walking looking straight ahead for gait symmetry (ICC = 0.65), walking speed (ICC = 0.93), step length (ICC = 0.94), and step time (ICC = 0.84). The TUG correlated with app measures under both gait conditions for walking speed (r −0.70 and 0.67), step length (r −0.56 and −0.58), and step time (r 0.58 and 0.50). The BBS correlated with app measures of walking speed under both gait conditions (r 0.55 and 0.51) and step length when walking with head turns (r = 0.53). Force plate measures of total distance wandered showed adequate to excellent correlations with G&B App measures of steadiness. Notably, G&B App measures of walking speed, gait symmetry, step length, and step time, were sensitive to detecting differences in performance between standard walking and the more difficult task of walking with head turns. This study demonstrates the G&B App’s potential as a reliable and valid tool for assessing some gait and balance parameters in middle-to-older age adults, with promise for application in low-income countries like Pakistan. The app’s accessibility and accuracy could enhance healthcare services and support preventive measures related to fall risk.
Automated Road Defect and Anomaly Detection for Traffic Safety: A Systematic Review
(MDPI AG, 2023-06-16) Rathee, Munish; Bačić, Boris; Doborjeh, Maryam
Recently, there has been a substantial increase in the development of sensor technology. As enabling factors, computer vision (CV) combined with sensor technology have made progress in applications intended to mitigate high rates of fatalities and the costs of traffic-related injuries. Although past surveys and applications of CV have focused on subareas of road hazards, there is yet to be one comprehensive and evidence-based systematic review that investigates CV applications for Automated Road Defect and Anomaly Detection (ARDAD). To present ARDAD’s state-of-the-art, this systematic review is focused on determining the research gaps, challenges, and future implications from selected papers (N = 116) between 2000 and 2023, relying primarily on Scopus and Litmaps services. The survey presents a selection of artefacts, including the most popular open-access datasets (D = 18), research and technology trends that with reported performance can help accelerate the application of rapidly advancing sensor technology in ARDAD and CV. The produced survey artefacts can assist the scientific community in further improving traffic conditions and safety.
Bio-Inspired Energy-Efficient Cluster-Based Routing Protocol for the IoT in Disaster Scenarios
(MDPI AG, 2024-08-19) Ahmed, Shakil; Hossain, Md Akbar; Chong, Peter Han Joo; Ray, Sayan Kumar
The Internet of Things (IoT) is a promising technology for sensing and monitoring the environment to reduce disaster impact. Energy is one of the major concerns for IoT devices, as sensors used in IoT devices are battery-operated. Thus, it is important to reduce energy consumption, especially during data transmission in disaster-prone situations. Clustering-based communication helps reduce a node's energy decay during data transmission and enhances network lifetime. Many hybrid combination algorithms have been proposed for clustering and routing protocols to improve network lifetime in disaster scenarios. However, the performance of these protocols varies widely based on the underlying network configuration and the optimisation parameters considered. In this research, we used the clustering parameters most relevant to disaster scenarios, such as the node's residual energy, distance to sink, and network coverage. We then proposed the bio-inspired hybrid BOA-PSO algorithm, where the Butterfly Optimisation Algorithm (BOA) is used for clustering and Particle Swarm Optimisation (PSO) is used for the routing protocol. The performance of the proposed algorithm was compared with that of various benchmark protocols: LEACH, DEEC, PSO, PSO-GA, and PSO-HAS. Residual energy, network throughput, and network lifetime were considered performance metrics. The simulation results demonstrate that the proposed algorithm effectively conserves residual energy, achieving more than a 17% improvement for short-range scenarios and a 10% improvement for long-range scenarios. In terms of throughput, the proposed method delivers a 60% performance enhancement compared to LEACH, a 53% enhancement compared to DEEC, and a 37% enhancement compared to PSO. Additionally, the proposed method results in a 60% reduction in packet drops compared to LEACH and DEEC, and a 30% reduction compared to PSO. It increases network lifetime by 10-20% compared to the benchmark algorithms.
CISO: Co-iteration Semi-supervised Learning for Visual Object Detection
(Springer Science and Business Media LLC, 2023-09-19) Qi, Jianchun; Nguyen, Minh; Yan, Wei Qi
Semi-supervised learning offers a solution to the high cost and limited availability of manually labeled samples in supervised learning. In semi-supervised visual object detection, the use of unlabeled data can significantly enhance the performance of deep learning models. In this paper, we introduce an end-to-end framework, named CISO (Co-Iteration Semi-Supervised Learning for Object Detection), which integrates a knowledge distillation approach and a collaborative, iterative semi-supervised learning strategy. To maximize the utilization of pseudo-label data and address the scarcity of pseudo-label data due to high threshold settings, we propose a mean iteration approach where all unlabeled data is applied to each training iteration. Pseudo-label data with high confidence is extracted based on an ever-changing threshold (average intersection over union of all pseudo-labeled data). This strategy not only ensures the accuracy of the pseudo-label but also optimizes the use of unlabeled data. Subsequently, we apply a weak-strong data augmentation strategy to update the model. Lastly, we evaluate CISO using Swin Transformer model and conduct comprehensive experiments on MS-COCO. Our framework showcases impressive results, outperforms the state-of-the-art methods by 2.16 mAP and 1.54 mAP with 10% and 5% labeled data, respectively.
Cyber-Physical Distributed Intelligent Motor Fault Detection
(MDPI AG, 2024-08-02) Al-Anbuky, Adnan; Altaf, Saud; Gheitasi, Alireza
This research paper explores the realm of fault detection in distributed motors through the vision of the Internet of electrical drives. This paper aims at employing artificial neural networks supported by the data collected by the Internet of distributed devices. Cross-verification of results offers reliable diagnosis of industrial motor faults. The proposed methodology involves the development of a cyber-physical system architecture and mathematical modeling framework for efficient fault detection. The mathematical model is designed to capture the intricate relationships within the cyber-physical system, incorporating the dynamic interactions between distributed motors and their edge controllers. Fast Fourier transform is employed for signal processing, enabling the extraction of meaningful frequency features that serve as indicators of potential faults. The artificial neural network based fault detection system is integrated with the solution, utilizing its ability to learn complex patterns and adapt to varying motor conditions. The effectiveness of the proposed framework and model is demonstrated through experimental results. The experimental setup involves diverse fault scenarios, and the system's performance is evaluated in terms of accuracy, sensitivity, and false positive rates.
Cylindrical Piezoelectric PZT Transducers for Sensing and Actuation
(MDPI AG, 2023-03-11) Meshkinzar, Ata; Al-Jumaily, Ahmed M
Piezoelectric transducers have numerous applications in a wide range of sensing and actuation applications. Such a variety has resulted in continuous research into the design and development of these transducers, including but not limited to their geometry, material and configuration. Among these, cylindrical-shaped piezoelectric PZT transducers with superior features are suitable for various sensor or actuator applications. However, despite their strong potential, they have not been thoroughly investigated and fully established. The aim of this paper is to shed light on various cylindrical piezoelectric PZT transducers, their applications and design configurations. Based on the latest literature, different design configurations such as stepped-thickness cylindrical transducers and their potential application areas will be elaborated on to propose future research trends for introducing new configurations that meet the requirements for biomedical applications, the food industry, as well as other industrial fields.
Decoding Attempted Hand Movements in Stroke Patients Using Surface Electromyography
(MDPI AG, 2020-11-26) Jochumsen, M; Niazi, IK; Rehman, MZU; Amjad, I; Shafique, M; Gilani, SO; Waris, A
Brain‐ and muscle‐triggered exoskeletons have been proposed as a means for motor training after a stroke. With the possibility of performing different movement types with an exoskeleton, it is possible to introduce task variability in training. It is difficult to decode different movement types simultaneously from brain activity, but it may be possible from residual muscle activity that many patients have or quickly regain. This study investigates whether nine different motion classes of the hand and forearm could be decoded from forearm EMG in 15 stroke patients. This study also evaluates the test‐retest reliability of a classical, but simple, classifier (linear discriminant analysis) and advanced, but more computationally intensive, classifiers (autoencoders and convolutional neural networks). Moreover, the association between the level of motor impairment and classification accuracy was tested. Three channels of surface EMG were recorded during the following motion classes: Hand Close, Hand Open, Wrist Extension, Wrist Flexion, Supination, Pronation, Lateral Grasp, Pinch Grasp, and Rest. Six repetitions of each motion class were performed on two different days. Hudgins time‐domain features were extracted and classified using linear discriminant analysis and autoencoders, and raw EMG was classified with convolutional neural networks. On average, 79 ± 12% and 80 ± 12% (autoencoders) of the movements were correctly classified for days 1 and 2, respectively, with an intraclass correlation coefficient of 0.88. No association was found between the level of motor impairment and classification accuracy (Spearman correlation: 0.24). It was shown that nine motion classes could be decoded from residual EMG, with autoencoders being the best classification approach, and that the results were reliable across days; this may have implications for the development of EMG‐controlled exoskeletons for training in the patient’s home.
Decoding of Ankle Joint Movements in Stroke Patients Using Surface Electromyography
(MDPI AG, 2021-02-24) Noor, A; Waris, A; Gilani, SO; Kashif, AS; Jochumsen, M; Iqbal, J; Niazi, IK
Stroke is a cerebrovascular disease (CVD), which results in hemiplegia, paralysis, or death. Conventionally, a stroke patient requires prolonged sessions with physical therapists for the recovery of motor function. Various home-based rehabilitative devices are also available for upper limbs and require minimal or no assistance from a physiotherapist. However, there is no clinically proven device available for functional recovery of a lower limb. In this study, we explored the potential use of surface electromyography (sEMG) as a controlling mechanism for the development of a home-based lower limb rehabilitative device for stroke patients. In this experiment, three channels of sEMG were used to record data from 11 stroke patients while performing ankle joint movements. The movements were then decoded from the sEMG data and their correlation with the level of motor impairment was investigated. The impairment level was quantified using the Fugl-Meyer Assessment (FMA) scale. During the analysis, Hudgins time-domain features were extracted and classified using linear discriminant analysis (LDA) and artificial neural network (ANN). On average, 63.86% ± 4.3% and 67.1% ± 7.9% of the movements were accurately classified in an offline analysis by LDA and ANN, respectively. We found that in both classifiers, some motions outperformed oth-ers (p < 0.001 for LDA and p = 0.014 for ANN). The Spearman correlation (ρ) was calculated between the FMA scores and classification accuracies. The results indicate that there is a moderately positive correlation (ρ = 0.75 for LDA and ρ = 0.55 for ANN) between the two of them. The findings of this study suggest that a home-based EMG system can be developed to provide customized therapy for the improvement of functional lower limb motion in stroke patients.
Deploying Wireless Sensor Networks in Multi-story Buildings Towards IoT-Based Intelligent Environments: An Empirical Study
(MDPI, 2024-05-25) Sarkar, Nurul I; Gul, S
With the growing integration of the Internet of Things in smart buildings, it is crucial to ensure the precise implementation and operation of wireless sensor networks (WSNs). This paper aims to study the implementation aspect of WSNs in a commercial multi-story building, specifically addressing the difficulty of dealing with the variable environmental conditions on each floor. This research addresses the disparity between simulated situations and actual deployments, offering valuable insights into the potential to significantly improve the efficiency and responsiveness of building management systems. We obtain real-time sensor data to analyze and evaluate the system’s performance. Our investigation is grounded in the growing importance of incorporating WSNs into buildings to create intelligent environments. We provide an in-depth analysis for scrutinizing the disparities and commonalities between the datasets obtained from real-world deployments and simulation. The results obtained show the significance of accurate simulation models for reliable data representation, providing a roadmap for further developments in the integration of WSNs into intelligent building scenarios. This research’s findings highlight the potential for optimizing living and working conditions based on the real-time monitoring of critical environmental parameters. This includes insights into temperature, humidity, and light intensity, offering opportunities for enhanced comfort and efficiency in intelligent environments.
Design and Modeling of a Terahertz Transceiver for Intra- and Inter-chip Communications in Wireless Network-on-Chip Architectures
(MDPI AG, 2024-05-18) Paudel, Biswash; Li, Xue Jun; Seet, Boon-Chong
This paper addresses the increasing demand for computing power and the challenges associated with adding more core units to a computer processor. It explores the utilization of System-on-Chip (SoC) technology, which integrates Terahertz (THz) wave communication capabilities for intra- and inter-chip communication, using the concept of Wireless Network-on-Chips (WNoCs). Various types of network topologies are discussed, along with the disadvantages of wired networks. We explore the idea of applying wireless connections among cores and across the chip. Additionally, we describe the WNoC architecture, the flip-chip package, and the THz antenna. Electromagnetic fields are analyzed using a full-wave simulation software, Ansys High Frequency Structure Simulator (HFSS). The simulation is conducted with dipole and zigzag antennas communicating within the chip at resonant frequencies of 446 GHz and 462.5 GHz, with transmission coefficients of around -28 dB and -33 to -41 dB, respectively. Transmission coefficient characterization, path loss analysis, a study of electric field distribution, and a basic link budget for transmission are provided. Furthermore, the feasibility of calculated transmission power is validated in cases of high insertion loss, ensuring that the achieved energy expenditure is less than 1 pJ/bit. Finally, employing a similar setup, we study intra-chip communication using the same antennas. Simulation results indicate that the zigzag antenna exhibits a higher electric field magnitude compared with the dipole antenna across the simulated chip structure. We conclude that transmission occurs through reflection from the ground plane of a printed circuit board (PCB), as evidenced by the electric field distribution.
Detection of Error-Related Potentials in Stroke Patients from EEG Using an Artificial Neural Network
(MDPI AG, 2021-09-18) Usama, N; Niazi, IK; Dremstrup, K; Jochumsen, M
Error-related potentials (ErrPs) have been proposed as a means for improving brain– computer interface (BCI) performance by either correcting an incorrect action performed by the BCI or label data for continuous adaptation of the BCI to improve the performance. The latter approach could be relevant within stroke rehabilitation where BCI calibration time could be minimized by using a generalized classifier that is continuously being individualized throughout the rehabilitation session. This may be achieved if data are correctly labelled. Therefore, the aims of this study were: (1) classify single-trial ErrPs produced by individuals with stroke, (2) investigate test–retest reliability, and (3) compare different classifier calibration schemes with different classification methods (artificial neural network, ANN, and linear discriminant analysis, LDA) with waveform features as input for meaningful physiological interpretability. Twenty-five individuals with stroke operated a sham BCI on two separate days where they attempted to perform a movement after which they received feedback (error/correct) while continuous EEG was recorded. The EEG was divided into epochs: ErrPs and NonErrPs. The epochs were classified with a multi-layer perceptron ANN based on temporal features or the entire epoch. Additionally, the features were classified with shrinkage LDA. The features were waveforms of the ErrPs and NonErrPs from the sensorimotor cortex to improve the explainability and interpretation of the output of the classifiers. Three calibration schemes were tested: within-day, between-day, and across-participant. Using within-day calibration, 90% of the data were correctly classified with the entire epoch as input to the ANN; it decreased to 86% and 69% when using temporal features as input to ANN and LDA, respectively. There was poor test–retest reliability between the two days, and the other calibration schemes led to accuracies in the range of 63–72% with LDA performing the best. There was no association between the individuals’ impairment level and classification accuracies. The results show that ErrPs can be classified in individuals with stroke, but that user-and session-specific calibration is needed for optimal ErrP decoding with this approach. The use of ErrP/NonErrP waveform features makes it possible to have a physiological meaningful interpretation of the output of the classifiers. The results may have implications for labelling data continuously in BCIs for stroke rehabilitation and thus potentially improve the BCI performance.
Extended Context-Based Semantic Communication System for Text Transmission
(Elsevier BV, 2022-10-08) Liu, Yueling; Jiang, Shengteng; Zhang, Yichi; Cao, Kuo; Zhou, Li; Seet, Boon-Chong; Zhao, Haitao; Wei, Jibo
Context information is significant for semantic extraction and recovery of messages in semantic communication. However, context information is not fully utilized in the existing semantic communication systems since relationships between sentences are often ignored. In this paper, we propose an Extended Context-based Semantic Communication (ECSC) system for text transmission, in which context information within and between sentences is explored for semantic representation and recovery. At the encoder, self-attention and segment-level relative attention are used to extract context information within and between sentences, respectively. In addition, a gate mechanism is adopted at the encoder to incorporate the context information from different ranges. At the decoder, Transformer-XL is introduced to obtain more semantic information from the historical communication processes for semantic recovery. Simulation results show the effectiveness of our proposed model in improving the semantic accuracy between transmitted and recovered messages under various channel conditions.
Fruit Ripeness Identification Using YOLOv8 Model
(Springer Science and Business Media LLC, 2023-08-31) Xiao, Bingjie; Nguyen, Minh; Yan, Wei Qi
Deep learning-based visual object detection is a fundamental aspect of computer vision. These models not only locate and classify multiple objects within an image, but they also identify bounding boxes. The focus of this paper's research work is to classify fruits as ripe or overripe using digital images. Our proposed model extracts visual features from fruit images and analyzes fruit peel characteristics to predict the fruit's class. We utilize our own datasets to train two "anchor-free" models: YOLOv8 and CenterNet, aiming to produce accurate predictions. The CenterNet network primarily incorporates ResNet-50 and employs the deconvolution module DeConv for feature map upsampling. The final three branches of convolutional neural networks are applied to predict the heatmap. The YOLOv8 model leverages CSP and C2f modules for lightweight processing. After analyzing and comparing the two models, we found that the C2f module of the YOLOv8 model significantly enhances classification results, achieving an impressive accuracy rate of 99.5%.
Geometric Implications of Photodiode Arrays on Received Power Distribution in Mobile Underwater Optical Wireless Communication
(MDPI AG, 2024-05-28) Govinda Waduge, Tharuka; Seet, Boon-Chong; Vopel, Kay
Underwater optical wireless communication (UOWC) has gained interest in recent years with the introduction of autonomous and remotely operated mobile systems in blue economic ventures such as offshore food production and energy generation. Here, we devised a model for estimating the received power distribution of diffused line-of-sight mobile optical links, accommodating irregular intensity distributions beyond the beam-spread angle of the emitter. We then used this model to conduct a spatial analysis investigating the parametric influence of the placement, orientation, and angular spread of photodiodes in array-based receivers on the mobile UOWC links in different Jerlov seawater types. It revealed that flat arrays were best for links where strict alignment could be maintained, whereas curved arrays performed better spatially but were not always optimal. Furthermore, utilizing two or more spectrally distinct wavelengths and more bandwidth-efficient modulation may be preferred for received-signal intensity-based localization and improving link range in clearer oceans, respectively. Considering the geometric implications of the array of receiver photodiodes for mobile UOWCs, we recommend the use of dynamically shape-shifting array geometries.
Grape Maturity Estimation Using Time-of-Flight and LiDAR Depth Cameras
(MDPI AG, 2024-08-07) Legg, Mathew; Parr, Baden; Pascual, Genevieve; Alam, Fakhrul
This article investigates the potential for using low-cost depth cameras to estimate the maturity of green table grapes after they have been harvested. Time-of-flight (Kinect Azure) and LiDAR (Intel L515) depth cameras were used to capture depth scans of green table grape berries over time. The depth scans of the grapes are distorted due to the diffused scattering of the light emitted from the cameras within the berries. This causes a distance bias where a grape berry appears to be further from the camera than it is. As the grape aged, the shape of the peak corresponding to the grape became increasingly flattened in shape, resulting in an increased distance bias over time. The distance bias variation with time was able to be fitted with an 𝑅2 value of 0.969 for the Kinect Azure and an average of 0.904 for the Intel L515. This work shows that there is potential to use time-of-flight and LIDAR cameras for estimating grape maturity postharvest in a non-contact and nondestructive manner.

Browse

Browsing Open Research by Subject "0805 Distributed Computing"

Results Per Page

Sort Options