Performance Evaluation of YOLOv8 Model Configurations, for Instance Segmentation of Strawberry Fruit Development Stages in an Open Field Environment

Read original: arXiv:2408.05661 - Published 8/14/2024 by Abdul-Razak Alhassan Gamani, Ibrahim Arhin, Adrena Kyeremateng Asamoah
Total Score

0

Performance Evaluation of YOLOv8 Model Configurations, for Instance Segmentation of Strawberry Fruit Development Stages in an Open Field Environment

Sign in to get full access

or

If you already have an account, we'll log you in

Overview

  • This paper evaluates the performance of different YOLOv8 model configurations for the instance segmentation of strawberry fruit development stages in an open field environment.
  • The researchers examined the accuracy, inference time, and model size of various YOLOv8 models to determine the optimal configuration for their specific task.
  • The goal was to develop a reliable and efficient system for monitoring strawberry growth stages, which could benefit precision agriculture and crop management.

Plain English Explanation

The researchers in this study wanted to find the best version of the YOLOv8 machine learning model for identifying different stages of strawberry growth in outdoor farm settings. YOLOv8 is a popular object detection and instance segmentation model.

They tested out several different configurations of the YOLOv8 model, evaluating how accurate each one was at recognizing strawberries at different maturity levels, how quickly it could make those identifications, and how large the model file size was. The goal was to find the YOLOv8 setup that could do this job accurately and efficiently.

Identifying the growth stages of strawberries is important for precision agriculture, allowing farmers to better monitor and manage their crops. This kind of fruit and vegetable detection is an active area of research, with applications in smart farming and automating agricultural tasks.

Technical Explanation

The researchers evaluated several configurations of the YOLOv8 model for the task of instance segmentation of strawberry fruit development stages in an open field environment. They tested different YOLOv8 model sizes (N, S, M, L, and X) as well as different detection heads (Nano, Segmentation, and Detection).

The models were trained on a custom dataset of images captured in an open field, containing strawberries at various growth stages. The researchers measured the accuracy, inference time, and model size for each configuration to determine the optimal tradeoff between performance and efficiency.

The results showed that the YOLOv8-N-Seg model achieved the best balance, with high instance segmentation accuracy while maintaining a relatively small model size and fast inference time. This configuration could be well-suited for deployment in resource-constrained edge devices for real-time monitoring of strawberry growth in open field environments.

Critical Analysis

The paper provides a thorough evaluation of different YOLOv8 model configurations for the specific task of strawberry instance segmentation. However, the dataset used for training and evaluation was limited to a single open field environment. Further research would be needed to assess the generalizability of the findings to other strawberry growing regions and conditions.

Additionally, the paper does not discuss potential biases in the dataset or model, such as whether the models perform equally well on different strawberry varieties or under varying lighting and weather conditions. These factors could impact the real-world deployment and performance of the system.

It would also be valuable to compare the YOLOv8 models to other instance segmentation approaches, such as Mask R-CNN, to provide a more comprehensive evaluation of the state-of-the-art for this agricultural application. Recent research has compared YOLOv8 to other leading object detection and segmentation models, but the specific comparison for strawberry monitoring would be insightful.

Conclusion

This paper presents a detailed performance evaluation of different YOLOv8 model configurations for the task of instance segmentation of strawberry fruit development stages in an open field environment. The researchers identified the YOLOv8-N-Seg model as the optimal balance of accuracy, inference time, and model size for this application.

The findings of this study could inform the development of precision agriculture systems for real-time monitoring and management of strawberry crops. The ability to reliably detect and track strawberry growth stages could lead to improved yield, quality, and sustainability in strawberry farming. Further research is needed to assess the generalizability of the models and compare them to other state-of-the-art approaches.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Performance Evaluation of YOLOv8 Model Configurations, for Instance Segmentation of Strawberry Fruit Development Stages in an Open Field Environment
Total Score

0

Performance Evaluation of YOLOv8 Model Configurations, for Instance Segmentation of Strawberry Fruit Development Stages in an Open Field Environment

Abdul-Razak Alhassan Gamani, Ibrahim Arhin, Adrena Kyeremateng Asamoah

Accurate identification of strawberries during their maturing stages is crucial for optimizing yield management, and pest control, and making informed decisions related to harvest and post-harvest logistics. This study evaluates the performance of YOLOv8 model configurations for instance segmentation of strawberries into ripe and unripe stages in an open field environment. The YOLOv8n model demonstrated superior segmentation accuracy with a mean Average Precision (mAP) of 80.9%, outperforming other YOLOv8 configurations. In terms of inference speed, YOLOv8n processed images at 12.9 milliseconds, while YOLOv8s, the least-performing model, processed at 22.2 milliseconds. Over 86 test images with 348 ground truth labels, YOLOv8n detected 235 ripe fruit classes and 51 unripe fruit classes out of 251 ground truth ripe fruits and 97 unripe ground truth labels, respectively. In comparison, YOLOv8s detected 204 ripe fruits and 37 unripe fruits. Overall, YOLOv8n achieved the fastest inference speed of 24.2 milliseconds, outperforming YOLOv8s, YOLOv8m, YOLOv8l, and YOLOv8x, which processed images at 33.0 milliseconds, 44.3 milliseconds, 53.6 milliseconds, and 62.5 milliseconds, respectively. These results underscore the potential of advanced object segmentation algorithms to address complex visual recognition tasks in open-field agriculture effectively to address complex visual recognition tasks in open-field agriculture effectively.

Read more

8/14/2024

🚀

Total Score

0

Comprehensive Performance Evaluation of YOLOv10, YOLOv9 and YOLOv8 on Detecting and Counting Fruitlet in Complex Orchard Environments

Ranjan Sapkota, Zhichao Meng, Martin Churuvija, Xiaoqiang Du, Zenghong Ma, Manoj Karkee

This study performed an extensive evaluation of the performances of all configurations of YOLOv8, YOLOv9, and YOLOv10 object detection algorithms for fruitlet (of green fruit) detection in commercial orchards. Additionally, this research performed and validated in-field counting of fruitlets using an iPhone and machine vision sensors in 5 different apple varieties (Scifresh, Scilate, Honeycrisp, Cosmic crisp & Golden delicious). This comprehensive investigation of total 17 different configurations (5 for YOLOv8, 6 for YOLOv9 and 6 for YOLOv10) revealed that YOLOv9 outperforms YOLOv10 and YOLOv8 in terms of mAP@50, while YOLOv10x outperformed all 17 configurations tested in terms of precision and recall. Specifically, YOLOv9 Gelan-e achieved the highest mAP@50 of 0.935, outperforming YOLOv10n's 0.921 and YOLOv8s's 0.924. In terms of precision, YOLOv10x achieved the highest precision of 0.908, indicating superior object identification accuracy compared to other configurations tested (e.g. YOLOv9 Gelan-c with a precision of 0.903 and YOLOv8m with 0.897. In terms of recall, YOLOv10s achieved the highest in its series (0.872), while YOLOv9 Gelan m performed the best among YOLOv9 configurations (0.899), and YOLOv8n performed the best among the YOLOv8 configurations (0.883). Meanwhile, three configurations of YOLOv10: YOLOv10b, YOLOv10l, and YOLOv10x achieved superior post-processing speeds of 1.5 milliseconds, outperforming all other configurations within the YOLOv9 and YOLOv8 families. Specifically, YOLOv9 Gelan-e recorded a post-processing speed of 1.9 milliseconds, and YOLOv8m achieved 2.1 milliseconds. Furthermore, YOLOv8n exhibited the highest inference speed among all configurations tested, achieving a processing time of 4.1 milliseconds while YOLOv9 Gelan-t and YOLOv10n also demonstrated comparatively slower inference speeds of 9.3 ms and 5.5 ms, respectively.

Read more

8/28/2024

🖼️

Total Score

0

Comparing YOLOv8 and Mask RCNN for object segmentation in complex orchard environments

Ranjan Sapkota, Dawood Ahmed, Manoj Karkee

Instance segmentation, an important image processing operation for automation in agriculture, is used to precisely delineate individual objects of interest within images, which provides foundational information for various automated or robotic tasks such as selective harvesting and precision pruning. This study compares the one-stage YOLOv8 and the two-stage Mask R-CNN machine learning models for instance segmentation under varying orchard conditions across two datasets. Dataset 1, collected in dormant season, includes images of dormant apple trees, which were used to train multi-object segmentation models delineating tree branches and trunks. Dataset 2, collected in the early growing season, includes images of apple tree canopies with green foliage and immature (green) apples (also called fruitlet), which were used to train single-object segmentation models delineating only immature green apples. The results showed that YOLOv8 performed better than Mask R-CNN, achieving good precision and near-perfect recall across both datasets at a confidence threshold of 0.5. Specifically, for Dataset 1, YOLOv8 achieved a precision of 0.90 and a recall of 0.95 for all classes. In comparison, Mask R-CNN demonstrated a precision of 0.81 and a recall of 0.81 for the same dataset. With Dataset 2, YOLOv8 achieved a precision of 0.93 and a recall of 0.97. Mask R-CNN, in this single-class scenario, achieved a precision of 0.85 and a recall of 0.88. Additionally, the inference times for YOLOv8 were 10.9 ms for multi-class segmentation (Dataset 1) and 7.8 ms for single-class segmentation (Dataset 2), compared to 15.6 ms and 12.8 ms achieved by Mask R-CNN's, respectively.

Read more

7/8/2024

🔎

Total Score

0

Strawberry detection and counting based on YOLOv7 pruning and information based tracking algorithm

Shiyu Liu, Congliang Zhou, Won Suk Lee

The strawberry industry yields significant economic benefits for Florida, yet the process of monitoring strawberry growth and yield is labor-intensive and costly. The development of machine learning-based detection and tracking methodologies has been used for helping automated monitoring and prediction of strawberry yield, still, enhancement has been limited as previous studies only applied the deep learning method for flower and fruit detection, which did not consider the unique characteristics of image datasets collected by the machine vision system. This study proposed an optimal pruning of detection heads of the deep learning model (YOLOv7 and its variants) that could achieve fast and precise strawberry flower, immature fruit, and mature fruit detection. Thereafter, an enhanced object tracking algorithm, which is called the Information Based Tracking Algorithm (IBTA) utilized the best detection result, removed the Kalman Filter, and integrated moving direction, velocity, and spatial information to improve the precision in strawberry flower and fruit tracking. The proposed pruning of detection heads across YOLOv7 variants, notably Pruning-YOLOv7-tiny with detection head 3 and Pruning-YOLOv7-tiny with heads 2 and 3 achieved the best inference speed (163.9 frames per second) and detection accuracy (89.1%), respectively. On the other hand, the effect of IBTA was proved by comparing it with the centroid tracking algorithm (CTA), the Multiple Object Tracking Accuracy (MOTA) and Multiple Object Tracking Precision (MOTP) of IBTA were 12.3% and 6.0% higher than that of CTA, accordingly. In addition, other object-tracking evaluation metrics, including IDF1, IDR, IDP, MT, and IDs, show that IBTA performed better than CTA in strawberry flower and fruit tracking.

Read more

7/18/2024