Abstract
In the realm of agricultural automation, the efficient management of tasks like yield estimation, harvesting, and monitoring is crucial. While fruits are typically detected using bounding boxes, pixel-level segmentation is essential for extracting detailed information such as color, maturity, and shape. Furthermore, while previous studies have typically focused on controlled environments and scenes, achieving robust performance in real orchard conditions is also imperative. To prioritize these aspects, we propose the following two considerations: first, a novel peach image dataset designed for rough orchard environments, focusing on pixel-level segmentation for detailed insights; and second, utilizing a transformer-based instance segmentation model, specifically the Swin Transformer as a backbone of Mask R-CNN. We achieve superior results compared to CNN-based models, reaching 60.2 AP on the proposed peach image dataset. The proposed transformer-based approach specially excels in detecting small or obscured peaches, making it highly suitable for practical field applications. The proposed model achieved 40.4 AP for small objects, nearly doubling that of CNN-based models. This advancement significantly enhances automated agricultural systems, especially in yield estimation, harvesting, and crop monitoring.
| Original language | English |
|---|---|
| Article number | 903 |
| Journal | Agriculture (Switzerland) |
| Volume | 14 |
| Issue number | 6 |
| DOIs | |
| State | Published - 2024.06 |
Keywords
- agricultural automation
- computer vision
- fruit segmentation
- transformer
Quacquarelli Symonds(QS) Subject Topics
- Agriculture & Forestry
Fingerprint
Dive into the research topics of 'High-Precision Peach Fruit Segmentation under Adverse Conditions Using Swin Transformer'. Together they form a unique fingerprint.Press/Media
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver