Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision
We present a low-cost monocular 3D position estimation method for perception in aquaculture monitoring. Video surveillance of aquaculture has many advantages but given the size of farms and the complexity of their habitats, it is not feasible for farmers to continuously monitor fish health. We formu...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/171237 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-171237 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1712372023-10-17T07:43:03Z Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision Koh, Ming En Fong, Mark Wong Kei Ng, Eddie Yin Kwee School of Mechanical and Aerospace Engineering School of Computer Science and Engineering Engineering::Mechanical engineering Engineering::Computer science and engineering Aquaculture Monocular Machine Vision We present a low-cost monocular 3D position estimation method for perception in aquaculture monitoring. Video surveillance of aquaculture has many advantages but given the size of farms and the complexity of their habitats, it is not feasible for farmers to continuously monitor fish health. We formulate a novel end-to-end deep visual learning pipeline called Aqua3DNet that estimates fish pose using a bottom-up approach to detect and assign key features in one pass. In addition, a depth estimation model using Saliency Object Detection (SOD) masks is implemented to track the 3D position of the fish over time, which is used in this paper to create 3D density heat maps of the fish. The evaluation of the algorithm's performance shows that the detection accuracy reaches 80.63%, the F1 score reaches 87.34%, and the frames per second (fps) reaches 5.12. Aqua3DNet achieves comparable performance to other aquaculture-based computer vision and depth estimation models, with minimal decrease in speed despite the synthesis of the two models. 2023-10-17T07:43:03Z 2023-10-17T07:43:03Z 2023 Journal Article Koh, M. E., Fong, M. W. K. & Ng, E. Y. K. (2023). Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision. Aquacultural Engineering, 103, 102367-. https://dx.doi.org/10.1016/j.aquaeng.2023.102367 0144-8609 https://hdl.handle.net/10356/171237 10.1016/j.aquaeng.2023.102367 2-s2.0-85171442889 103 102367 en Aquacultural Engineering © 2023 Elsevier B.V. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Mechanical engineering Engineering::Computer science and engineering Aquaculture Monocular Machine Vision |
spellingShingle |
Engineering::Mechanical engineering Engineering::Computer science and engineering Aquaculture Monocular Machine Vision Koh, Ming En Fong, Mark Wong Kei Ng, Eddie Yin Kwee Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision |
description |
We present a low-cost monocular 3D position estimation method for perception in aquaculture monitoring. Video surveillance of aquaculture has many advantages but given the size of farms and the complexity of their habitats, it is not feasible for farmers to continuously monitor fish health. We formulate a novel end-to-end deep visual learning pipeline called Aqua3DNet that estimates fish pose using a bottom-up approach to detect and assign key features in one pass. In addition, a depth estimation model using Saliency Object Detection (SOD) masks is implemented to track the 3D position of the fish over time, which is used in this paper to create 3D density heat maps of the fish. The evaluation of the algorithm's performance shows that the detection accuracy reaches 80.63%, the F1 score reaches 87.34%, and the frames per second (fps) reaches 5.12. Aqua3DNet achieves comparable performance to other aquaculture-based computer vision and depth estimation models, with minimal decrease in speed despite the synthesis of the two models. |
author2 |
School of Mechanical and Aerospace Engineering |
author_facet |
School of Mechanical and Aerospace Engineering Koh, Ming En Fong, Mark Wong Kei Ng, Eddie Yin Kwee |
format |
Article |
author |
Koh, Ming En Fong, Mark Wong Kei Ng, Eddie Yin Kwee |
author_sort |
Koh, Ming En |
title |
Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision |
title_short |
Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision |
title_full |
Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision |
title_fullStr |
Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision |
title_full_unstemmed |
Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision |
title_sort |
aqua3dnet: real-time 3d pose estimation of livestock in aquaculture by monocular machine vision |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/171237 |
_version_ |
1781793850959855616 |