Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision

We present a low-cost monocular 3D position estimation method for perception in aquaculture monitoring. Video surveillance of aquaculture has many advantages but given the size of farms and the complexity of their habitats, it is not feasible for farmers to continuously monitor fish health. We formu...

Full description

Saved in:
Bibliographic Details
Main Authors: Koh, Ming En, Fong, Mark Wong Kei, Ng, Eddie Yin Kwee
Other Authors: School of Mechanical and Aerospace Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/171237
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-171237
record_format dspace
spelling sg-ntu-dr.10356-1712372023-10-17T07:43:03Z Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision Koh, Ming En Fong, Mark Wong Kei Ng, Eddie Yin Kwee School of Mechanical and Aerospace Engineering School of Computer Science and Engineering Engineering::Mechanical engineering Engineering::Computer science and engineering Aquaculture Monocular Machine Vision We present a low-cost monocular 3D position estimation method for perception in aquaculture monitoring. Video surveillance of aquaculture has many advantages but given the size of farms and the complexity of their habitats, it is not feasible for farmers to continuously monitor fish health. We formulate a novel end-to-end deep visual learning pipeline called Aqua3DNet that estimates fish pose using a bottom-up approach to detect and assign key features in one pass. In addition, a depth estimation model using Saliency Object Detection (SOD) masks is implemented to track the 3D position of the fish over time, which is used in this paper to create 3D density heat maps of the fish. The evaluation of the algorithm's performance shows that the detection accuracy reaches 80.63%, the F1 score reaches 87.34%, and the frames per second (fps) reaches 5.12. Aqua3DNet achieves comparable performance to other aquaculture-based computer vision and depth estimation models, with minimal decrease in speed despite the synthesis of the two models. 2023-10-17T07:43:03Z 2023-10-17T07:43:03Z 2023 Journal Article Koh, M. E., Fong, M. W. K. & Ng, E. Y. K. (2023). Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision. Aquacultural Engineering, 103, 102367-. https://dx.doi.org/10.1016/j.aquaeng.2023.102367 0144-8609 https://hdl.handle.net/10356/171237 10.1016/j.aquaeng.2023.102367 2-s2.0-85171442889 103 102367 en Aquacultural Engineering © 2023 Elsevier B.V. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Mechanical engineering
Engineering::Computer science and engineering
Aquaculture
Monocular Machine Vision
spellingShingle Engineering::Mechanical engineering
Engineering::Computer science and engineering
Aquaculture
Monocular Machine Vision
Koh, Ming En
Fong, Mark Wong Kei
Ng, Eddie Yin Kwee
Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision
description We present a low-cost monocular 3D position estimation method for perception in aquaculture monitoring. Video surveillance of aquaculture has many advantages but given the size of farms and the complexity of their habitats, it is not feasible for farmers to continuously monitor fish health. We formulate a novel end-to-end deep visual learning pipeline called Aqua3DNet that estimates fish pose using a bottom-up approach to detect and assign key features in one pass. In addition, a depth estimation model using Saliency Object Detection (SOD) masks is implemented to track the 3D position of the fish over time, which is used in this paper to create 3D density heat maps of the fish. The evaluation of the algorithm's performance shows that the detection accuracy reaches 80.63%, the F1 score reaches 87.34%, and the frames per second (fps) reaches 5.12. Aqua3DNet achieves comparable performance to other aquaculture-based computer vision and depth estimation models, with minimal decrease in speed despite the synthesis of the two models.
author2 School of Mechanical and Aerospace Engineering
author_facet School of Mechanical and Aerospace Engineering
Koh, Ming En
Fong, Mark Wong Kei
Ng, Eddie Yin Kwee
format Article
author Koh, Ming En
Fong, Mark Wong Kei
Ng, Eddie Yin Kwee
author_sort Koh, Ming En
title Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision
title_short Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision
title_full Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision
title_fullStr Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision
title_full_unstemmed Aqua3DNet: real-time 3D pose estimation of livestock in aquaculture by monocular machine vision
title_sort aqua3dnet: real-time 3d pose estimation of livestock in aquaculture by monocular machine vision
publishDate 2023
url https://hdl.handle.net/10356/171237
_version_ 1781793850959855616