FoodMask: Real-time food instance counting, segmentation and recognition
Food computing has long been studied and deployed to several applications. Understanding a food image at the instance level, including recognition, counting and segmentation, is essential to quantifying nutrition and calorie consumption. Nevertheless, existing techniques are limited to either catego...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2024
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8321 https://ink.library.smu.edu.sg/context/sis_research/article/9324/viewcontent/FoodMask_av.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-9324 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-93242023-12-05T03:05:42Z FoodMask: Real-time food instance counting, segmentation and recognition NGUYEN, Huu-Thanh CAO, Yu NGO, Chong-wah CHAN, Wing-Kwong Food computing has long been studied and deployed to several applications. Understanding a food image at the instance level, including recognition, counting and segmentation, is essential to quantifying nutrition and calorie consumption. Nevertheless, existing techniques are limited to either category-specific instance detection, which does not reflect precisely the instance size at the pixel level, or category-agnostic instance segmentation, which is insufficient for dish recognition. This paper presents a compact and fast multi-task network, namely FoodMask, for clustering-based food instance counting, segmentation and recognition. The network learns a semantic space simultaneously encoding food category distribution and instance height at pixel basis. While the former value addresses instance recognition, the latter value provides prior knowledge for instance extraction. Besides, we integrate into the semantic space a pathway for class-specific counting. With these three outputs, we propose a clustering algorithm to segment and recognize food instances at a real-time speed. Empirical studies are made on three large-scale food datasets, including Mixed Dishes, UECFoodPixComp and FoodSeg103, which cover Western, Chinese, Japanese and Indian cuisines. The proposed networks outperform benchmarks in both terms of instance map quality and speed efficiency. 2024-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8321 info:doi/10.1016/j.patcog.2023.110017 https://ink.library.smu.edu.sg/context/sis_research/article/9324/viewcontent/FoodMask_av.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Food counting Food instance segmentation Food recognition Databases and Information Systems Food Science Graphics and Human Computer Interfaces |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Food counting Food instance segmentation Food recognition Databases and Information Systems Food Science Graphics and Human Computer Interfaces |
spellingShingle |
Food counting Food instance segmentation Food recognition Databases and Information Systems Food Science Graphics and Human Computer Interfaces NGUYEN, Huu-Thanh CAO, Yu NGO, Chong-wah CHAN, Wing-Kwong FoodMask: Real-time food instance counting, segmentation and recognition |
description |
Food computing has long been studied and deployed to several applications. Understanding a food image at the instance level, including recognition, counting and segmentation, is essential to quantifying nutrition and calorie consumption. Nevertheless, existing techniques are limited to either category-specific instance detection, which does not reflect precisely the instance size at the pixel level, or category-agnostic instance segmentation, which is insufficient for dish recognition. This paper presents a compact and fast multi-task network, namely FoodMask, for clustering-based food instance counting, segmentation and recognition. The network learns a semantic space simultaneously encoding food category distribution and instance height at pixel basis. While the former value addresses instance recognition, the latter value provides prior knowledge for instance extraction. Besides, we integrate into the semantic space a pathway for class-specific counting. With these three outputs, we propose a clustering algorithm to segment and recognize food instances at a real-time speed. Empirical studies are made on three large-scale food datasets, including Mixed Dishes, UECFoodPixComp and FoodSeg103, which cover Western, Chinese, Japanese and Indian cuisines. The proposed networks outperform benchmarks in both terms of instance map quality and speed efficiency. |
format |
text |
author |
NGUYEN, Huu-Thanh CAO, Yu NGO, Chong-wah CHAN, Wing-Kwong |
author_facet |
NGUYEN, Huu-Thanh CAO, Yu NGO, Chong-wah CHAN, Wing-Kwong |
author_sort |
NGUYEN, Huu-Thanh |
title |
FoodMask: Real-time food instance counting, segmentation and recognition |
title_short |
FoodMask: Real-time food instance counting, segmentation and recognition |
title_full |
FoodMask: Real-time food instance counting, segmentation and recognition |
title_fullStr |
FoodMask: Real-time food instance counting, segmentation and recognition |
title_full_unstemmed |
FoodMask: Real-time food instance counting, segmentation and recognition |
title_sort |
foodmask: real-time food instance counting, segmentation and recognition |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2024 |
url |
https://ink.library.smu.edu.sg/sis_research/8321 https://ink.library.smu.edu.sg/context/sis_research/article/9324/viewcontent/FoodMask_av.pdf |
_version_ |
1784855633602805760 |