Deep convolutional neural network-based Bernoulli heatmap for head pose estimation

Head pose estimation is a crucial problem for many tasks, such as driver attention, fatigue detection, and human behaviour analysis. It is well known that neural networks are better at handling classification problems than regression problems. It is an extremely nonlinear process to let the network...

Full description

Saved in:
Bibliographic Details
Main Authors: Hu, Zhongxu, Xing, Yang, Lv, Chen, Hang, Peng, Liu, Jie
Other Authors: School of Mechanical and Aerospace Engineering
Format: Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/160805
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-160805
record_format dspace
spelling sg-ntu-dr.10356-1608052022-08-03T02:36:42Z Deep convolutional neural network-based Bernoulli heatmap for head pose estimation Hu, Zhongxu Xing, Yang Lv, Chen Hang, Peng Liu, Jie School of Mechanical and Aerospace Engineering Engineering::Mechanical engineering Bernoulli Heatmap Channel-Wise Fusion Head pose estimation is a crucial problem for many tasks, such as driver attention, fatigue detection, and human behaviour analysis. It is well known that neural networks are better at handling classification problems than regression problems. It is an extremely nonlinear process to let the network output the angle value directly for optimization learning, and the weight constraint of the loss function will be relatively weak. This paper proposes a novel Bernoulli heatmap for head pose estimation from a single RGB image. Our method can achieve the positioning of the head area while estimating the angles of the head. The Bernoulli heatmap makes it possible to construct fully convolutional neural networks without fully connected layers and provides a new idea for the output form of head pose estimation. A deep convolutional neural network (CNN) structure with multiscale representations is adopted to maintain high-resolution information and low-resolution information in parallel. This kind of structure can maintain rich, high-resolution representations. In addition, channelwise fusion is adopted to make the fusion weights learnable instead of simple addition with equal weights. As a result, the estimation is spatially more precise and potentially more accurate. The effectiveness of the proposed method is empirically demonstrated by comparing it with other state-of-the-art methods on public datasets. Agency for Science, Technology and Research (A*STAR) This work was supported by the A*STAR Grant (No. 1922500046) , Singapore. 2022-08-03T02:36:41Z 2022-08-03T02:36:41Z 2021 Journal Article Hu, Z., Xing, Y., Lv, C., Hang, P. & Liu, J. (2021). Deep convolutional neural network-based Bernoulli heatmap for head pose estimation. Neurocomputing, 436, 198-209. https://dx.doi.org/10.1016/j.neucom.2021.01.048 0925-2312 https://hdl.handle.net/10356/160805 10.1016/j.neucom.2021.01.048 2-s2.0-85100321473 436 198 209 en 1922500046 Neurocomputing © 2021 Elsevier B.V. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Mechanical engineering
Bernoulli Heatmap
Channel-Wise Fusion
spellingShingle Engineering::Mechanical engineering
Bernoulli Heatmap
Channel-Wise Fusion
Hu, Zhongxu
Xing, Yang
Lv, Chen
Hang, Peng
Liu, Jie
Deep convolutional neural network-based Bernoulli heatmap for head pose estimation
description Head pose estimation is a crucial problem for many tasks, such as driver attention, fatigue detection, and human behaviour analysis. It is well known that neural networks are better at handling classification problems than regression problems. It is an extremely nonlinear process to let the network output the angle value directly for optimization learning, and the weight constraint of the loss function will be relatively weak. This paper proposes a novel Bernoulli heatmap for head pose estimation from a single RGB image. Our method can achieve the positioning of the head area while estimating the angles of the head. The Bernoulli heatmap makes it possible to construct fully convolutional neural networks without fully connected layers and provides a new idea for the output form of head pose estimation. A deep convolutional neural network (CNN) structure with multiscale representations is adopted to maintain high-resolution information and low-resolution information in parallel. This kind of structure can maintain rich, high-resolution representations. In addition, channelwise fusion is adopted to make the fusion weights learnable instead of simple addition with equal weights. As a result, the estimation is spatially more precise and potentially more accurate. The effectiveness of the proposed method is empirically demonstrated by comparing it with other state-of-the-art methods on public datasets.
author2 School of Mechanical and Aerospace Engineering
author_facet School of Mechanical and Aerospace Engineering
Hu, Zhongxu
Xing, Yang
Lv, Chen
Hang, Peng
Liu, Jie
format Article
author Hu, Zhongxu
Xing, Yang
Lv, Chen
Hang, Peng
Liu, Jie
author_sort Hu, Zhongxu
title Deep convolutional neural network-based Bernoulli heatmap for head pose estimation
title_short Deep convolutional neural network-based Bernoulli heatmap for head pose estimation
title_full Deep convolutional neural network-based Bernoulli heatmap for head pose estimation
title_fullStr Deep convolutional neural network-based Bernoulli heatmap for head pose estimation
title_full_unstemmed Deep convolutional neural network-based Bernoulli heatmap for head pose estimation
title_sort deep convolutional neural network-based bernoulli heatmap for head pose estimation
publishDate 2022
url https://hdl.handle.net/10356/160805
_version_ 1743119518025121792