Benchmarking different deep regression models for predicting image rotation angle and robot’s end effector’s position
Deep visual regression models have an important role to find how much the learning model fits the relationship between the visual data (images) and the predicted continuous output. Recently, deep visual regression has been utilized in different applications such as age prediction, digital holography...
Saved in:
Main Authors: | , |
---|---|
Format: | Conference or Workshop Item |
Language: | English English |
Published: |
Institute of Electrical and Electronics Engineers Inc.
2019
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/78088/1/78088_Benchmarking%20different%20deep%20regression%20models_complete_new.pdf http://irep.iium.edu.my/78088/2/78088_Benchmarking%20different%20deep%20regression%20models_scopus.pdf http://irep.iium.edu.my/78088/ https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8952047 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Islam Antarabangsa Malaysia |
Language: | English English |
Summary: | Deep visual regression models have an important role to find how much the learning model fits the relationship between the visual data (images) and the predicted continuous output. Recently, deep visual regression has been utilized in different applications such as age prediction, digital holography, and head-pose estimation. Deep learning has recently been cutting-edge research. Most of the research papers have focused on utilizing deep learning in classification tasks. There is still a lack of research that use deep learning for regression. This paper utilizes different deep learning models for two regression tasks. The first one is the prediction of the image rotation angle. The second task is to predict the position of the robot’s end-effector in 2D space. Efficient features were learned or extracted in order to perform good regression. The paper demonstrates and compares various models such as a local Receptive Field-Extreme Learning Machine (LRF-ELM), Hierarchical ELM, Supervised Convolutional Neural Network (CNN), and pre-trained CNN such as AlexNet. Each model was trained to learn or extract features and map them to specific continuous output. The results show that all models gave good performance in terms of RMSE and accuracy. H-ELM was found to outperform other models in term of training speed. |
---|