PVT3: a pruned video-vision transformer for tactile texture classification

With the newly involved technologies in tactile sensory, variants tactile sensors have been deployed on robots which provides them touching ability to perceive complex environments. One typical example of robot touching task is to recognize different materials based on the tactile data generated fro...

Full description

Saved in:
Bibliographic Details
Main Author: Ouyang, Yanjia
Other Authors: Lin Zhiping
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/158296
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-158296
record_format dspace
spelling sg-ntu-dr.10356-1582962023-07-07T18:55:18Z PVT3: a pruned video-vision transformer for tactile texture classification Ouyang, Yanjia Lin Zhiping School of Electrical and Electronic Engineering A*STAR -I2R Wu Yan EZPLin@ntu.edu.sg Engineering::Electrical and electronic engineering With the newly involved technologies in tactile sensory, variants tactile sensors have been deployed on robots which provides them touching ability to perceive complex environments. One typical example of robot touching task is to recognize different materials based on the tactile data generated from different textures. In this report, we propose PVT 3 , a light-weight Transformer-based architecture with pruning layers to model the texture representation. By using a Video-Vision Transformer backbone, the spatial and temporal features will be well preserved and utilized. The multi-dimensional pruning layers will reduce model complexity and size without sacrificing the performance. Three tactile datasets are used for 3 testing the PVT model. Overall, our proposed model achieves higher accuracy on material classification results with a smaller model size compared to the state-of-the-art tactile texture models. This work was written as a paper and submitted to the International Conference on Intelligent Robots and Systems (IROS) 2022. Bachelor of Engineering (Electrical and Electronic Engineering) 2022-05-31T07:38:53Z 2022-05-31T07:38:53Z 2022 Final Year Project (FYP) Ouyang, Y. (2022). PVT3: a pruned video-vision transformer for tactile texture classification. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/158296 https://hdl.handle.net/10356/158296 en B3127-211 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Ouyang, Yanjia
PVT3: a pruned video-vision transformer for tactile texture classification
description With the newly involved technologies in tactile sensory, variants tactile sensors have been deployed on robots which provides them touching ability to perceive complex environments. One typical example of robot touching task is to recognize different materials based on the tactile data generated from different textures. In this report, we propose PVT 3 , a light-weight Transformer-based architecture with pruning layers to model the texture representation. By using a Video-Vision Transformer backbone, the spatial and temporal features will be well preserved and utilized. The multi-dimensional pruning layers will reduce model complexity and size without sacrificing the performance. Three tactile datasets are used for 3 testing the PVT model. Overall, our proposed model achieves higher accuracy on material classification results with a smaller model size compared to the state-of-the-art tactile texture models. This work was written as a paper and submitted to the International Conference on Intelligent Robots and Systems (IROS) 2022.
author2 Lin Zhiping
author_facet Lin Zhiping
Ouyang, Yanjia
format Final Year Project
author Ouyang, Yanjia
author_sort Ouyang, Yanjia
title PVT3: a pruned video-vision transformer for tactile texture classification
title_short PVT3: a pruned video-vision transformer for tactile texture classification
title_full PVT3: a pruned video-vision transformer for tactile texture classification
title_fullStr PVT3: a pruned video-vision transformer for tactile texture classification
title_full_unstemmed PVT3: a pruned video-vision transformer for tactile texture classification
title_sort pvt3: a pruned video-vision transformer for tactile texture classification
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/158296
_version_ 1772828965758566400