PVT3: a pruned video-vision transformer for tactile texture classification
With the newly involved technologies in tactile sensory, variants tactile sensors have been deployed on robots which provides them touching ability to perceive complex environments. One typical example of robot touching task is to recognize different materials based on the tactile data generated fro...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/158296 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-158296 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1582962023-07-07T18:55:18Z PVT3: a pruned video-vision transformer for tactile texture classification Ouyang, Yanjia Lin Zhiping School of Electrical and Electronic Engineering A*STAR -I2R Wu Yan EZPLin@ntu.edu.sg Engineering::Electrical and electronic engineering With the newly involved technologies in tactile sensory, variants tactile sensors have been deployed on robots which provides them touching ability to perceive complex environments. One typical example of robot touching task is to recognize different materials based on the tactile data generated from different textures. In this report, we propose PVT 3 , a light-weight Transformer-based architecture with pruning layers to model the texture representation. By using a Video-Vision Transformer backbone, the spatial and temporal features will be well preserved and utilized. The multi-dimensional pruning layers will reduce model complexity and size without sacrificing the performance. Three tactile datasets are used for 3 testing the PVT model. Overall, our proposed model achieves higher accuracy on material classification results with a smaller model size compared to the state-of-the-art tactile texture models. This work was written as a paper and submitted to the International Conference on Intelligent Robots and Systems (IROS) 2022. Bachelor of Engineering (Electrical and Electronic Engineering) 2022-05-31T07:38:53Z 2022-05-31T07:38:53Z 2022 Final Year Project (FYP) Ouyang, Y. (2022). PVT3: a pruned video-vision transformer for tactile texture classification. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/158296 https://hdl.handle.net/10356/158296 en B3127-211 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Electrical and electronic engineering Ouyang, Yanjia PVT3: a pruned video-vision transformer for tactile texture classification |
description |
With the newly involved technologies in tactile sensory, variants tactile sensors have been deployed on robots which provides them touching ability to perceive complex environments. One typical example of robot touching task is to recognize different materials based on the tactile data generated from different textures. In this report, we propose PVT 3 , a light-weight Transformer-based architecture with pruning layers to model the texture representation. By using a Video-Vision Transformer backbone, the spatial and temporal features will be well preserved and utilized. The multi-dimensional pruning layers will reduce model complexity and size without sacrificing the performance. Three tactile datasets are used for 3 testing the PVT model. Overall, our proposed model achieves higher accuracy on material classification results with a smaller model size compared to the state-of-the-art tactile texture models.
This work was written as a paper and submitted to the International Conference on Intelligent Robots and Systems (IROS) 2022. |
author2 |
Lin Zhiping |
author_facet |
Lin Zhiping Ouyang, Yanjia |
format |
Final Year Project |
author |
Ouyang, Yanjia |
author_sort |
Ouyang, Yanjia |
title |
PVT3: a pruned video-vision transformer for tactile texture classification |
title_short |
PVT3: a pruned video-vision transformer for tactile texture classification |
title_full |
PVT3: a pruned video-vision transformer for tactile texture classification |
title_fullStr |
PVT3: a pruned video-vision transformer for tactile texture classification |
title_full_unstemmed |
PVT3: a pruned video-vision transformer for tactile texture classification |
title_sort |
pvt3: a pruned video-vision transformer for tactile texture classification |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/158296 |
_version_ |
1772828965758566400 |