IMPLEMENTATION OF YOLOV5 AND MASK R-CNN OBJECT DETECTION MODEL IN REAL TIME MOBILE APPLICATION FOR LOW VISION USERS
Low vision is partial vision loss that can’t be corrected with glasses, contacts or surgery. It is not blindness who experience total vision loss. People who have low vision usually have blurry vision, therefore it is harder for them to do every day activities. Based on that problem statement, on...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/69191 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:69191 |
---|---|
spelling |
id-itb.:691912022-09-20T21:06:11ZIMPLEMENTATION OF YOLOV5 AND MASK R-CNN OBJECT DETECTION MODEL IN REAL TIME MOBILE APPLICATION FOR LOW VISION USERS Anindya Riyadi, Inka Indonesia Final Project object detection, transfer learning, Mask R-CNN, YOLOv5, mobile application INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/69191 Low vision is partial vision loss that can’t be corrected with glasses, contacts or surgery. It is not blindness who experience total vision loss. People who have low vision usually have blurry vision, therefore it is harder for them to do every day activities. Based on that problem statement, one of the latest applications of deep learning that can help low vision is object detection. However, several published mobile applications that helps low vision does not have capability to detect object by real time and none of it supports Bahasa Indonesia. Unfortunately, this caused the application can only be used for users who understands English. This research covers the implementation of Mask R-CNN and YOLOv5 using transfer learning method. Both models are trained but only one of the models will be implemented on the mobile application. The research will conduct experiment using various optimizer including SGD, Adam, AdamW when training the models to find which optimizer helps model to have the best performance. The result of the experiment is compared and analyzed in server. YOLOv5s – SGD have the highest mAP score with 0.39401 and YOLOv5s – AdamW have the fastest inference time. Both model with optimizer is further compared and analyzed in mobile. YOLOv5s – SGD inference time (930 ms) is faster than YOLOv5s – AdamW (3037 ms). Therefore, YOLOv5s – SGD is chosen as the best performing model and implemented on the mobile application. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Low vision is partial vision loss that can’t be corrected with glasses,
contacts or surgery. It is not blindness who experience total vision loss. People who
have low vision usually have blurry vision, therefore it is harder for them to do
every day activities. Based on that problem statement, one of the latest applications
of deep learning that can help low vision is object detection. However, several
published mobile applications that helps low vision does not have capability to
detect object by real time and none of it supports Bahasa Indonesia. Unfortunately,
this caused the application can only be used for users who understands English.
This research covers the implementation of Mask R-CNN and YOLOv5
using transfer learning method. Both models are trained but only one of the models
will be implemented on the mobile application. The research will conduct
experiment using various optimizer including SGD, Adam, AdamW when training
the models to find which optimizer helps model to have the best performance. The
result of the experiment is compared and analyzed in server. YOLOv5s – SGD have
the highest mAP score with 0.39401 and YOLOv5s – AdamW have the fastest
inference time. Both model with optimizer is further compared and analyzed in
mobile. YOLOv5s – SGD inference time (930 ms) is faster than YOLOv5s –
AdamW (3037 ms). Therefore, YOLOv5s – SGD is chosen as the best performing
model and implemented on the mobile application. |
format |
Final Project |
author |
Anindya Riyadi, Inka |
spellingShingle |
Anindya Riyadi, Inka IMPLEMENTATION OF YOLOV5 AND MASK R-CNN OBJECT DETECTION MODEL IN REAL TIME MOBILE APPLICATION FOR LOW VISION USERS |
author_facet |
Anindya Riyadi, Inka |
author_sort |
Anindya Riyadi, Inka |
title |
IMPLEMENTATION OF YOLOV5 AND MASK R-CNN OBJECT DETECTION MODEL IN REAL TIME MOBILE APPLICATION FOR LOW VISION USERS |
title_short |
IMPLEMENTATION OF YOLOV5 AND MASK R-CNN OBJECT DETECTION MODEL IN REAL TIME MOBILE APPLICATION FOR LOW VISION USERS |
title_full |
IMPLEMENTATION OF YOLOV5 AND MASK R-CNN OBJECT DETECTION MODEL IN REAL TIME MOBILE APPLICATION FOR LOW VISION USERS |
title_fullStr |
IMPLEMENTATION OF YOLOV5 AND MASK R-CNN OBJECT DETECTION MODEL IN REAL TIME MOBILE APPLICATION FOR LOW VISION USERS |
title_full_unstemmed |
IMPLEMENTATION OF YOLOV5 AND MASK R-CNN OBJECT DETECTION MODEL IN REAL TIME MOBILE APPLICATION FOR LOW VISION USERS |
title_sort |
implementation of yolov5 and mask r-cnn object detection model in real time mobile application for low vision users |
url |
https://digilib.itb.ac.id/gdl/view/69191 |
_version_ |
1822990946072002560 |