Deep learning for X-ray image analysis

Deep learning is a branch of machine learning, which is an efficient way to achieve the goal, artificial intelligence. It takes the idea of human neural networks and learns the features of a large dataset, which contributes greatly to language and image analysis. This dissertation applied deep learn...

Full description

Saved in:

Bibliographic Details
Main Author:	Ren, Bing
Other Authors:	Wen Bihan
Format:	Thesis-Master by Coursework
Language:	English
Published:	Nanyang Technological University 2020
Subjects:	Engineering::Electrical and electronic engineering
Online Access:	https://hdl.handle.net/10356/141477
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-141477
record_format	dspace
spelling	sg-ntu-dr.10356-1414772023-07-04T15:35:55Z Deep learning for X-ray image analysis Ren, Bing Wen Bihan School of Electrical and Electronic Engineering bihan.wen@ntu.edu.sg Engineering::Electrical and electronic engineering Deep learning is a branch of machine learning, which is an efficient way to achieve the goal, artificial intelligence. It takes the idea of human neural networks and learns the features of a large dataset, which contributes greatly to language and image analysis. This dissertation applied deep learning for X-Ray image analysis is to utilize deep learning methods to implement object detection. Security checking for bags and luggage at airports and the real-way station is done by the staff, which is inefficient and requires heavy human workload. The dissertation aims to solve this problem. The outcome of the dissertation is a system that achieves automated detection of prohibited items. It aims to locate the prohibited items and identifies their categories. Raw X-ray images of security checking are already prepared by researches, i.e., the large-scale X-ray images dataset for security inspection which is publicly available. It contains 6 categories of prohibited objects. To achieve position detection some pre-processing (labeling) is required to allocate the position and classification of objects. The dissertation gave a review of different deep learning methods for object detection, like R-CNN, YOLO, and SSD. With labeled X-Ray images, the experiments used SSD and YOLO respectively to implement prohibited item detection and made a comparison of their task performances, such as accuracy and time consumed. As the exiting dataset of the annotation is limited, it could result in overfitting. The dissertation tries to figure out some methods for improvement, like reducing the network complexity, data augment, and early stopping. When using the exciting label datasets provide by other experts, the SSD network and YOLOv3 can achieve 0.1683 and 0.1457 mAP. The experiments were carried on ii making some improvements to the performance of SSD. The existing dataset contains some inaccurate labels and mistaken label. Another dataset was prepared which is labeled manually with careful selection of labels and the position of bounding boxes. It helped to raise the mAP of SSD to 0.2156. Later, the further experiments were conducted tried to improve the class imbalance by manipulating datasets, which further improved the mAP of SSD by 0.0109. Besides, the experiments also utilized a resized smaller feature extraction network by fixing a certain convolutional layer of VGG16, which can be interpreted as reducing the network complexity. The project used the existing dataset prepared by researchers to train the resized SSD network, and the result approved that, for this particular X-ray project, this method did not have much effect on the mAP of SSD models. Moreover, the dissertation provided some perspective for further development. In the real scenario, only a small percentage of luggage contains prohibited items, which is regarded as a positive sample. Those carrying no prohibited items are negative samples. Thus, it is necessary to take the ratio of positive samples and negative samples into consideration. To further improve the performance of the object detection model, a dataset can be constructed with a certain proportion of positive and negative samples, that mimics the real scenario of prohibited item detection at security checkpoints. Master of Science (Signal Processing) 2020-06-08T10:45:17Z 2020-06-08T10:45:17Z 2020 Thesis-Master by Coursework https://hdl.handle.net/10356/141477 en ISM-DISS-01825 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering
spellingShingle	Engineering::Electrical and electronic engineering Ren, Bing Deep learning for X-ray image analysis
description	Deep learning is a branch of machine learning, which is an efficient way to achieve the goal, artificial intelligence. It takes the idea of human neural networks and learns the features of a large dataset, which contributes greatly to language and image analysis. This dissertation applied deep learning for X-Ray image analysis is to utilize deep learning methods to implement object detection. Security checking for bags and luggage at airports and the real-way station is done by the staff, which is inefficient and requires heavy human workload. The dissertation aims to solve this problem. The outcome of the dissertation is a system that achieves automated detection of prohibited items. It aims to locate the prohibited items and identifies their categories. Raw X-ray images of security checking are already prepared by researches, i.e., the large-scale X-ray images dataset for security inspection which is publicly available. It contains 6 categories of prohibited objects. To achieve position detection some pre-processing (labeling) is required to allocate the position and classification of objects. The dissertation gave a review of different deep learning methods for object detection, like R-CNN, YOLO, and SSD. With labeled X-Ray images, the experiments used SSD and YOLO respectively to implement prohibited item detection and made a comparison of their task performances, such as accuracy and time consumed. As the exiting dataset of the annotation is limited, it could result in overfitting. The dissertation tries to figure out some methods for improvement, like reducing the network complexity, data augment, and early stopping. When using the exciting label datasets provide by other experts, the SSD network and YOLOv3 can achieve 0.1683 and 0.1457 mAP. The experiments were carried on ii making some improvements to the performance of SSD. The existing dataset contains some inaccurate labels and mistaken label. Another dataset was prepared which is labeled manually with careful selection of labels and the position of bounding boxes. It helped to raise the mAP of SSD to 0.2156. Later, the further experiments were conducted tried to improve the class imbalance by manipulating datasets, which further improved the mAP of SSD by 0.0109. Besides, the experiments also utilized a resized smaller feature extraction network by fixing a certain convolutional layer of VGG16, which can be interpreted as reducing the network complexity. The project used the existing dataset prepared by researchers to train the resized SSD network, and the result approved that, for this particular X-ray project, this method did not have much effect on the mAP of SSD models. Moreover, the dissertation provided some perspective for further development. In the real scenario, only a small percentage of luggage contains prohibited items, which is regarded as a positive sample. Those carrying no prohibited items are negative samples. Thus, it is necessary to take the ratio of positive samples and negative samples into consideration. To further improve the performance of the object detection model, a dataset can be constructed with a certain proportion of positive and negative samples, that mimics the real scenario of prohibited item detection at security checkpoints.
author2	Wen Bihan
author_facet	Wen Bihan Ren, Bing
format	Thesis-Master by Coursework
author	Ren, Bing
author_sort	Ren, Bing
title	Deep learning for X-ray image analysis
title_short	Deep learning for X-ray image analysis
title_full	Deep learning for X-ray image analysis
title_fullStr	Deep learning for X-ray image analysis
title_full_unstemmed	Deep learning for X-ray image analysis
title_sort	deep learning for x-ray image analysis
publisher	Nanyang Technological University
publishDate	2020
url	https://hdl.handle.net/10356/141477
_version_	1772825432811372544

Deep learning for X-ray image analysis

Similar Items