Decoding the city: harnessing machine learning to extract building attributes from street-view imagery

Building attribute data can be used to improve the accuracy of hazard risk models, but detailed per-building data is generally unavailable due to the difficulty of collecting and maintaining such data. Machine learning models could be used to automate part of the collection process, by analysing...

Full description

Saved in:
Bibliographic Details
Main Author: Chia, Zhi Yi
Other Authors: David Lallemant
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/174813
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-174813
record_format dspace
spelling sg-ntu-dr.10356-1748132024-04-15T15:31:44Z Decoding the city: harnessing machine learning to extract building attributes from street-view imagery Chia, Zhi Yi David Lallemant Asian School of the Environment dlallemant@ntu.edu.sg Computer and Information Science Earth and Environmental Sciences Machine learning Computer vision Object detection Image classification Attribute prediction Street-view imagery Multi-task learning Building attributes Faster- RCNN Building attribute data can be used to improve the accuracy of hazard risk models, but detailed per-building data is generally unavailable due to the difficulty of collecting and maintaining such data. Machine learning models could be used to automate part of the collection process, by analysing widely available street-view data to locate the buildings and classify their attributes. Prior research has explored the use of object detection models to locate the buildings and image classification models to classify their attributes. However, it may be possible to instead modify an object detection model to accomplish both object detection and attribute prediction, as demonstrated in other fields. To test this possibility, a dataset of street-view images with annotated building attributes was constructed, and several modified versions of an object detection model were tested on the dataset. Another baseline model was constructed based on the approach outlined in prior research and tested on the dataset. Comparing the modified models, the modification with the greatest separation between the object detection and attribute prediction tasks performed the best, likely because of conflicts between the tasks. However, completely separating the tasks, like in the baseline model, only slightly improves object detection performance at the cost of substantially worsened attribute prediction performance on our dataset. Hence, the modified object detection model approach is superior for retrieving building attribute data, at least on this dataset. The potential of such an approach should be further explored, and more extensively verified by testing on more data. Bachelor's degree 2024-04-12T00:12:32Z 2024-04-12T00:12:32Z 2024 Final Year Project (FYP) Chia, Z. Y. (2024). Decoding the city: harnessing machine learning to extract building attributes from street-view imagery. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/174813 https://hdl.handle.net/10356/174813 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Earth and Environmental Sciences
Machine learning
Computer vision
Object detection
Image classification
Attribute prediction
Street-view imagery
Multi-task learning
Building attributes
Faster- RCNN
spellingShingle Computer and Information Science
Earth and Environmental Sciences
Machine learning
Computer vision
Object detection
Image classification
Attribute prediction
Street-view imagery
Multi-task learning
Building attributes
Faster- RCNN
Chia, Zhi Yi
Decoding the city: harnessing machine learning to extract building attributes from street-view imagery
description Building attribute data can be used to improve the accuracy of hazard risk models, but detailed per-building data is generally unavailable due to the difficulty of collecting and maintaining such data. Machine learning models could be used to automate part of the collection process, by analysing widely available street-view data to locate the buildings and classify their attributes. Prior research has explored the use of object detection models to locate the buildings and image classification models to classify their attributes. However, it may be possible to instead modify an object detection model to accomplish both object detection and attribute prediction, as demonstrated in other fields. To test this possibility, a dataset of street-view images with annotated building attributes was constructed, and several modified versions of an object detection model were tested on the dataset. Another baseline model was constructed based on the approach outlined in prior research and tested on the dataset. Comparing the modified models, the modification with the greatest separation between the object detection and attribute prediction tasks performed the best, likely because of conflicts between the tasks. However, completely separating the tasks, like in the baseline model, only slightly improves object detection performance at the cost of substantially worsened attribute prediction performance on our dataset. Hence, the modified object detection model approach is superior for retrieving building attribute data, at least on this dataset. The potential of such an approach should be further explored, and more extensively verified by testing on more data.
author2 David Lallemant
author_facet David Lallemant
Chia, Zhi Yi
format Final Year Project
author Chia, Zhi Yi
author_sort Chia, Zhi Yi
title Decoding the city: harnessing machine learning to extract building attributes from street-view imagery
title_short Decoding the city: harnessing machine learning to extract building attributes from street-view imagery
title_full Decoding the city: harnessing machine learning to extract building attributes from street-view imagery
title_fullStr Decoding the city: harnessing machine learning to extract building attributes from street-view imagery
title_full_unstemmed Decoding the city: harnessing machine learning to extract building attributes from street-view imagery
title_sort decoding the city: harnessing machine learning to extract building attributes from street-view imagery
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/174813
_version_ 1800916317029531648