Unsupervised action segmentation in videos with clustering algorithms

This project sought to develop a system that performs unsupervised action segmentation in videos. Users are able to perform feature extraction on a desired raw video file as preprocessing for action segmentation, followed by segmenting unlabelled instructional videos according to distinct acti...

Full description

Saved in:
Bibliographic Details
Main Author: Lim, Isaac Sheng Yang
Other Authors: Yeo Chai Kiat
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/174834
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-174834
record_format dspace
spelling sg-ntu-dr.10356-1748342024-04-19T15:44:37Z Unsupervised action segmentation in videos with clustering algorithms Lim, Isaac Sheng Yang Yeo Chai Kiat School of Computer Science and Engineering ASCKYEO@ntu.edu.sg Computer and Information Science Computer vision Clustering Action segmentation This project sought to develop a system that performs unsupervised action segmentation in videos. Users are able to perform feature extraction on a desired raw video file as preprocessing for action segmentation, followed by segmenting unlabelled instructional videos according to distinct actions (action segmentation). The project utilised 3 datasets: the Breakfast Actions dataset, the 50 Salads dataset, and the YouTube Instruction Videos dataset, to perform tests and measure the performance of 5 different clustering algorithms. The videos from the datasets were pre-processed by resampling the videos to a common video codec and a frame rate that matched the dimensions of their respective ground truth labels. The features from the resampled videos in these datasets were extracted utilising a Bag-Of-Features model, in which 2 different feature extraction methods: Oriented FAST and Rotated BRIEF (ORB) and Scale-Invariant Feature Transform (SIFT)—were compared to find the better feature extraction algorithm for this action segmentation task. The extracted features were then passed to 5 clustering algorithms to analyse and compare their performance during exploration. These 5 clustering algorithms were: Temporally Weighted First NN Clustering Hierarchy (TW-FINCH), Action Boundary Detection (ABD), Spectral Clustering (SPECTRAL), Ordering Points To Identify the Clustering Structure (OPTICS), and Density-Based Spatial Clustering of Applications with Noise (DBSCAN). Of the 5 clustering algorithms, TW-FINCH was found to have the best performance for the action segmentation task. It was also found that the ORB feature descriptors provided good performance in terms of speed without drastically reducing accuracy, despite research suggesting ORB's inferiority in robustness compared to SIFT descriptors. Based on the exploration and comparisons made, a system was proposed for unsupervised action segmentations on raw video files, with the output being the labelled videos after clustering. Bachelor's degree 2024-04-15T00:32:22Z 2024-04-15T00:32:22Z 2024 Final Year Project (FYP) Lim, I. S. Y. (2024). Unsupervised action segmentation in videos with clustering algorithms. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/174834 https://hdl.handle.net/10356/174834 en SCSE23-0438 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Computer vision
Clustering
Action segmentation
spellingShingle Computer and Information Science
Computer vision
Clustering
Action segmentation
Lim, Isaac Sheng Yang
Unsupervised action segmentation in videos with clustering algorithms
description This project sought to develop a system that performs unsupervised action segmentation in videos. Users are able to perform feature extraction on a desired raw video file as preprocessing for action segmentation, followed by segmenting unlabelled instructional videos according to distinct actions (action segmentation). The project utilised 3 datasets: the Breakfast Actions dataset, the 50 Salads dataset, and the YouTube Instruction Videos dataset, to perform tests and measure the performance of 5 different clustering algorithms. The videos from the datasets were pre-processed by resampling the videos to a common video codec and a frame rate that matched the dimensions of their respective ground truth labels. The features from the resampled videos in these datasets were extracted utilising a Bag-Of-Features model, in which 2 different feature extraction methods: Oriented FAST and Rotated BRIEF (ORB) and Scale-Invariant Feature Transform (SIFT)—were compared to find the better feature extraction algorithm for this action segmentation task. The extracted features were then passed to 5 clustering algorithms to analyse and compare their performance during exploration. These 5 clustering algorithms were: Temporally Weighted First NN Clustering Hierarchy (TW-FINCH), Action Boundary Detection (ABD), Spectral Clustering (SPECTRAL), Ordering Points To Identify the Clustering Structure (OPTICS), and Density-Based Spatial Clustering of Applications with Noise (DBSCAN). Of the 5 clustering algorithms, TW-FINCH was found to have the best performance for the action segmentation task. It was also found that the ORB feature descriptors provided good performance in terms of speed without drastically reducing accuracy, despite research suggesting ORB's inferiority in robustness compared to SIFT descriptors. Based on the exploration and comparisons made, a system was proposed for unsupervised action segmentations on raw video files, with the output being the labelled videos after clustering.
author2 Yeo Chai Kiat
author_facet Yeo Chai Kiat
Lim, Isaac Sheng Yang
format Final Year Project
author Lim, Isaac Sheng Yang
author_sort Lim, Isaac Sheng Yang
title Unsupervised action segmentation in videos with clustering algorithms
title_short Unsupervised action segmentation in videos with clustering algorithms
title_full Unsupervised action segmentation in videos with clustering algorithms
title_fullStr Unsupervised action segmentation in videos with clustering algorithms
title_full_unstemmed Unsupervised action segmentation in videos with clustering algorithms
title_sort unsupervised action segmentation in videos with clustering algorithms
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/174834
_version_ 1814047086734213120