Visual event recognition

This report summarizes the work that has been done in the final year project of recognizing visual events in videos. It starts with image recognition, in which im- ages are represented in spatial pyramids. Such representations are then input into SVM and KNN for recognition. In video recognition, ba...

全面介紹

Saved in:
書目詳細資料
主要作者: Gong, Li.
其他作者: Xu Dong
格式: Final Year Project
語言:English
出版: 2013
主題:
在線閱讀:http://hdl.handle.net/10356/55095
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
實物特徵
總結:This report summarizes the work that has been done in the final year project of recognizing visual events in videos. It starts with image recognition, in which im- ages are represented in spatial pyramids. Such representations are then input into SVM and KNN for recognition. In video recognition, bag of words and special- ized Gaussian Mixture Models are employed to represent videos, and respective distance calculation is used to measure video-to-video distance. These distance matrices are then input into SVM for recognition using different kernel types. Also, four domain adaptation methods are implemented to recognize Kodak con- sumer videos using Youtube videos. Adaptive multiple kernel learning achieves the best and improves the mean average precision from 44.33% to 61.40%. Last but not least, a web-based demo system is implemented in two modes to visually demonstrate the underlying recognition system.