Online learning for search and classification
Online learning is a common and useful tool for machine learning and data mining. In contrast to batch learning, online learning receives a sequence of training instances and uses some of them at a time. By the nature of online learning, the training instances may be processed only once. Therefore o...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Theses and Dissertations |
Language: | English |
Published: |
2014
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/55287 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-55287 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-552872023-03-04T00:37:30Z Online learning for search and classification Nguyen, Thanh Tam Chang Kuiyu Hui Siu Cheung School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing Online learning is a common and useful tool for machine learning and data mining. In contrast to batch learning, online learning receives a sequence of training instances and uses some of them at a time. By the nature of online learning, the training instances may be processed only once. Therefore online learning algorithms can work on big data beyond the memory or disk capacity as well as streaming data. Moreover in document classification, online linear learning has been shown to be much more efficient than non-linear learning in terms of training and testing time. Therefore, online linear learning has recently become an active research topic. This thesis proposes a research framework that attempts to solve the search and classification problems based on the online linear learning approaches. Specifically, we have proposed online learning classification algorithms that are able to work on multiple view datasets and an online learning-to-rank algorithm that improves the accuracy of a search engine. The main research contributions are listed as follows. (i) Feature selection: we have investigated a number of newly supervised term weighting methods to improve the performance of text classification; (ii) Online classification: we have proposed several online learning algorithms that can be used for topic classification; (iii) Two-view online learning: we have proposed a two-view online learning algorithm, which can work on two-view datasets; (iv) Online learning-to-rank: for search engine, we have proposed an online learning-to-rank algorithm, which was to learn a scoring function to re-rank the search result. DOCTOR OF PHILOSOPHY (SCE) 2014-01-10T04:41:59Z 2014-01-10T04:41:59Z 2013 2013 Thesis Nguyen, T. T. (2013). Online learning for search and classification. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/55287 10.32657/10356/55287 en 125 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing |
spellingShingle |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing Nguyen, Thanh Tam Online learning for search and classification |
description |
Online learning is a common and useful tool for machine learning and data mining. In contrast to batch learning, online learning receives a sequence of training instances and uses some of them at a time. By the nature of online learning, the training instances may be processed only once. Therefore online learning algorithms can work on big data beyond the memory or disk capacity as well as streaming data. Moreover in document classification, online linear learning has been shown to be much more efficient than non-linear learning in terms of training and testing time. Therefore, online linear learning has recently become an active research topic. This thesis proposes a research framework that attempts to solve the search and classification problems based on the online linear learning approaches. Specifically, we have proposed online learning classification algorithms that are able to work on multiple view datasets and an online learning-to-rank algorithm that improves the accuracy of a search engine. The main research contributions are listed as follows. (i) Feature selection: we have investigated a number of newly supervised term weighting methods to improve the performance of text classification; (ii) Online classification: we have proposed several online learning algorithms that can be used for topic classification; (iii) Two-view online learning: we have proposed a two-view online learning algorithm, which can work on two-view datasets; (iv) Online learning-to-rank: for search engine, we have proposed an online learning-to-rank algorithm, which was to learn a scoring function to re-rank the search result. |
author2 |
Chang Kuiyu |
author_facet |
Chang Kuiyu Nguyen, Thanh Tam |
format |
Theses and Dissertations |
author |
Nguyen, Thanh Tam |
author_sort |
Nguyen, Thanh Tam |
title |
Online learning for search and classification |
title_short |
Online learning for search and classification |
title_full |
Online learning for search and classification |
title_fullStr |
Online learning for search and classification |
title_full_unstemmed |
Online learning for search and classification |
title_sort |
online learning for search and classification |
publishDate |
2014 |
url |
https://hdl.handle.net/10356/55287 |
_version_ |
1759858056279097344 |