Keyword extraction on online advertisement using clustering and classification methodology
Keyword advertising is a form of online advertising that an advertiser pays to have an advertisement appear in the results listing when a person uses a phrase to search the web. Selection of keywords is particularly important as they summarize the key characteristics of the advertised products an...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2017
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/70238 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-70238 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-702382023-03-03T20:39:05Z Keyword extraction on online advertisement using clustering and classification methodology Liu, Peng Li Fang School of Computer Science and Engineering Optimate Sun Wenting DRNTU::Library and information science Keyword advertising is a form of online advertising that an advertiser pays to have an advertisement appear in the results listing when a person uses a phrase to search the web. Selection of keywords is particularly important as they summarize the key characteristics of the advertised products and services, and serve as the important factor for advertiser to increase the reach of the advertisement (Ad) and potentially the conversion rate. In my company, Optimate, we provided the services to help clients optimize their online marketing campaign, advertisement placements and customer reach via multiple channels such as Google Adwords and Facebook. Keyword selection remains a crucial component to increase the overall effectiveness and efficiency of the services. In the report, I aim to propose a new keyword extraction approach from the advertisement text, while considering the grammar pattern of the text, historical ads and the other attributes such as industry and objective. The whole approach can be broadly divided into three phases, keyword candidate generation, Clustering using K-Means and K-nearest-neighbour classification. Selection rules on keyword candidates are based on linguistic feature and Part-of-Speech (POS) pattern of the ad content. The aim of keyword candidates is to generate a comprehensive list of possible keywords for subsequent classification. Kmeans clustering divides ads into different groups, and the subsequent classification is performed only on the group which the ad is in. Such way helps reduce the computing complexity and choose the best group which can yield better keywords. Then the TD-IDF feature of the keyword candidates is analysed. Cosine Distance is also computed and inputted into K-nearest-neighbour classification. Based on the majority vote of 20 neighbour keywords, the candidate keyword is classified into either a true keyword or a false keyword. This approach achieves good results in extracting keywords, but there are still issues limiting its effectiveness. Nevertheless, this approach offers a quick, highly flexible, and easily implementable solution to keyword extraction. Bachelor of Engineering (Computer Science) 2017-04-17T09:07:26Z 2017-04-17T09:07:26Z 2017 Final Year Project (FYP) http://hdl.handle.net/10356/70238 en Nanyang Technological University 40 p. application/pdf application/vnd.ms-excel application/octet-stream application/octet-stream application/octet-stream |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Library and information science |
spellingShingle |
DRNTU::Library and information science Liu, Peng Keyword extraction on online advertisement using clustering and classification methodology |
description |
Keyword advertising is a form of online advertising that an advertiser pays to have an
advertisement appear in the results listing when a person uses a phrase to search the web.
Selection of keywords is particularly important as they summarize the key characteristics
of the advertised products and services, and serve as the important factor for advertiser to
increase the reach of the advertisement (Ad) and potentially the conversion rate. In my
company, Optimate, we provided the services to help clients optimize their online
marketing campaign, advertisement placements and customer reach via multiple channels
such as Google Adwords and Facebook. Keyword selection remains a crucial component
to increase the overall effectiveness and efficiency of the services. In the report, I aim to
propose a new keyword extraction approach from the advertisement text, while
considering the grammar pattern of the text, historical ads and the other attributes such as
industry and objective. The whole approach can be broadly divided into three phases,
keyword candidate generation, Clustering using K-Means and K-nearest-neighbour
classification. Selection rules on keyword candidates are based on linguistic feature and
Part-of-Speech (POS) pattern of the ad content. The aim of keyword candidates is to
generate a comprehensive list of possible keywords for subsequent classification. Kmeans
clustering divides ads into different groups, and the subsequent classification is
performed only on the group which the ad is in. Such way helps reduce the computing
complexity and choose the best group which can yield better keywords. Then the TD-IDF
feature of the keyword candidates is analysed. Cosine Distance is also computed and
inputted into K-nearest-neighbour classification. Based on the majority vote of 20
neighbour keywords, the candidate keyword is classified into either a true keyword or a
false keyword. This approach achieves good results in extracting keywords, but there are
still issues limiting its effectiveness. Nevertheless, this approach offers a quick, highly
flexible, and easily implementable solution to keyword extraction. |
author2 |
Li Fang |
author_facet |
Li Fang Liu, Peng |
format |
Final Year Project |
author |
Liu, Peng |
author_sort |
Liu, Peng |
title |
Keyword extraction on online advertisement using clustering and classification methodology |
title_short |
Keyword extraction on online advertisement using clustering and classification methodology |
title_full |
Keyword extraction on online advertisement using clustering and classification methodology |
title_fullStr |
Keyword extraction on online advertisement using clustering and classification methodology |
title_full_unstemmed |
Keyword extraction on online advertisement using clustering and classification methodology |
title_sort |
keyword extraction on online advertisement using clustering and classification methodology |
publishDate |
2017 |
url |
http://hdl.handle.net/10356/70238 |
_version_ |
1759855969043480576 |