Exploring tweets normalization and query time sensitivity for Twitter search

This paper presents our work for the Realtime Adhoc Task of TREC 2011 Microblog Track. Microblog texts like tweets are generally characterized by the inclusion of a large proportion of irregular expressions, such as ill-formed words, which can lead to significant mismatch between query terms and twe...

Full description

Saved in:
Bibliographic Details
Main Authors: WEI, Zhongyu, GAO, Wei, ZHOU, Lanjun, LI, Binyang, WONG, Kam-Fai
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2011
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4644
https://ink.library.smu.edu.sg/context/sis_research/article/5647/viewcontent/SEEM_CUHK.microblog.update.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5647
record_format dspace
spelling sg-smu-ink.sis_research-56472020-01-02T08:27:29Z Exploring tweets normalization and query time sensitivity for Twitter search WEI, Zhongyu GAO, Wei ZHOU, Lanjun LI, Binyang WONG, Kam-Fai This paper presents our work for the Realtime Adhoc Task of TREC 2011 Microblog Track. Microblog texts like tweets are generally characterized by the inclusion of a large proportion of irregular expressions, such as ill-formed words, which can lead to significant mismatch between query terms and tweets. In addition, Twitter queries are distinguished from Web queries with many unique characteristics, one of which reflects the clearly distinct temporal aspects of Twitter search behavior. In this study, we deal with the first problem by normalizing tweet texts and the second by capturing the temporal characteristics of topic. We divided topics into two categories: time-sensitive and time-insensitive. For the time-sensitive ones, we introduce a decay factor to adjust the relevance score of results according to the expected date of the topical event to happen, and then re-rank the search results. Experiments demonstrate that our methods are significantly better than baseline and outperform the medium of all runs. 2011-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4644 https://ink.library.smu.edu.sg/context/sis_research/article/5647/viewcontent/SEEM_CUHK.microblog.update.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Social Media Theory and Algorithms
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Social Media
Theory and Algorithms
spellingShingle Social Media
Theory and Algorithms
WEI, Zhongyu
GAO, Wei
ZHOU, Lanjun
LI, Binyang
WONG, Kam-Fai
Exploring tweets normalization and query time sensitivity for Twitter search
description This paper presents our work for the Realtime Adhoc Task of TREC 2011 Microblog Track. Microblog texts like tweets are generally characterized by the inclusion of a large proportion of irregular expressions, such as ill-formed words, which can lead to significant mismatch between query terms and tweets. In addition, Twitter queries are distinguished from Web queries with many unique characteristics, one of which reflects the clearly distinct temporal aspects of Twitter search behavior. In this study, we deal with the first problem by normalizing tweet texts and the second by capturing the temporal characteristics of topic. We divided topics into two categories: time-sensitive and time-insensitive. For the time-sensitive ones, we introduce a decay factor to adjust the relevance score of results according to the expected date of the topical event to happen, and then re-rank the search results. Experiments demonstrate that our methods are significantly better than baseline and outperform the medium of all runs.
format text
author WEI, Zhongyu
GAO, Wei
ZHOU, Lanjun
LI, Binyang
WONG, Kam-Fai
author_facet WEI, Zhongyu
GAO, Wei
ZHOU, Lanjun
LI, Binyang
WONG, Kam-Fai
author_sort WEI, Zhongyu
title Exploring tweets normalization and query time sensitivity for Twitter search
title_short Exploring tweets normalization and query time sensitivity for Twitter search
title_full Exploring tweets normalization and query time sensitivity for Twitter search
title_fullStr Exploring tweets normalization and query time sensitivity for Twitter search
title_full_unstemmed Exploring tweets normalization and query time sensitivity for Twitter search
title_sort exploring tweets normalization and query time sensitivity for twitter search
publisher Institutional Knowledge at Singapore Management University
publishDate 2011
url https://ink.library.smu.edu.sg/sis_research/4644
https://ink.library.smu.edu.sg/context/sis_research/article/5647/viewcontent/SEEM_CUHK.microblog.update.pdf
_version_ 1770574947069984768