Twitter-LDA

Latent Dirichlet Allocation (LDA) has been widely used in textual analysis. The original LDA is used to find hidden "topics" in the documents, where a topic is a subject like "arts" or "education" that is discussed in the documents. The original setting in LDA, where ea...

全面介紹

Saved in:

書目詳細資料
Main Authors:	ZHAO, Wayne Xin, JIANG, Jing, WENG, Jianshu, HE, Jing, LIM, Ee Peng, YAN, Hongfei, LI, Xiaoming
格式:	text
出版:	Institutional Knowledge at Singapore Management University 2011
主題:	Computer Sciences Databases and Information Systems
在線閱讀:	https://ink.library.smu.edu.sg/researchdata/12 https://github.com/minghui/Twitter-LDA
標簽:	添加標簽沒有標簽, 成為第一個標記此記錄!
機構:	Singapore Management University

實物特徵
總結:	Latent Dirichlet Allocation (LDA) has been widely used in textual analysis. The original LDA is used to find hidden "topics" in the documents, where a topic is a subject like "arts" or "education" that is discussed in the documents. The original setting in LDA, where each word has a topic label, may not work well with Twitter as tweets are short and a single tweet is more likely to talk about one topic. Hence, Twitter-LDA (T-LDA) has been proposed to address this issue. T-LDA also addresses the noisy nature of tweets, where it captures background words in tweets. As experiments in [7] have shown that T-LDA could capture more meaningful topics than LDA in Microblogs. The original setting in Latent Dirichlet Allocation (LDA), where each word has a topic label, may not work well with Twitter as tweets are short and a single tweet is more likely to talk about one topic. Hence, Twitter-LDA (T-LDA) has been proposed to address this issue. T-LDA also addresses the noisy nature of tweets, where it captures background words in tweets.

Twitter-LDA

相似書籍