Events

Home » News » Events » Content

Raymond Y. K. LAU,Associate Professor in the Department of Information Systems at City University of Hong Kong: Pseudo Labeling for the Construction of Domain-specific Sentiment Lexicons and Its Financial Applications

2013-04-07
View:

【Abstract】Sentiment lexicons have been widely used for sentiment analysis. However, manually constructing domain-specific sentiment lexicons is extremely time consuming and it may not even be feasible for domains where linguistic expertise is not available. Research on the automatic construction of domain-specific sentiment lexicons has become a hot topic in recent years. In this presentation, the research work about our semi-supervised learning method which exploits the “distributional characteristic” of sentiments in labeled or unlabeled corpora for the construction of domain-specific sentiment lexicons will be discussed. More specifically, the proposed two-pass “pseudo labeling” algorithm combines shallow linguistic parsing and corpus-base statistical learning to make sentiment lexicon learning scalable with respect to the sheer volume of opinionated documents archived on the Internet these days. As subjectively assessed by human experts, the automatically constructed domain-specific sentiment lexicons are considered to have high quality. Based on an objective polarity prediction task at the document level, it is shown that our domain-specific sentiment lexicons outperform other well-known baseline methods. Finally, the applications of our domain-specific sentiment lexicons to financial prediction tasks are highlighted and the business implications of our research work are discussed.