Chenghua Lin | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Chenghua Lin is active.

Explore More

Publication

Featured researches published by Chenghua Lin.

ACM Transactions on Intelligent Systems and Technology | 2013

Dynamic joint sentiment-topic model

Yulan He; Chenghua Lin; Wei Gao; Kam-Fai Wong

Social media data are produced continuously by a large and uncontrolled number of users. The dynamic nature of such data requires the sentiment and topic analysis model to be also dynamically updated, capturing the most recent language use of sentiments and topics in text. We propose a dynamic Joint Sentiment-Topic model (dJST) which allows the detection and tracking of views of current and recurrent interests and shifts in topic and sentiment. Both topic and sentiment dynamics are captured by assuming that the current sentiment-topic-specific word distributions are generated according to the word distributions at previous epochs. We study three different ways of accounting for such dependency information: (1) sliding window where the current sentiment-topic word distributions are dependent on the previous sentiment-topic-specific word distributions in the last S epochs; (2) skip model where history sentiment topic word distributions are considered by skipping some epochs in between; and (3) multiscale model where previous long- and short- timescale distributions are taken into consideration. We derive efficient online inference procedures to sequentially update the model with newly arrived data and show the effectiveness of our proposed model on the Mozilla add-on reviews crawled between 2007 and 2011.

international semantic web conference | 2012

Feature LDA: a supervised topic model for automatic detection of web API documentations from the web

Chenghua Lin; Yulan He; Carlos Pedrinaci; John Domingue

Web APIs have gained increasing popularity in recent Web service technology development owing to its simplicity of technology stack and the proliferation of mashups. However, efficiently discovering Web APIs and the relevant documentations on the Web is still a challenging task even with the best resources available on the Web. In this paper we cast the problem of detecting the Web API documentations as a text classification problem of classifying a given Web page as Web API associated or not. We propose a supervised generative topic model called feature latent Dirichlet allocation (feaLDA) which offers a generic probabilistic framework for automatic detection of Web APIs. feaLDA not only captures the correspondence between data and the associated class labels, but also provides a mechanism for incorporating side information such as labelled features automatically learned from data that can effectively help improving classification performance. Extensive experiments on our Web APIs documentation dataset shows that the feaLDA model outperforms three strong supervised baselines including naive Bayes, support vector machines, and the maximum entropy model, by over 3% in classification accuracy. In addition, feaLDA also gives superior performance when compared against other existing supervised topic models.

privacy security risk and trust | 2012

Online Sentiment and Topic Dynamics Tracking over the Streaming Data

Yulan He; Chenghua Lin; Amparo Elizabeth Cano

We propose a dynamic joint sentiment-topic model (dJST) which is able to effectively track sentiment and topic dynamics over the streaming data. Both topic and sentiment dynamics are captured by assuming that the current sentiment-topic specific word distributions are generated according to the word distributions at previous epochs. We study three different ways of accounting for such dependency information, (1) Sliding window where the current sentiment-topic-word distributions are dependent on the previous sentiment-topic specific word distributions in the last S epochs; (2) Skip model where history sentiment-topic-word distributions are considered by skipping some epochs in between; and (3) Multiscale model where previous long-and short-timescale distributions are taken into consideration. We derive efficient online inference procedures to sequentially update the model with newly arrived data and show the effectiveness of our proposed model on the Mozilla add-on reviews crawled between 2007 and 2011.

applications of natural language to data bases | 2018

Assessing the Effectiveness of Affective Lexicons for Depression Classification

Noor Fazilla Binti Abd Yusof; Chenghua Lin; Frank Guerin

Affective lexicons have been commonly used as lexical features for depression classification, but their effectiveness is relatively unexplored in the literature. In this paper, we investigate the effectiveness of three popular affective lexicons in the task of depression classification. We also develop two lexical feature engineering strategies for incorporating those lexicons into a supervised classifier. The effectiveness of different lexicons and feature engineering strategies are evaluated on a depression dataset collected from LiveJournal.

applications of natural language to data bases | 2017

Automatically Labelling Sentiment-Bearing Topics with Descriptive Sentence Labels

Mohamad Hardyman Bin Barawi; Chenghua Lin; Advaith Siddharthan

In this paper, we propose a simple yet effective approach for automatically labelling sentiment-bearing topics with descriptive sentence labels. Specifically, our approach consists of two components: (i) a mechanism which can automatically learn the relevance to sentiment-bearing topics of the underlying sentences in a corpus; and (ii) a sentence ranking algorithm for label selection that jointly considers topic-sentence relevance as well as aspect and sentiment co-coverage. To our knowledge, we are the first to study the problem of labelling sentiment-bearing topics. Our experimental results show that our approach outperforms four strong baselines and demonstrates the effectiveness of our sentence labels in facilitating topic understanding and interpretation.

international joint conference on natural language processing | 2011