Abstract
Sentiment analysis in various languages has been a hot research topic with several applications. Most of the existing models have been reported to work well with widely used language. Were the lass directly applying these models to poor-quality corpora often leads to low results. Thus, to deal with these shortcoming we propose a cross-lingual sentiment topic model evolution over time (CLSTOT) which jointly models time with topic and sentiment. In CLSTOT, we consider the mapping between sentiment-aware topics under different cultures and analyze their evolution over time. The topic-specific sentiment is extracted using the entire data and not for each single document. As long as providing sentiment-topic, we can predict the timestamps for each test document by finding its most likely location over the timeline. This is achieved by using inference algorithm which is based on Gibbs Sampling. The experimental results on Chinese and English newsreader dataset; Chinese from SinaNews2, and English from Yahoo1, show that CLSTOT achieves significant improvement over the state-of-the-art.
Keywords
Introduction
Cross-lingual sentiment analysis is a hot research topic nowadays,and plays a significant role in creating links between articles written in different languages [36]. Sentiment analysis is the approach we used for extracting information from topics [23]. The topics are mostly based on subjective positions,therefore sentiment is usually issued about a topic. The sentiment analysis in topic is carried out at different levels which are sentence, document and aspect levels [20, 24, 6, 9, 33, 37, 26, 27]. Recently, probabilistic topic models that collectively detect aspects and sentiments have gained much success in determine topic evolution over time [14, 32, 19]. The probabilistic topic models are important for two reasons. Firstly, they provide us with an unsupervised way for discovering topics from documents. Secondly, they result in language models that can show how much words are related to topics and possibly lead to a sentiment. The consumer usually talks about aspects rather than things. Owing to the difficulties and challenges in dealing with such huge tasks, scientists argue that the idea of using sentiment analysis was useful especially in such circumstances[16, 15, 13]. While other state that it is essential to consider the aspects-based sentiment analysis classification by using time [1].
In recent years, developing topic models to explore topic evolution over time has been discussed in many topic applications[30]. Some studies only examined topic and its variation over time rather than discussing the joint modeling word co-occurrence and time [31]. While Sgonet always concentrates on an alternative generative process description of TOT, which is better suited to generate an unseen document, although starting Gibbs sampling using parameters from the previous time slice[28]. Similarly, The Time Mines system[29] for TDT task (a single topic in each document) constructs overview time-lines of a set of news stories. Dynamic Topic Model (DTM) considered as another extension to LDA [9]. is used to model time series data applications. In [26, 27, 14, 19, 16], some pilot studies on cross-lingual sentiment classification were done and had significant impact with various applications. However, most of these studies focused on extracting sentiments about topics in a static way and some of them ignored to use time in the textual data. The intent of these studies was to analyze and evaluate the contents of the topics, but they neglected the area of the topic-sentiment associations between the levels without progressing in the field of the scientific research[19, 31, 11].
Owing to the previous learning observations in this field, we intend to develop a novel CLSTOT approach to extract topic-sentiment associations from the text as well as their evolutions over time. Moreover, the idea based on the principle of our proposed model result in mainly 3 levels, which comprises the topics, topic sentiments, and topic-sentiment evolution over time. This allows our models to preform various functions. For example, the model extracts the hidden topical structures from a document collection, focusing on building the associations between topics and sentiments. Consequently, we provide an efficient tool for tracking and visualizing the strength of topic-sentiment correlation over time. Such information is reported and extracted at once and jointly without any post-processing which is very useful [7]. To address the above challenges, in this paper, we discuss the issue of joint topic modelling over time which mainly focuses on three proposed novel features that are not addressed by researchers in the field of CLSTOT. We first consider time, that is jointly modeled with topics and sentiments which allows capturing the evolution of the topics sentiment over time. Secondly, topic-specific sentiments are extracted from the whole data at once and not for every single document, this provides an overall sight of topic-sentiment correlations. Finally, no post-processing is needed to match similar topics under different sentiment polarities.
In general, the contributions of this study can be summarized as follows: firstly, time is jointly modeled with topics and sentiments providing a quantitative analysis of topic-sentiment evolution over time. Secondly, we specifically present an unsupervised cross-lingual topic sentiment evolution over time model, by integrating the state-of-the art model STOT into the above cross-lingual topic model framework, which can improve sentiment classification for target language by exploiting correspondences with the source language. Finally, through extensive experiments on real world datasets belonging to different domains and languages, and via comparison with existing state-of-art models, we demonstrate the effectiveness and validity of the proposed cross-lingual topic model framework and the CLSTOT model. Experimental results show that the proposed model can be successfully applied to practical sentiment classification applications in order to improve the accuracy of sentiment classification in target languages.
The rest of this paper is organized as follows: in Section 2 we introduce the related work which includes joint topic-sentiment modeling, topic evolution over time, and cross-lingual topic models. In Section 3, we present our proposed model Cross-lingual Sentiment Topic Model Evolution Over Time. Generative process of our model and parameter inference are shown in Section 4. Experimental settings and results are shown in Section 5. Finally, in Section 6, we conclude this paper and future work.
Related work
Joint topic-sentiment modeling
Topic and sentiment modeling tasks are entirely related to the extent that sentiments are usually issued about a topic at hand[7]. To examine the model topic-sentiment conjunction, much work has been carried out using the topic model-sentiment approaches: statistical models for discovering low-dimensional structures (topics) from text based on word co-occurrence patterns. These patterns are captured using latent or hidden variables. Earlier topic models, like LDA and PLSA [2, 12] have mainly focused on extracting similar topics, but more recently progress has been made to these models to capture other aspects of the text, such as sentiment. As an example, Joint Sentiment-Topic model (JST) [18, 17]. Was developed for dealing with topic extraction under different sentiment labels. This is performed by extending LDA with a new sentiment layer inserted before the topic layer. Thus, to generate a word for a document, a sentiment labels
Topic evolution over time
Here, the documents were collected over time such as online discussion forums, emails, news, etc. and consequently their content may evolve and change over time. Here, we focus on the quantitative evolution, i.e., the “amount” of data to discussing some topic at some time-stamp (T). Topic over time (TOT)[37] is an LDA-based model for quantitative topic evolution. For this, the meaning of the topic is assumed to be constant, its size is supposed to evolves and it is captured using a Beta distribution as presented by[33]. LDA model has been used to capture quantitative topic evolution over time by counting the number of documents associated with each topic at each timestamp. On the other hand, qualitative evolution focuses on all other aspects of a topic (word distribution, inter-topic correlation, vocabulary, etc.). In [13]. Alice proposed the Sentence-LDA (SLDA), a probabilistic generative model that assumes all words in a single sentence are generated from one aspect. Furthermore, the authors extended SLDA to aspect and sentiment Unification Model (ASUM), which incorporates aspect and sentiment together to model sentiments toward different feature and applications [9].
Cross-lingual topic models
A cross-lingual topic model (CL-LDA) opens up new opportunities for knowledge extraction research [35, 5]. Our aim is to find a bridge between languages. by using the existing models to combine different languages via parallel or comparable corpora, for example, CL-LDA[8]. Furthermore, we preserve connections between languages by sharing distribution over the latent topic of the bilingual document. Switch LDA[22] extends CL-LDA such as that it can control the proportions of languages in each multilingual topic by introducing a per-word switch variable. Symmetric Correspondence LDA proposed in[10] is a topic model that incorporates a hidden variable to control a pivot language. Since parallel or comparable corpora are relatively limited resources, this kind of models are confined in their applications.
CLLDA on unaligned texts may be applied to a broader class of corpora. For instance, in [34], they used to incorporate soft bilingual dictionary-based constraints into Probabilistic Latent Semantic Analysis (PLSA) so that it can extract shared Latent topics in text data of different languages. Boyd-Graber and Blei in 2009[3] developed the Multilingual Topic (MUTO) model to exploit matching across languages on term level to detect multilingual latent topics from unaligned texts. However, these models do not put into consideration the sentiment analysis[3].
Moreover, in early 2010 Boyd-Graber and Resnik proposed an LDA-based holistic model for multilingual sentiment analysis on the word level. Nevertheless, this model is supervised, while the model proposed in this paper is unsupervised. Besides still our model is also LDA-based on the sentence level which makes it more appropriate for aspect and sentiment detection from reviews[4].
Proposed method
Cross-lingual model-sentiment evolution over time
In this section, we present the CLSTOT model, our approach to modeling cross-lingual topic-sentiment associations as well as their quantitative evolution over time. Our scheme is motivated by the following remarks made on the earlier proposed models:
Time is not jointly modeled with topics and sentiments. Topic-specific sentiments are estimated for each document separately. Similar topics from different sentiment polarities are not automatically matched.
To deal with these shortcoming, we suggest a novel topic model based on three main features: Firstly, time is jointly modeled with topics and sentiments providing a quantitative analysis of topic-sentiment evolution over time. Secondly, topic-specific sentiments are extracted from the whole data and not for every single document, which gives an overall view of topic-sentiment correlations. Lastly, no post-processing is needed to match topics under different sentiment polarities, because the same topic has multiple distributions over words; one for each sentiment polarity. We extend the CLLDA model by adding two new layers, S and T, to capture sentiment and time respectively as shown in Fig. 1.
The graphical representation of CLTOT.
The proposed method is developed on the traditional topic modeling assumptions: each document in the learning collection is a mixture of topics (multinomial distribution over topics). Also, we assume that each topic has multiple facets; one for each sentiment polarity, and consequently multiple multinomial distributions over words. Moreover, we assume that the “force” of topic-sentiment association would evolve over time. Documents from the learning data must be annotated with time (e.g., creation date). Time is first discretized, and each document receives a discrete time-stamp label (e.g., day, month, and year). In the learning step, time modality is captured using the variable
The notation for all models in the paper
The notation specified for the CLSTOT models
For each Topic
Draw a multinomial distribution of the source language:
Draw a multinomial distribution of the target language:
For each document
Draw a distribution Draw a binomial distribution, Draw a sentiment label Draw a time label Draw a topic Draw the language label
If If For each document
Draw a distribution Draw a binomial distribution, Draw a sentiment label Draw a time label Draw a topic Draw the language label
If If
Like LDA, exact inference is intractable in our model. Hence approximate estimation approaches, such as Gibbs Sampling [26], are utilized to solve the problem. Gibbs Sampling is a particular case of Markov Chain Monte Carlo (MCMC)[9], which is a relatively simple algorithm of approximating inference for our models. Due to space limitations, only the final formulas are described here.
Joint distribution
The joint probability of sentiment topic evolution over time is calculated using Bayes conditional independence rule, the joint probability of words, topics, sentiments, and timestamps can be calculated as follows:
The first term is obtained by integrating over
Where
The remaining terms of Eq. (1) are obtained in the same way by integrating over
For the source language: If
Moreover, if
For the target language:
Moreover, if
Therefore, the approximate probability of sentiment
Moreover, the approximate probability of topic
The approximate probability of time
Where
Lastly, the approximate probability of the word
Where
In this section, we validate the proposed CLSTOT model via extensive experiments. Specifically, we first examine its validity on discovering CLSTOT, then we observe the effectiveness of its cross-lingual mechanism to determined the effect of the bilingual dictionary on its behavior. Next, we investigate its performance by applying it to a practical sentiment classification task and quantitatively comparing it with four state-of-the-art baselines.
Experimental setting
Data set and preprocessing
We managed to compile our data set via collecting English and Chinese observations from different regions reflecting different cultures for some significant events as depicted in Table 3. The English reader comments are collected from Yahoo1,and the Chinese reader comments are collected from sina News2. We first remove news reader comments whose length is less than five words. We then remove the punctuation and the stop words. For English comments, we also stem each word to its root from using Porter stemmer (Porter, 1980). For the Chinese reader comments, we use the Jieba package3 to segment and remove Chinese stop words. We utilize an English-Chinese dictionary from MDBG4.
The statistics for the data set
The statistics for the data set
To optimize the number of topics, we empirically ran the models with four values of
Baselines
For performance comparison, we adopted four representative baselines which are CLJAS (Cross-lingual joint Aspect Sentiment), JSTM (Joint Sentiment-Topic Model), ASUM (Aspect Sentiment Unification Model), and TS-LDA (Topic Sentiment Latent Dirichlet Allocation). We chose these models because they are considered as baselines, and they also state-of-the-art. However, they all are monolingual models. CLJST is an extended model for JST, while CLSTOT is an extended model for STOT. Thus, we compare CLJST with JST and CLASUM with ASUM respectively. Through comparing the cross-lingual model with the corresponding monolingual model, we can validate the effectiveness of the proposed cross-lingual mechanism of our model.
Topic coherence
CLSTOT can also extract coherent sentiment-aware topics from news reader generated post collection documents, and evaluate the efficiency of topic and sentiment captured by our models. In order to carry out a quantitative evaluation of topic coherence, we used an automated metric proposed in[25], which is shown in Eq. (14), where topic coherence; denoted as
Where
Average coherence score on the top M words in the K topic discovered on news reader of News reader comments in the source language
Average coherence score on the top M words in the K topic discovered on news reader of News reader comments in the target language
Proportion of Coherent topics generated by each model in 
As our objective is to discover more coherent sentiment topics over time, we chose to evaluate the topics manually, which is based on human judgments. Due to the fact that without enough knowledge, the an notation will not be credible and following [25], we asked two human judges who are familiar with shared knowledge and skilled in looking up the test news dataset, to annotate the discovered sentiment-aware topics evolution over time manually. For this annotation to be reliable, we labeled the generated topics by all the baseline models and our proposed model at the tenth (10) learning iteration.
Topic labeling
Furthermore, we asked the judges to label each sentiment-aware topic evolution over time as coherent or incoherent. Each sentiment-aware topic is represented as a list of the twenty (20) most likely words in the word distribution
Proportion of Coherent topics generated by each model in 
We chose coherent sentiment evolution topic over time which was judged before and asked judges to label each word of the top twenty (20) words among these coherent topics.
When a word was related to the central semantic-coherent concept which represents the topic labeling, the judges had known the concept for each topic and the overall sentiment of the topic. The process of labeling words of each sentiment evolution over time is shown in Figs 4 and 5. The previous figures shows the topic precision generated by each model. When
Average Precision @10, 20 for sources positive and negative.
To explore the quality of topics discovered by CLSTOT, we randomly chose some topics for visualization. We selected six topics, i.e., three real topics and three negative topics. For each topic, we chose the top 10 words, which mostly represent the topic. In Table 6, we present the top words of the selected topics. The three topics with a positive sentiment label, and the three with negative sentiment label topics are respectively shown in Table 6. The six topics are quite explicit and coherent, where each of them tried to capture the topic of a news reader comment. In terms of topic, our model CLSTOT achieves reliable results. Moreover, the CLSTOT model can obtain bilingually aligned aspects due to the feature of topic distribution sharing. This mechanism enables us to develop the CLSTOT on the data in the target language by leveraging the precious resources in the source language. Analysis of these topics shows that CLSTOT can indeed discover coherent sentiment evolution topic over time.
Example of extracted positive negative [English and Chinese] by CLSTOT
Example of extracted positive negative [English and Chinese] by CLSTOT
Average Precision @10, 20 for target positive and negative 
This experiment aims to estimate the validity of the proposed cross-lingual mechanism. For this reason, we calculated the perplexity [2] of the test set under different
Conclusions
In this paper, we proposed a novel unsupervised CLSTOT for sentiment analysis. The CLSTOT model requires neither similar corpora nor labeled sentiment reviews. Through the cross-lingual mechanism, our model can significantly improve sentiment analysis in a target language by leveraging data and knowledge available in the source language. Mainly, CLSTOT can obtain bilingually aligned news reviews from the same topic distribution. Experimental results demonstrated that compared to similar unsupervised baselines, CLSTOT can significantly improve the supervised baseline. CLSTOT also exhibits comparable performance which has convincingly validated its effectiveness and highlighted its value. Moreover, we used seven different topics. We have demonstrated that CLSTOT outperforms four other state-of-the art models in extracting accurate topic-sentiment time associations. We further illustrated the efficiency of the CLSTOT model for homogeneous topics, associated sentiments and their evolution over time using both academic and real-world datasets collected from the web. Moreover, analyzing topic-sentiment evolution on real-world data allowed us to discover and gain insight into hidden phenomena which was impossible with previous models. This last feature generates a wide range of promising applications. For future research , we will try to adapt this strategy to additional languages. We will also consider expanding our model by using word embedding.
