Abstract
Search systems based on both professional meta-data (e.g., title, description, etc.) and social signals (e.g., like, comment, rating, etc.) from social networks is the trending topic in information retrieval (IR) field. This paper presents 2SRM (Social Signals Relevance Model), an approach of IR which takes into account social signals (users’ actions) as an additional information to enhance a search. We hypothesize that these signals can play a role to estimate a priori social importance (relevance) of the resource (document). In this paper, we first study the impact of each such signal on retrieval performance. Next, some social properties such as popularity, reputation and freshness are quantified using several signals. The 2SRM combines the social relevance, estimated from these social signals and properties, with the conventional textual relevance. Finally, we investigate the effect of the social signals on the retrieval effectiveness using state-of-the-art learning approaches. In order to identify the most effective signals, we adopt feature selection algorithms and the correlation between the signals. We evaluated the effectiveness of our approach on both IMDb (Internet Movie Databese) and SBS (Social Book Search) datasets containing movies and books resources and their social characteristics collected from several social networks. Our experimental results are statistically significant, and reveal that incorporating social signals in retrieval model is a promising approach for improving the retrieval performance.
Keywords
Introduction
Before web 2.0, user interactions were limited to creation of links from one website to another one [50]. Nowadays, social web has completely changed the manner in which people communicate and share information on the web. It allows users to interact and produce large masses of social signals. On Facebook, the Like and Share Buttons are viewed across almost 10 million websites daily.1
On Twitter, second most popular social network after Facebook, thanks to its functionalities of tweet and retweet more than 150 million tweets were published just for the 2012 Olympics games.2 Other types of functions such as endorsement, share, comment and rating allow users to interact with web resources. Through these social actions, web resources could become popular by accumulating the counts that people share such information, facilitate and help, users access novel information in convenient manner.While we witness some recent moves from big players towards a more social information retrieval (such as Google and Bing expansion of results with those liked by the users’ Facebook friends), the ways search engines and/or web 2.0 applications exploit social signals (if they ever do) are usually not disclosed. This paper describes an approach that exploits social networks or involve a collective intelligence process to help the user satisfy an information need. Particularly, we focus on exploiting social signals to estimate the social importance (relevance) of the resource to a given query. The main research questions addressed in this paper are the following:
Can these social signals help the search systems for guiding its users to reach a better quality or more relevant content?
How effective is each individual signals for ranking resources for a given query? What are the ranking correlations created by these social signals?
How to combine these social signals in form of social properties? What are the most useful of them to take into account in search model?
Note that we have already investigated the impact of social signals on search effectiveness using machine learning approaches in paper [11]. It only described some preliminary results and it did not deeply evaluate and analyze the results. This paper extends significantly our previous work in the following main additional aspects:
presenting a more complete state-of-the-art on social signals,
exploiting additional signals (e.g., tag, rating and its freshness, etc),
conducting new and extensive experiments on two standard datasets, namely INEX SBS (Social Book Search) and INEX IMDb (Internet Movie Database).
The remainder of this paper is organized as follows: Section 2 presents some related work and the background. Section 3 details our social IR approach. Then, Section 4, reports on the results of our experimental evaluation. Finally, Section 5 concludes this paper by announcing perspectives.
In this section, we report: (i) some background information about social signals and the existing search engines using these signals (e.g., Google and Bing); (ii) related work exploiting social data to estimate document relevance.
Social signals
Social signals represent one of the most popular UGC (User Generated Content) on the web. Indeed, according to [4] web pages include buttons of different social networks where users can express whether they support, recommend or dislike content (text, image, video, etc). These buttons which describe actions being of social activities (e.g., like, share, +1) are related to specific social networks (e.g., Facebook and Google+) with counters indicating the interaction rate with the web resource (see Fig. 1).

Example of a resource with social signals buttons.
A social signal is a measure of the activity on social media. It is a social interaction of a real person with a resource on the web through the functionalities offered by social networks. As with backlinks,3
Backlinks or inbound link is a hyperlink to a site or web page.
List of different social signals types
In general, each social network uses its own social signals with different operating rules. Table 1 summarizes the most popular signals on social networks.
Social signals and search engines
Despite the lack of clear consensus on the exact relationship between social signals and famous search engines (e.g., Google and Bing), there are many reasons why the signals cannot be ignored. Rather than seeing the signals and ranking of results by the search engines as two distinct components. It is useful to consider them as interconnected processes working towards the overall goal: increasing online visibility. Since inception of Facebook or other social networks, social signals also became an important information for SEO (Search Engine Optimization) [39]. They provide information about social interactivity, social behavior and social relations.4
The correlation between social signal and the ranking position of a URL is extremely high. This is valid for all social networks covered by SISTRIX5 Toolbox. The works of Lewoniewski et al. [39] lead to the assumption that the results of social signals also correlate with the quality of Wikipedia articles. Therefore, social signals can be an indicator of the relevance of web resources.Google is still mysterious about the way to exploit social signals to rank its search results, but some studies conducted annually since 2016, searchmetrics6
showed that it exists a high correlation between social signals and the rankings provided by search engines such as Google. However, the degree to which social signals play a role SEO is unclear. John Mueller (Webmaster Trends Analyst at Google) said: “Do social media signals have an impact on organic rankings in Google? Not directly. No. So it’s not that there’s any kind of a ranking effect there. To a large part social networks also have a nofollow on the links that they provide when they post this content, so it’s not the case that would give you any kind of a ranking boost there. What you do sometimes see however is that the social posts show up in the search results.”7Although Google does not have partnership with Facebook, it still has access to public data from Facebook and may use some of them to better understand the popularity of web pages. In 2015, Google signed an agreement with Twitter to index tweets in real time, allowing them to be searchable. Free access to Twitter database means that all the information on Twitter are available for Google automatically without the use of robots crawling. Google algorithms also focus on Twitter profiles who tweet and retweet content, but how do it remains a black box.
In addition, it is no secret that Google gives weight to its own social network, Google+. The content and interactions on Google+ are known to have a positive impact on the ranking of its results. The 2016 Searchmetrics study showed that +1 is strongly correlated with the ranking of Google search results, compared to other well-known criteria such as Facebook signal and keywords frequency.
Bing, the second biggest search engine after Google, is explicit in its use of signals such as tweet and Facebook like as well as other social signals as ranking factors [2,37,45]. Bing has partnered with Facebook in social search [51]. Bing algorithms focuses on the social media content, links, popularity from various social networks that are considered as important factors by Bing to define the ranking of results.
Bing is a notable example of exploiting images and information (posts, signals) from social media to provide the richest results for users [45]. The social media activity is also presented on Bing results pages much more visible than other search engines.
Social content such as tweets, pins (coming from Pinterest) and Facebook data, containing relevant keywords, are often integrated into the Bing search results. Thus, publishing visual contents (image, video, etc.) on the social networks is an excellent way to increase the visibility on Bing. Also, Bing is implementing a service called Social Sidebar that exploits Facebook to enhance a search [28]. It is a third column of the results page that allows users connected to comment and like the relevant results from Facebook without leaving the search page. This service is functional only in USA.
The popularity of UGC, particularly in the context of social media has given birth to many new problems in information retrieval. Specifically, how to exploit these social content in favor of IR is an open question, which gave birth to a new field in IR, Social Information Retrieval (SIR) [6,18].
Some approaches of social search rely on exploiting tags. Abel et al. [1] and Hotho et al. [31] proposed different algorithms based on folksonomies. By analogy with PageRank, FolkRank is based on relation between tag, resource and user, whereas with GFolkRank a resource group is identified by a unique tag in the context of the resource group. In the same context Bao et al. [17] proposed two algorithms, SocialPageRank and SocialSimRank, together with Yanbe et al. [50] suggest SBRank algorithm. These algorithms are motivated by the report that there is a strong interdependence between the popularity of users, tags and resources in a folksonomy. They focus on collective social search and doesn’t respect different engagement types or different trust levels. SocialSimRank calculates similarity between two tags of folksonomy and declares that similar tags are usually assigned to similar resources. In addition, tags are sometimes more reliable than metadata provided by the content producer. However, single tag can hardly cover an entire topic and is more ambiguous for the user that a contextual sentence.
Furthermore, there are several recent works that focus on how to improve information retrieval (IR) effectiveness by exploiting the users’ actions and their underlying social network. Chelaru et al. [22] studied the impact of signals (like, dislike, comment) on the effectiveness of search on YouTube. They showed that, although the basic criteria using the similarity of query with video title and annotations are effective for video search, social criteria are also useful and improve the ranking of search results for 48% of queries. They used feature selection algorithms and learning to rank algorithms. Karweg et al. [35] proposed an approach combining topical score and social score based on two factors: first, user engagement intensity quantifies the effort a user has made during an interaction with document, measured by the number of clicks, number of votes, number of records and recommendation, secondly, trust degree measured from social graph for each user according to his popularity, using PageRank algorithm. They have found that social results are available for most queries and usually lead to more satisfying results. Similarly, in [36,37] Khodaei et al. proposed a ranking approach based on several social factors including relationships between document owners and querying user, importance of each user and users’ actions (playcount: number of times a user listens to a track on last.fm) performed on web documents. They have conducted an extensive experiments set on last.fm dataset. They showed a significant improvement for socio-textual ranking compared to the textual only and social only approaches. On Twitter, Hong et al. [30] used retweets as a measure of popularity and apply machine learning techniques to predict how often new messages will be retweeted. They exploited different features, the content of messages, temporal information, metadata of messages and users, and the user’s social graph. However, banal tweets (e.g., rumors, without interest) can be very popular, such as those concerning celebrities, who generally have a large number of followers. Chan et al. [21] proposed a system called PostScholar, a service that augments the results returned by Google Scholar, a search engine for academic citations. PostScholar detects the Twitter activity related to an article and displays that information on the search results page returned by Google Scholar. An additional hyperlink appears in the results for each article that has Twitter activity associated with it (the number of tweets found for that article, the date of the most recent tweet). These tweets are sorted according to their sentiments scores. Albishre et al. [3] proposed an innovative mechanism to automatically select useful feedback documents using a topic modeling technique to improve the effectiveness of pseudo-relevance feedback models. The main idea behind their proposed model is to discover the latent topics in the top-ranked documents that allow for the exploitation of the correlation between terms in relevant topics. To capture discriminating terms for query expansion, they incorporated topical features into a relevance model that focuses on the temporal information in the selected set of documents. Experimental results on TREC 2011–2013 microblog datasets illustrated that the proposed model significantly outperforms all state-of-the-art baseline models.
Finally, there are other studies initiated by Microsoft Bing researchers [41,45] that show the usefulness of different social contents generated by the network of user’s friends on Facebook. Pantel et al. [42] studied the leverage of social annotation on the quality of search results. They observed that social annotations can benefit web search in two aspects: 1) the annotations are usually good summaries of corresponding web pages, 2) the number of annotations indicates the popularity of web pages. Hecht et al. [28] presented a system called SearchBuddies based on any social information around the user and especially what his friends liked and shared as web page, Facebook pages. Gou et al. [24] proposed a ranking approach taking into account document content and similarity between user and documents user owner in social network. They used a multi-level algorithm to measure the similarity between actors. Experimental results based on YouTube data show that compared with tf-idf algorithm, SNDocRank method returns more relevant documents. According to these results, a user can enhance search by joining a larger social networks, having more friends, and connecting larger communities.
In this paper, our goal aims to exploit social signals for improving accuracy and relevance of convention textual web search. We exploit various signals extracted from different social networks. In addition, instead of considering social features separately as done in the previous works, we propose to combine them to measure specific social properties, namely the popularity and the reputation of a resource. We also attempt to measure the impact of the freshness of the signal on the performance of IR system. Unlike our previous work on social signals [5,7,8,10,12–16], in this paper, the proposed approach is completely supervised by exploiting social signals collected from different social networks as criteria of relevance. It is evaluated on different types of standard test data (INEX Social Book Search and INEX Internet Movie Database). In the following, we present some previous work in connection with this work but are different in regard to the solution proposed in this paper.
In [5] we presented our first basic preliminary study in the form of a poster which proposes a problematic on the users’ information needs coming from the social web, this problem is related to what is called social search or social information retrieval. In [10,12] we have used some features such as comments and likes for ranking web resources, and addressing the impact of number of shares and likes in search relevance. The use of temporal features is inspired by the use of features of this class (e.g., Age of the resource and the signal) in specific domains, such as IMDb movies [13,14]; In these papers, we are particularly interested in: first, showing the impact of signals diversity associated to a resource on information retrieval performance; second, studying the influence of their social networks origin on their quality. We have proposed to model these social features as prior that we integrate into an unsupervised language model. In [15,16] our objective is to study the impact of the new emotional social signals, called Facebook reactions (love, haha, angry, wow, sad) in the retrieval. These reactions allow users to express more nuanced emotions compared to classic signals (e.g. like, share). First, we have analyzed these reactions and showed how users use these signals to interact with posts. Second, we have evaluated the impact of each such reaction in the retrieval, by comparing them to both the textual model without social features and the first classical signal (like-based model). Similarly to the modeling signals in [13,14], these social features are modeled as document prior and are integrated into a language model. We have conducted a series of experiments on IMDb dataset. In [8,9] our goal is to show how these social traces can play a vital role in improving Arabic Facebook search. Firstly, we have identified polarities (positive or negative) carried by the textual signals (e.g. comments) and non-textual ones (e.g. the reactions love and sad) for a given Facebook post. Therefore, the polarity of each comment expressed on a given Facebook post, is estimated based on a neural sentiment model in Arabic language. We note that sentiment analysis of social content is a complex task [20]. Secondly, we have grouped signals according to their complementarity using features selection algorithms. Thirdly, we have applied learning to rank (LTR) algorithms to re-rank Facebook search results based on the selected groups of signals. Finally, experiments are carried out on 13,500 Facebook posts, collected from 45 topics in Arabic language. Experiments results reveal that Random Forests combined with ReliefFAttributeEval (RLF) was the most effective LTR approach for this task. In [7] we have conducted an exploratory study in the impact of users’ traces on Arabic and English Facebook search. In general, during all these years of research, our findings reveal that incorporating social features is a promising approach for improving the IR ranking performance in Arabic and English languages.
2SRM: Social signals relevance model
Our IR approach, named 2SRM, consists of exploiting social signals to define social properties to take into account in retrieval model. We associate to each web resource a social relevance estimated based on these social features (signals and properties). The social relevance score is then combined with a classical topical relevance score (see Fig. 2).

A modular approach for social IR.
Social information that we exploit within the framework of our model can be represented by 5-tuple (
Formalization of our social search model
By analyzing various types of social actions (signals) through many social networks, we define three social properties that are detailed below:
In a summary, we assume that some social actions are more suitable to evaluate popularity of a resource and others are more related to reputation. Therefore, we associate to each of these properties a score calculated by a simple counting (normalized using min-max) of the number of associated actions. The general formula is the following:
Formula (1) is normalized as follows:
Where:
In addition to a simple counting of social actions, we propose to consider the time associated with the signal. We assume that the resource associated with fresh (recent) signals should be promoted.
Let
Let
Table 2 illustrates, through an example of data, the different steps to calculate the final social score.
Calculation example of
Calculation example of
To evaluate our approach, we conducted a series of experiments on two datasets, SBS (Social Book Search) and IMDb (Internet Movie Database). We first evaluated the impact of social signals, taken separately and when they are combined as properties (popularity, reputation and freshness). Secondly, we study the effectiveness of each social signal using machine learning with selection attributes algorithms. We compared our approach which takes into account social signals, with the baseline formed by only a textual model. Our main goals in these experiments are:
to evaluate the impact of signals taken separately and grouped (properties),
to evaluate the effectiveness of signals using machine learning techniques.
List of the different IMDb document metadata fields
List of the different IMDb document metadata fields
Statistics on the number of signals in the IMDb documents returned by 30 topics
List of the different SBS document fields
Statistics on the number of signals in the SBS documents returned by 208 topics
Exploited social signals in the quantification
We used the collections SBS8
and IMDb9 documents provided by INEX. Each document describes a book on SBS and movie on IMDb. It is represented by a set of metadata, which has been indexed according to keywords extracted from fields (see Table 3 and 5). For each document, we collected specific social signals via their corresponding API of 6 social networks listed in Table 7. We chose 30 and 208 topics with their relevance judgments provided by INEX IMDb 2011 and SBS 2015, respectively. In our study, we focused on the effectiveness of the top 1000 results.Table 4 presents statistics on the number of signals in the 1000 documents returned by each topic (30 IMDb topics). According to the averages of signal numbers in the documents, we note that the density of Facebook signals (in average: 85.8 like, 94.1 share and 98.4 comment) is very high compared to other signals (in average:
Table 6 presents statistics on the number of signals in the 1000 documents returned by each topic (208 SBS topics). We note that the density of Facebook signals is very high compared to Amazon/LibraryThing signals but the total number of rating and review is much higher compared to other signals.
Table 7 presents the properties that we want to take into account in our retrieval model. In order to quantify these social properties, we associate them with the corresponding social signals. Specific social signals (actions) have been associated with each property depending on their nature and meaning. In Table 7, we note that the social signals that quantify reputation carry positive opinions, for example, bookmark a resource link by a user on Delicious means that this resource has been added to his favourites list. Concerning like and rating, user clicks on these buttons to indicate that he has enjoyed the resource content. So the presence of these social signals in resource increases the degree of resource reputation. The same applies for popularity, the exploited social signals to estimate it, let us know the position of this resource on the web in terms of trend and propagation. Finally, to quantify freshness the dates of the different actions are not available except the dates of each rating from SBS dataset.
Result of linear combination study
We conducted experiments with models based only on the contents of documents, as well as approaches combining content and social data. Normalized formula (5) is the weighted sum of social relevance and topical relevance:
In this paper, we evaluate the contribution of each social signal/social property and the effect of their combination on relevance. We first select the best parameters α (see formula (5)) and
Comparing social search effectiveness to Solr and BM25 (On IMDb)
Comparing social search effectiveness to Solr and BM25 (On SBS)
Tables 8 and 9 summarize the results of precision (P@10 and P@20), nDCG and MAP for IMDb and SBS datasets, respectively. We evaluated different configurations, by taking into account social signals individually and their combination in the form of social properties. the “Freshness” configuration is only estimated for the SBS collection where we have the appearance time of each rating on a document. In order to check the significance of the results, we performed the Student test [23] and attached * (strong significance against BM25) and ** (very strong significance against BM25) to the performance number of each row in the Tables 8 and 9 when the p-value < 0.05 and p-value < 0.01 confidence level, respectively.
We observe in all cases, with taking into account social features, the results are significantly better compared to textual models. Also, it is clear that combining social signals as properties (popularity and reputation) provides better results than when they are taken individually. The results show that reputation configuration provides better results than popularity. The freshness in our study is related to the recency of actions associated with a resource. The resources that possess fresh signals are promoted in search results list. The overall combination of social properties provides the best results. According to Student test, majority of the results show a strong statistically significant improvement.
In general, experimental results reflect the effectiveness of social signals on search task. More specifically, the results show that the way we have combined social signals to quantify different properties is effective to improve precision and nDCG. Therefore, combination of freshness with popularity and reputation (named “All Properties” in Tables 8 and 9) provides the best improvement compared to a random combination of “All Criteria”.
We can explain these results by the positive sense of reputation property quantified through the counting of signals such as like, mention+1, bookmark and rating, which means favourable and positive opinion for the resource judgment. Social networks urge users to share, comment, evaluate and disseminate the information on a large scale. These interactions allow us to draw conclusions about the social position and quality of these resources in social networks across their popularity, reputation and freshness. Therefore, we can also explain our results by the high rate of user’s engagements on various social networks, which brings together more than a billion users, producing users’ massive interactions “wisdom of crowds” with web resources through these social actions of different natures, often positives.
In this section, we conducted a series of experiments in a supervised environment, using machine learning algorithms with the set of effective social signals identified in Table 7. The aim is twofold: on the one hand we wondered whether the attribute selection really improves the results of a search. On the other hand, we intended to measure the performance of some learning algorithms in this type of classification.
Methodology
Removing the irrelevant and redundant features from the data helps to improve the performance of learning models. Following are the most well-known feature selection algorithms:
While: H specifies the entropy. Entropy is a measure of the uncertainty associated with a random variable.
In this study, we relied on algorithms for selecting attributes to determine the best social signals to exploit in the learning model. Feature selection Algorithms [27] aim to identify and eliminate as many irrelevant and redundant information as possible. We used Weka11
for this experiment. It is a powerful open-source Java-based learning tool that brings together a large number of learning machines and algorithms for selecting attributes.We proceeded as follows: the top 1000 documents for each topic from the two collections (30 IMDb topics and 208 SBS topics) were extracted using the default Lucene Solr model. Then, the scores of all criteria (social signals) are calculated for each resource. We identified relevant documents and irrelevant documents according to the Qrels. The resulting set contains 30000 documents for IMDb and 115248 documents for SBS, including:
2765 relevant documents and 27235 irrelevant documents for IMDb.
2953 relevant documents and 112295 irrelevant documents for SBS.
We observed that this collection has an unbalanced relevance classes distribution. This occurs when there are many more elements in one class than in the other class of a training collection. In this case, a classifier usually tends to predict samples from the majority class and completely ignore the minority class [48]. For this reason, we applied an approach to sub-sampling (reducing the number of samples that have the majority class) to generate a balanced collection composed of:
2765 relevant documents and 2765 irrelevant documents for IMDb.
2953 relevant documents and 2953 irrelevant documents for SBS.
Irrelevant documents for this study were selected randomly. Finally, we applied the attribute selection algorithms on the two sets obtained, for 5 iterations of cross-validation.
We note that these algorithms operate differently, some return an importance ranking of attributes (e.g., FilteredAttributeEval), while others return the number of times that a given attribute has been selected by an algorithm in a cross-validation (e.g., FilteredSubsetEval). In the following, we present the results obtained by each selection algorithm (with and without considering the time) applied on INEX SBS and IMDb datasets. We note that we have used for each algorithm the default setting provided by Weka.
Our goal of this study is to determine the most important signals for IR task and verify if the results obtained previously (Table 8) are consistent. In our case, the selection algorithms are to give a score to each signal based on its significance towards relevance class of the document (relevant or irrelevant). For IMDb (see Table 10), we evaluated 7 criteria where
According to Table 10, we remark that the Facebook signals
Selected signals with attribute selection algorithms (IMDb dataset)
Selected signals with attribute selection algorithms (SBS dataset)
Similarly, on SBS dataset, Table 11 shows that Facebook signals

Machine learning process.
We also conducted a series of experiments exploiting these signals in supervised approaches based on learning models. We used the results returned by Lucene Solr using all the queries from the two INEX collections (IMDb and SBS), each separately, as training sets. We then used three learning algorithms, this choice being explained by the fact that they often showed their effectiveness in IR by exploiting criteria of relevance: SVM [34], J48 (C4.5 implementation) [44] and Naive Bayes [49]. The input of each algorithm is a vector of social signals, either all the signals or just the signals selected by a precise selection algorithm. Each signal is represented by its quantity in the documents. Learning algorithms predict the relevance class for each document (relevant or irrelevant). We applied a cross validation for 5 iterations (5 cross-validation folds). Figure 3 illustrates the learning process we used for evaluating social signals.
We recall that the phase of attribute selection algorithms has highlighted two sets of signals:
in the case of the CfsSubsetEval and FilteredSubsetEval algorithms, the selected signals are
in the case of the other selection algorithms, all the social signals studied on the two collections are selected:
The question at this stage relates to the specification of the input signal vector for the learning algorithms, either we take all the signals, or we keep only those selected by the attribute selection algorithms. In this case, with which learning algorithms these will be combined.
In order to take into account the signals chosen by the selection algorithms in learning models, we relied on the work of Hall and Holmes [27].
Hall and Holmes [27] have studied the effectiveness of some attribute selection techniques by confronting them with learning techniques. Since the performance of the factors differs from one learning technique to another, they have identified the best attribute selection techniques to find the best performing factors according to the learning techniques to be used.
Based on their study, we used the same pairs of learning techniques and attribute selection techniques:
signals selected by WrapperSubsetEval (WRP) are learned by Naive Bayes.
signals selected by CfsSubsetEval (CFS) are learned by Naive Bayes.
signals selected by ReliefFAttributeEval (RLF) are learned by J48.
signals selected by SVMAttributeEval (SVM) are learned by SVM.
Machine learning results (P@20) on IMDb
Machine learning results (P@20) on IMDb
Machine learning results (P@20) on SBS
Tables 12 and 13 present the results of the three learning algorithms of signals that emerged from the study using attribute selection techniques. We find that only the CFS algorithm confirms the hypothesis put forward by Hall and Holmes. Indeed, it is the only one for which the results obtained with the selection of attributes, are 0.4927 (IMDb) and 0.1223 (SBS), exceed the use of all the signals, 0.4802 (IMDb) and 0.1100 (SBS). We have shown that machine learning approaches have better efficiency (precision) with attribute selection approaches. We then note that all learning models outperform textual models (Lucene Solr model and BM25) as well as our first propositions based on the linear combination approach. We finally find that the J48 decision tree is the most appropriate model, it takes into consideration all the social signals, the improvement rates compared to Naive Bayes and SVM are 13% and 15% on IMDb as well as 6% and 17% on SBS, respectively. In addition, the J48 gives the best improvements over all previous approaches, the improvement rate compared to the default model BM25 (configuration named Lucene Solr in Tables 8 and 9) is 144% for the IMDb collection and 152% for the SBS collection, while comparing to the best results obtained by the model based on linear combination (configuration named “All Properties” in Tables 8 and 9) is 27% for the IMDb collection and 63% for the SBS collection.
Finally, all these experiments clearly show that social signals allow to enhance a search. These improvements show the interest of social relevance, knowing that qualitative properties (popularity and reputation) and the temporal property (freshness) provide a significant improvement compared to the configuration ignoring these properties (textual model only or signals teken individually). We observe that the resources having more positive data (e.g., like, +1, rating) are trustworthy than the ones don’t possess these social signals. If multiple users have found that the resource is useful, then it is more probable that other users will find these resources useful too. After these experiments, we observe that learning models are much more suitable than linear combination on exploiting of this type of social signals to enhance a search. We can say that the J48 learning model with selection attribute algorithm improves a precision of search results significantly.
In order to analyze social signals and determine if there is a link (dependence/independence) between them and the document relevance, thus that between them in pairs, we conducted a correlation study. Our goals are as follows:
first, determine the social signals that are correlated with the relevance, and facilitate the interpretation of the results. second, determine the redundant signals, and those that have a same effect on the retrieval improvement.
Correlation between signals and relevance
According to a June 2014 study from Searchmetrics,12
among 22 ranking factors identified, social signals account for 5 of the 6 most highly correlated with Google search results. In addition, BrightEdge13 survey released in 2013, 84% of search marketers say social signals such as like, tweet, and mention +1 will be either more important (53%) or much more important (31%) to their SEO (Search Engine Optimization) compared to previous years.
Correlation between social signals and Google search results.
Social signals continue to become more and more a highly correlated factor with the results of Google. Although we did not see a lot of scientific studies on these signals, some marketing organizations such as Searchmetrics continue to analyze them. According to the 2015 Searchmetrics study [47], the correlations of social signals rankings are practically unchanged compared to 2014 and remains at a high level. The first results returned by Google contain more social signals, this factor increases exponentially in the first places. Figure 4 shows the 2015 results of the correlation between social signals and Google results.
We analyzed the ranking correlation between signals and relevance using a correlation coefficient of Spearman’s Rho (

Spearman’s Rho correlation values for the social signals pairs
Figure 5 shows the values of correlations between ranges social signals (individually and grouped as properties: popularity and reputation) with respect to documents relevance. This study shows that Facebook like (0.29) has the highest correlation among the other individual signals, followed by number of Fecebook comment (0.28). Other high-ranking factors include Facebook share (0.27) and tweet (0.23). Concerning the signals grouped as properties as well as the total of Facebook signals, are the most correlated with relevance compared to signals taken individually. However, the popularity generates the highest correlation compared to the reputation and total Facebook signals.
Finally, the ranking correlation analysis shows that all social signals are positively correlated with relevance. This study justifies our hypothesis and the results obtained above (see Table 8) and confirms the interest of social signals exploited: Well positioned resources have a high number of like, share and specific resources stand out in the top search results with a very high mass of social data. On the one hand, this means that the activity on social networks continues to increase, on the other hand, it means that the frequently liked or shared content is increasingly correlated with good ranking of relevance.
To examine the linear relationship for each pair of social signals, we compute the pairwise overlap between the features by averaging the similarity of their top-1000 rankings over all queries. Atypical method for measuring the similarity of two ranked lists is using the Spearman’s Rho metric. The more Rho is close to 1 (in absolute value), the more the relation is strong and vice-versa.
In Table 14, we provide the Spearman’s Rho scores that are normalized to
Finally, this is a preliminary correlation study, we are well aware that further reflection to better address these issues is needed.
Conclusion
This paper proposes a search model exploiting social signals. These signals (User Generated Content), collected from several social networks, can quantify some social properties such as popularity, reputation and freshness. The proposed model combines linearly two relevance scores: (1) topical, estimated using classical IR model and (2) social, estimated using some social features, popularity, reputation and the freshness of resources. Experimental evaluation conducted on two INEX datasets IMDb and SBS shows that the integration of social signals and their properties within a textual search model allows to improve the quality of the search results. Our evaluations using attributes selection algorithms and three state-of-the-art learning algorithms support our hypothesis: the rankers based on the social signals, including both the popularity, the freshness and the reputation outperform those built by using only basic textual features. We found that J48 brings the best improvement in terms of effectiveness compared to baseline and all our other proposed configurations. Analyzing ranking correlations, we note that all social signals present a positive correlation. Meanwhile, this correlation agreement justifies the significant improvement for our proposed social approach.
For future research, we plan to address some limitations of the current study. We plan to integrate other social data into a proposed approach (emotions, event reactions, etc.). Also, we plan to study the importance of social networks and social actors of these signals and their impact on the relevance. Further experiments on other types of collections are also needed. This requires tracking users’ personal profiles as well as those of their followers and those of users they share, like, rate,tweet, etc. We intend to collect these data in the future to evaluate the user preferences, compared to social neighbors, to solve the personalized search. This is even with these simple elements, the first results encourage us to invest more this track.
