Abstract
The improvements in mobile technologies led to the wide adaptation and triggered the demand for location based services. In this respect, examining user similarities enable the analysis of user interests in terms of the determination of purchasing preferences and actual needs. User similarities are generally extracted from consumer life style, demographical information or the reflections from previously sent messages. In spite of the fact that these factors may not directly influence the purchasing decision, uncertain or lack of information can be encountered while establishing recommendation systems. Thus, researchers try to search other indicators that can reflect customer characteristics from spatial data, digital contribution in social media and search history for preferable representation of the changes in purchasing tendency. In this study, social platform based interval valued intuitionistic fuzzy location recommendation system is proposed by considering three common social platforms: Trip Advisor, Zomato and Foursquare. To perform restaurant offers to appropriate social platform users, a sentiment analysis is conducted to selected restaurants and number of negative, positive and neutral comments are extracted. After that, restaurant and location information are examined by using user, restaurant and location clustering via fuzzy clustering. Finally, intuitionistic fuzzy similarity matrix based collaborative filtering is used for restaurant offers to similar users.
Introduction
Because of increasing demand on mobile technologies and applications, location based systems can present customers’ special interests in a particular location and time [33]. More generally, location based systems (LBS) ensure content providers to send convenient messages to application users considering their visiting experience in a specific location [16]. The aim of these services is tracking and navigating to transfer time based push messages.
As a result of increasing penetration rate of mobile advertising, targeted advertising, which allows sending personalized information (including personalized price offers) to customers, has drastically affected from LBSs with regard to companies’ marketing and communication strategies. As a consequence of this situation, LBSs must reflect consumer interests more sensitively for implementing targeted advertising systems.
During the recent years, location identification adapted recommender systems in targeted advertisement is conducted a substantial base which enhance companies’ transmission channels to customers with limited advertisement budgets [36]. Thus, LBSs are relied on time and preference based services for tracing and tracking the current position of application users. The main aim is the prediction of future visits by considering previous places [10]. On the other hand, users are commonly irritated from instant push up messages because of unconcerned context and irrelevant price offers [41]. Additionally, privacy concerns that users are not tend to share their current locations, are another aspect for not to be interested in these push up messages. These concerns cause the low penetration of LBSs [30]. Furthermore, user relevancy and customer needs sustainably vary when their current and future locations are analyzed. Therefore, updating contexts to the recent ones enables generally-accepted and commonly used recommender systems. For this reason, companies are tend to find out more effective methodologies that ensure both individualized concepts and commonly preferable alternatives. To cope with digressiveness problems, previously visiting daat is required. Hence, potential location identification based recommendation systems can be evaluated as the most crucial point to determine the user purchasing probability when considering future visits [33].
In addition to all these, social platforms are another part that firms are utilized for both social relation data and current positions of the users for better understanding of customer needs and expectations. With the development of social networking, it is extremely beneficial to recommend advertisements based on social information of users. Social platform based recommender systems utilize the social relation data, such as friendships and current locations of the users to improve recommendation results. The main contribution of using social platforms for LBSs is that personal offers for customers can be conducted considering their user data and comments and also, shared location information. These circumstances enhance recommender system sensitivity for personalized location offers [22].
Although there are crucial advantages of using social platforms location recommendation systems, vague information can be appeared due to the lack of user shares and profile specification. For that reason, fuzzy set theory provides coping with this imprecise information in an efficient manner with the help of linguistic variables. Human reasoning works better on the range of the linguistic values as it better captures the vagueness. Taking this as an advantage, the recommender system can use fuzzy information to interpret user locations, social platform data (such as current location, previously visited locations or comments), and similarity of two users more efficiently [23].
From this point of view, this study focuses on performing an intuitionistic fuzzy set theory based recommendation system by combining the data gathered from three commonly used social platforms: Zomato, Trip Advisor and Foursquare. To perform restaurant offers to appropriate social platform users, a sentiment analysis is conducted to selected restaurants and number of negative, positive and neutral comments are extracted. After that, restaurant and location information are practiced by adapting user, restaurant and location clustering via fuzzy clustering. Finally, intuitionistic fuzzy similarity matrix based collaborative filtering is performed for restaurant offers to similar users.
Literature review
This study relies on the combination of sentiment analysis and location based systems. Thus, a brief literature review is presented to reveal former studies and applications and highlight the points that have been ignored in the literature.
Sentiment analysis
Sentiment Analysis is the process of the classification of opinions about a particular topic from a written text which contains positive, negative or neutral comments by well-developed lexicons. The data is mostly taken from documents, sentence, entity or aspect. The wide application of sentiment analysis can be realized both in the studies from academicians and practitioners ([26]). For instance, companies are more prone to understand people’s perceptions and consumer reviews on products in order to enhance their services ([9]). Additionally, purchase decision on marketing ([3, 12]) and tourism management ([1]) are two of the most studied topics for sentiment analysis to explore different opinion groups.
From literature review, sentiment analysis is mainly adapted in terms of applying sentiments to the documents or focusing on seeking sentiments of words, subjective explanations, subjective clauses and topics stored in web sites or databases [29]. Besides that, sentiment analysis is studied on the purpose of identifying opinions for different business intelligence areas [18]. Former studies mostly cope with customer segmentation ([38, 40]), social text summarization ([43]), social and conventional media ([44]), social reviews ([14]) cinema audience evaluation via Internet Movie Database (IMDb) ([24]), e-learning ([4]) stock market predictions and financial evaluations ([8]), market evaluations considering social networks ([22]) and early identification of emerging political topics on Twitter ([35]) etc.
Sentiment analysis has generally suffered from three main issues: aspect detection, opinion word detection and sentiment orientation identification [45]. Aspect is the major indicator for the determination of the topic of an opinion [19]. Review ratings and overall ratings can be stated from numerical values. On the other hand, text comments should be analyzed for the individual assessment of the opinions which could be difficult to extract from the document directly. For example, a comment can include positive idea about services but it can also implies negative idea related to the menu. Thus, aspect level based sentiment analysis can be useful for understanding the customer satisfaction level according to different co-subjects [5].
In addition to the advantageous proceeds of sentiment analysis, differentiation of the comments by clustering methods decreases the need of dealing with huge amount of training data with a high level of accuracy ([25, 34]). Clustering methods appeared in the literature were generally practiced with hierarchical methods, compartmentalize methods, density based methods that search sets appeared in low densities and plotting raster based approaches that decrease the computation time ([2, 32]). Note that the papers which used clustering and sentiment analysis together generally concentrate on implementing different types of methods in the first stage of sentiment analysis. However, if clustering is applied after the sentiment analysis step, tweets could be evaluated with more precise classes [29].
In our study, first the corpus which is a large and structured set of texts that are prepared for further analysis, is gathered by text cleaning process. For assigning a polarity (positive or negative) to each word and query, both the synonyms and antonym should be processed with automatically or manually detected seed words, After that, transforming the unstructured data to term-document matrix (TDM) is conducted for displaying the corpus. The TDM is described with a matrix form in which the documents or sources can be presented with the rows and terms represented with the columns. After TDM is constituted, the proceeding stage is to determine novel and useful patterns from this matrix. For describing the themes of comments, classification and clustering methods are two of the most useful methods to clarify the number of positive, negative and neutral comments as adapted in this study [42].
As mentioned before, social media analysis has been studied mostly for the further analysis of sharing comments and giving information via Internet as a straightforward way of understanding customer needs. Additionally, most of the studies emphasis on gathering essential information from social media using data and text mining approaches. Although it is a well-adapted method among researches and great number of applications were appeared in the literature, the gap in analyzing the outcomes of these methods has not been filled [39]. Thus, this study proposes making convenient outcomes from social media reviews in order to understand the needs of customers and extract business value.
Location based recommendation systems
Location-based systems can be defined as the overall services that use the spatial location of the user to enhance a special marketing exhorter [28]. In other words, location-based systems gather real time location data of consumers, establish mobile connection to other mobile devices and push a relevant content when they appeared in appropriate field. This enables the improvements in location-aware functions as realized in shopping malls [7]. The main contribution of these systems is extracting knowledge from where the services are settled. The knowledge can be gathered from geographic locations of users who downloaded the application-related tool in their mobile phones. In addition to that, sensor data captures interactions between users and locations via Bluetooth technology. Thus, in order to increase the brand awareness, large-scale retail companies began to utilize LBSs for their retail activities [47]. Other specific applications can be summarized as location-based advertising and tourist route recommender systems [20].
From the reflections of literature review, LBSs have been widely adapted in indoor positioning applications such as location-based advertising and location-based mobile advertising, mobile shopping ([48]) recognition of previously visited places for making further predictions ([37]), and social media based recommender systems [17]. As seen from these studies, location-based applications facilitate the tracking and traceability of individuals’ physical moving in indoor fields. Thus, positioning and segmentation algorithms play critical roles in interpreting individuals’ personal positions and visiting tendencies to make suggestions for location-specified promotions.
Although the advantages of location-based systems are obvious, some drawbacks are discussed when practicing. The problems that users realized are mainly consist of privacy issues and concerns of the messages: customers do not tend to share their location data via Bluetooth and generally, do not want to get push up messages. Besides privacy issues, context of the promotion message should be related to the users interests [6]. For instance, location based recommendation system is generally requires the data gathering from diversified social platforms for better construction of relationship between customers, locations (shopping malls) and their relationships with each other [11]. According to the mentioned situations above, motivation of this study is location based recommendation system configuration by considering previously visited location data and users. Also the identification of alternative locations with respect to the similarities are assigned for providing wide range of location based alternatives. Thus, location clustering can be adapted for constructing the relativeness of the restaurants and alternatives by considering the similarities of location clusters and restaurant groups.
Interval valued intuitionistic fuzzy sets and interval valued intuitionistic fuzzy location based system methodology
In this part, a brief background of interval valued intuitionistic fuzzy set theory is given and proposed methodology is explained with theoretical details for better understanding.
Preliminaries for interval valued intuitionistic fuzzy set theory
Intuitionistic fuzzy sets (IFSs) are defined using membership and non-membership degrees
in order to present the imprecise information. The interval-valued intuitionistic fuzzy
set (IVIFS) is a modified form of IFSs that contains T[0,1] as a subinterval of interval
[0,1] and assume that X be a set as a non-empty set. An interval valued intuitionistic
fuzzy set in X is identified as
To calculate the distance measure of Š 1 and Š 2, the following formula is adapted:
For 0 ⩽ d ( Š 1, Š 2) ⩽1.
Assume that
IVIFS based similarity measure for Š 1,
Š 2 is given in the following:
Where,
We propose a location based recommendation system based on the combination of sentiment analysis that determines the related location’s view according to the customers (opinion targeting) and clustering based on the similarity of users and locations. Figure 1 represents the structure of the proposed location based recommendation system which is comprised on four key components: Data collection and preprocessing, sentiment analysis, grouping users and locations and recommendation system structure. First of all, data is needed to be described in interval valued intuitionistic fuzzy number form in order to apply the recommended algorithm. This can be already achieved by considering the data gathered from diversified social media platforms and thus, data should be stated with intervals in order to reflect different aspects of ratings. Secondly, review texts and review titles should be analyzed as negative, positive and neutral comments and these comments should be named as “number of positive comments”, “number of negative comments” and “number of neutral comments” for each restaurant. Additionally, users and locations are grouped according to interval valued fuzzy c means clustering before the implementation of interval valued intuitionistic fuzzy recommendation system. The extracted groups are utilized for making proper location suggestions for different customer groups.

The structure of the proposed location based for recommendation system restaurants.
The sentiment analysis applied in this study is adapted from Turban et al. [13]’s study. The steps of the methodology can be summarized as follows:
Step 1-Establishing the corpus: Corpus is a large and structured set of texts that are prepared for further analysis. The aim of this phase is to gather all of the consumer reviews and comments from the related social platforms which are relevant to the problem. Quality and number of the documents considered in the corpus are two of the most substantial factor for the effectiveness of the separation. Note that, the corpus should be defined and collected using manual or automated techniques such as software programs that periodically collect data from diversified sources. The source of data may contain HTML files, emails, web blogs or textual documents. As these various types of documents are collected, they should be organized before the application of sentiment analysis. In this study, user reviews formed the corpus.
Step 2- Sentiment Analysis: As described from previous section, sentiment analysis includes the utilization of natural language processing, document analysis and computational linguistics to clarify and extract subjective information in source materials. The term opinion mining is also used instead of sensitivity analysis. The aim of sentiment analysis is to determine the position of a user considering a relevant topic or the determination of overall contextual polarity of a document. The attitude may be his or her judgment or the intended emotional communication.
Step 3- Setting up the Term Document Matrix: Term Document matrix (TDM) aims to transform the unstructured data into a structured representation of the corpus. The TDM is a matrix form in which the documents are represented by the rows and terms are stated by the columns. In Fig. 2, a sample TDM matrix is given.

A sample TDM matrix.
The numerical values in the TDM matrix denote the relationship between the terms and the documents such as how frequently a term appears in a document with various measures. For converting unstructured data into a TDM matrix, the underlying assumption is that the meaning of the documents can be introduced within the matrix with the measures in the cells. On the other hand, all the terms are not equally important for representing a document due to the auxiliary verbs which should be excluded from the process. Note that the mentioned terms are called “stop terms” and they should be precisely determined before or during the TDM phase. Additionally, “include terms”, which explain the predefined terms that are chosen for establishing TDM, should be described in advance. Besides including terms, the “synonyms”, which the pairs of terms are refined at the same time, should be clarified. Finally, substantial phrases can be extracted after obtaining the entire class instead of separate words. Because the success of the sentiment analysis relies on the accuracy of TDM, the phase involves feedbacks that the analyzer can track the former tasks and renovates them for better outcomes.
Step 4- Specifying the themes: After building the TDM and dealing with the dimensionality problem, the next task is to extract novel and beneficial patterns from this matrix. In order to determine the themes of reviews, k-means clustering is utilized. From k means clustering adaptation, theme groups can be extracted for further recommendation system adaptation.
Step 5- Analysis of the results: The overall results of sentiment analysis are evaluated with defined themes to analyze the performance of each theme.
Among other clustering approaches, fuzzy c-means (FCM) is identified as the most
commonly utilized clustering technique which is relied on the minimization of the
deviation to cluster centers. Minimization of objective function can be denoted as a
nonlinear optimization problem as follows:
Note that Z shows the data set, U presents the fuzzy partition matrix, V denotes cluster centers’ vector. Additionally, N denotes the number of observations, c shows the number of proper clusters and μ is stated as the membership value. Additionally, m is fuzzifier that identifies the fuzziness degree. In the formula, z j - v i is called as the distance measure for observation j and cluster center i [31].
The phases of fuzzy c-means (FCM) algorithm are identified by Babuska [31]: Establish
U = [u
ij
]
matrix,
U(0)
At t-step: Define the vectors of the centers
V(t) = [v
i
]
using
U(t)
Update the values of
U(t),
U(t+1)
If ∥U(t+1) - U(t) ∥ < ɛ is the stopping criteria; then return to step 2.
The methodology of interval valued fuzzy c means is an extended form of fuzzy c means
clustering. In this clustering paradigm, two values of the fuzzifier (m1 and m2) should
be considered for upper and lower limits. Thus, two objective functions are needed for
conducting the clustering process [15].
Multi criteria IVIFRS includes the fundamental functions expressed in the
following: Prediction: Express the values of
[μ
lD
(D
i
) -,
μ
lD
(D
i
) +] ,
[v
lD
(D
l
) -,
v
lD
(D
l
) +] ,
∀l ∈ {1, . . . , s} ,
∀ i ∈ {1, . . . , k} . Recommendation: Find out i* ∈ [1,
s] to ensure
Considering Equation (8), assume that X1 and X2 be two IVIFMs for multi criteria based IVIFRSs. The interval valued intuitionistic fuzzy similarity degree (IVIFSD) is stated in the following way:
The formulas to predict the values of linguistic labels of user
Pu(∀u∈{1,...,n})
to location
Sj(∀j∈{1,...,m})
according to the restaurants D1,
D2, . . . D
k
are addressed as given in [21]:
Note that composition operations are required for the aggregation of multiple criteria.
The composition operation is adapted as follows:
Since the increasing potential of online marketing and sales operations, social networks have become an essential tool for the extension of communication with the customers. Thus, diversified types of online congregations, firms, blogs and forums focused on the development of their social network attraction. Therefore, data gathered from social networks should be evaluated for constructing location based systems. In addition to that, the need of competitive analysis of the shopping malls should be extracted from the differentiation as positive and negative. Contribution of the feelings to recommendation system will be beneficial especially in offering specialized alternatives (here restaurants) [36].
Data collection and preprocessing
The data was gathered from various social media platforms (Trip Advisor, Zomato and Foursquare) and includes 251 restaurants, 201 users and 136 places. Data contains both user information and location information. The reasons for using different social media platforms are (1) Reflecting different users’ reviews and ratings (2) Combining diversified data sources for better information extraction, (3) Provide missing information from other social media platforms. Trip Advisor and Foursquare are selected because they are well known and widely used across the world. They also provide enormous amount of user content which enhance the results of the proposed methodology. In addition to social media data, Google search engine is also utilized for reflecting number of Google reviews, average hours spent in the shopping mall and rating scores. Location data contains only the restaurants in Istanbul and numerical variables are standardized for better representation in clustering process. The list of the variables that are used in the analysis can be found in Table 1.
Variables used for location based recommendation system provided by users
Variables used for location based recommendation system provided by users
Before implementing sentiment analysis, the following steps are conducted: (1) Identify the sentiment of user reviews as positive and negative. (2) Remove reviews which are not meaningful and have missing character(s) (3) Remove stopwords and punctuations in the corpus (4) Converting word into lower cases. After number of positive and negative reviews are gathered, locations and users are grouped for constituting homogenous groups. These groups can be beneficial for offering a great variety of locations to diversified users.
Since location data is gathered from various sources, ranking, average point, average price and score distribution are stated as intervals. A sample dataset is given in Table 2. Note that user and restaurant information can be represented in the similar manner as Table 2.
A sample dataset for location clustering
Finally, data for interval valued intuitionistic fuzzy recommendation system is prepared. The data contains four dimensions: user group (m), restaurant group (r) location group (n) and ratings (k). In Table 3, a sample of data is presented for better understanding before adapting recommendation system methodology.
A sample of data for location based recommendation system
Location data obtains shopping malls in Istanbul, recommended time to spend, location ranking, number of comments, score distribution (Excellent, Very good, Moderate, Bad and Terrible) and foreign and local comments from the mentioned social platforms in order to reflect different aspects of the shopping malls. Additionally, restaurant data includes restaurant ranking, average price, recommended time as seen in Google search, number of comments in Zomato and Trip Advisor, average score on Zomato and Foursquare, number of votes, number of negative and positive comments, average point of places nearby the restaurants are also implicated in restaurant grouping. User data is gathered from Trip Advisor including membership time period, participation level, number of contributions, average score of previously visited places, number of helpful votes, number of added photos and number of followers. Note that some of the users do not have enough necessary information that the relevant columns are marked as “Null”.
Step 1. Sentiment analysis: In the first stage, sentiment analysis is adapted for determining the classes of negative, positive and neutral comments for locations from Trip Advisor. After text cleaning process in which misspelled words are eliminated, seed words are selected for noticing positive comments as (’perfect’, ‘great’, ‘good”, ‘very good’, ‘fair’, awesome’,’fantastic’), negative comments as (‘puff’, ‘off’, ‘bad’, ‘poor’,’terrible’) and neutral (‘don’t know’, ‘no comment’) as appeared with Turkish words.
The next step of the sentiment analysis is the gathering of the term-document matrix that determines the words as “frequency numbers” for handling the significance and importance levels of the terms appeared in the related comment texts. The literature presents various functions to be used for constituting the TDM. In this study, TF-IDF (term frequency-inverse term frequency) function is used. Note that TF-IDF is a combination of term frequency and inverse document frequency that produces a composite weight for each term in related document. By utilizing TDM, the unstructured comment data is transformed to structured form. Finally, clustering the comments based on TDM data is adapted via Rapidminer 5.3 considering the preprocessing of the data. For instance, comments are separated to words, transformed to lower case and filtered from irrelevant prepositions and conjunctions. After the preprocessing, k-means clustering is adapted by determining the most appropriate number of clusters (k). Note that different values between 2 and 8 is applied and according to the average distances from cluster centers, best clustering is performed when k = 3. The number of extracted comments for some of the locations (shopping malls) are presented in Table 4.
Number of positive, negative and neutral comments from Trip Advisor
Number of positive, negative and neutral comments from Trip Advisor
Step 2- User, location and restaurant clustering: As mentioned before, location similarity and grouping is necessary as the initial step for location prediction of future visiting tendencies of the customers. Thus, in the second stage, users, restaurants and locations are grouped using interval valued fuzzy c means clustering. In order to determine the optimum number of groups, Xie-Beni index values for different values of c parameter are determined and four clusters are gathered from whole dataset of users and locations. Second, cluster centers are randomly assigned and after that, distance between cluster centers to the given data is measured. Finally, locations are formed as groups similarly using R package as seen from Figs. 4 and 5. In this step, user clustering is explained in details to provide better understanding of the clustering process.

Trip Advisor screen shots for some of the properties.

Cluster plot for locations.

Cluster plot for users.
After processing twenty iterative steps of fuzzy clustering to user data, the algorithm is stopped with an acceptable error. As seen from Table 5, 5 of the observations are assigned in Cluster 1 (‘passive social media users’-P1), Cluster 2 (‘prestigious place followers using active social media’-P2), and Cluster 4 (‘frequently visiting users using social media actively’-P4) and 3 observations are appeared in Cluster 3 (‘causal people using active social media’-P3).
Centroid and spread points of clusters
Additionally, Table 6 indicates
the centroid points from observed data and score function is calculated for making
comparisons properly. Therefore, initially prototypes are presented as interval valued
fuzzy numbers. The last columns of Table 5 denote the sum of squares (SSQ) between the object
Membership degrees of each cluster for each object for user clustering
In addition to user grouping, location groups are denoted as “popular place-S1”, “rarely visited place-S2”, “middle income oriented place-S3” and “rich customer oriented place-S4” for indicating the location property of the cluster. At last, restaurants are grouped as “cheap restaurant-D1”, “prestigious restaurant- D2”, “beef and steak restaurant-D3” and ”café-restaurant- D4”.
Step 3-Interval valued Intuitionistic Fuzzy recommendation system: After gathering clusters, multi criteria interval valued intuitionistic fuzzy recommendation system (IVIFRS) is applied to user, restaurant and location groups considering transformed IVIFN based data of restaurants. Recommendation system matching for each user group to a specific restaurant is applied by the determination of similarity measures and predicting the values of linguistic labels of user P u to location S j and naturally, restaurants. Results are listed as seen from Table 7. An illustrative example is given to explain the adaptation of IVIFRS to clustering section in the following:
Training dataset for the values being predicted according to the proposed method
According to the similarity calculation, the values given in Table 7 are utilized to predict
P4. The parameters are applied as α = 0and
δ = 0.5, and also
w1i = w2i = w3i = 0.2
respectively. Similarity between other users are calculated as SIM
(P1 - P4)=0.76, SIM
(P2 - P4)=0.71 and SIM
(P3 - P4)=0.43. The predictive
values of IVIFRS based methodology are given as
Finally, defuzzification is adapted as D1 = 0.54, D2 = 0.23, D3 = 0.45 and D4 = 0.29.
From Table 8, restaurants are recommended from restaurant group lists to users and location groups. As seen from the results, users who visit popular place-(S1) generally prefer most crowded shopping malls. In the similar manner, users who generally prefer to go middle income oriented place are tend to eat local food based restaurants. Causal people using active social media are exactly opposite of the people who visit popular places and locations are mostly focused on middle income level because of their tendency to go favored restaurants which are “esoteric” that address “high income” group.
Restaurant recommendation results to each user group with each location group
The impact of sentiment analysis and clustering assisted interval valued FcM algorithm on fuzzy data can be realized by comparing the proposed methodology with several methodologies in the same context. The selected studies for the comparison process are Son and Thong [21]’s study which is related to medical diagnosis and Nilashi et al. [27]’s that proposes a recommender system for tourism industry. For better understanding of the comparison phase, these two studies are briefly selected for the reasons given as follows:
(1)Son and Thong [21]’s study presents intuitionistic fuzzy set theory adaptation to recommender systems for measuring the contribution of using interval valued intuitionistic fuzzy set theory rather than using intuitionistic fuzzy set theory in recommender systems and also, evaluating sentimentanalysis phase contribution to existing recommender systems.
(2) Nilashi et al. [27]’s study is selected for the similar study topic and comparison of social platform based recommender systems to demonstrate the contribution of fuzzy set theory to recommender systems.
The comparison is conducted based on Mean Absolute Error (MAE) and the computational time. The same methodologies are adapted to the given dataset and executed on Intel® Celeron ® CPU 847@ 1.10 Ghz 4 GB RAM. The results are considered using the intuitionistic defuzzification method presented in Albeanu and Vladicescu [15] for 10 more iterations. Results are presented in Table 9.
According to the final results, proposed method is clearly outperformed considering with both MAE results and computational time especially when number of iterations are increasing. Some of the concluding remarks from the comparison phase are given below:
(1) Multi criteria interval valued intuitionistic fuzzy recommendation system (IVIFRS) is capable to perform the recommendation of the special offers using both for interval valued intuitionistic fuzzy data and semi- interval valued intuitionistic fuzzy data. Interval valued intuitionistic fuzzy data provides better presentation of uncertain data (recommended time, score distribution, average price etc.). This situation provides the late adaptation of defuzzification process which minimizes the information loss while performing clustering process.(2) Interval valued intuitionistic fuzzy recommendation system is integrated with clustering process which provides the broad access of the similar alternatives and increases the acceptance of the special offers (3) Using both semi-interval valued intuitionistic fuzzy sets and interval valued intuitionistic fuzzy sets enables the flexibility and usefulness of proposed approach as indicated in Son and Thong [21]’s study.
(4) Nilashi et al. [27]’s study applied dimension reduction (PCA) to the relevant data in a similar topic. Our dataset also includes considerable number of features. On the other hand, with using fuzzy adaptation to clustering techniques, dimension reduction phase is eliminated with using less computational time to more complicated dataset (interval valued intuitionistic fuzzy data presentation) as seen in Table 9.
(5) Although Son and Thong [21]’s fuzzy adaptation to recommender system is similar to our approach, interval valued intuitionistic fuzzy application to recommender system outperformed better when considering MAE and computational time results except one type of iteration in Table 9. This indicates the usefulness of sentiment analysis application outcomes (extracting positive and negative comments) to fuzzy clustering process and fuzzy recommender systems.
As seen from the overall results, interval valued intuitionistic fuzzy recommender system is adaptable to social platform data with many contributions: better uncertain data presentation, better computational time and MAE results. If the dataset size increases to large volumes, the amount of information to be processed will exceed the computing capacity of a single machine. In such cases, distributed computing that divides the original dataset into smaller subsets that are obtained with multiple nodes of a cluster, commonly used. Thus, each node proceeds only the divided data stored in it. The outputs exist from all the nodes and then these outputs are aggregated to obtain the final result [50].
In this paper, we focused on implementing interval valued intuitionistic fuzzy recommendation system using sentiment analysis to reveal number of positive, negative and neutral comments made by users and additionally, user, location and restaurant clustering are conducted to provide more options to recommendation system users. In order to implement user, location and restaurant clustering properly, data is manually gathered from social media platforms before restaurant recommendation system is established. For this reason, data is extracted from Trip Advisor, Zomato and Foursquare for adapting the clustering process that will ensure broad range of alternatives in mentioned three dimensions. Note that, users, restaurant and location clustering are implemented using interval valued fuzzy c means clustering separately and user data considers the previously visited location scores which enable the representation of historic base of the users. Then, IVIFRS methodology is conducted for restaurant recommendation utilizing the formed user, restaurant and location groups.
The reasons for conducting interval valued intuitionistic fuzzy numbers based decision making process are: (1) interval valued fuzzy sets provide more degrees of freedom in decision making process when users are not exactly express the membership and non-membership degrees. Thus, modelling the uncertainty will be easier compared with Type 1 fuzzy numbers and triangular fuzzy numbers, (2) Compared with other fuzzy sets, IVIFSs are able to present uncertainty of inaccurate information by closed interval based memberships and non-memberships. This ensures detailed representation of intuitionistic fuzzy sets while evaluating vague information [21].
Results demonstrate that using such this social platform data considering various locations with interval valued fuzzy numbers has the potential to represent customers’ life style and interests by not using crisp numbers for getting insight about each individual user and location properties before implementing personalized recommender systems. In addition to that, fuzzy recommendation system is applicable and effective especially not having crisp values in the related database. The novelty of the paper is not only the effective usage of social platform data via sentiment analysis and fuzzy clustering. In addition to these properties, the formulas to predict the recommendation possibility based on similarity measure using interval valued intuitionistic form of fuzzy recommender system and late adaptation of defuzzification method are other aspects of the effective usage of recommendation systems. Besides that, the paper fulfills the gap indicated in Son and Thong [21]’s study that the small and medium sized real dataset implementation of intuitionistic recommendation system can be beneficial and also, hybrid algorithm based on fuzzy clustering adaptation will enrich recommendation system performance in advance.
To sum up, the proposed recommender system has some advantages when compared with other fuzzy recommender based systems: (1) Better presentation of data that was gathered from diversified sources such as Zomato, Trip Advisor and Google search by using interval valued intuitionistic fuzzy dataset. (2) Contribution of text data (comments) to recommendation process by using sentiment analysis. (3) Fuzzy c means clustering is utilized for gathering broad range of restaurant offers to specified customer groups (segments). (4) Contribution to imprecise social platform data to location based recommender systems is obvious as seen in Table 9.
For further studies, applications of fuzzy recommendation system with massive datasets should be examined in order to check the efficiency of the proposed approach. Secondly, other applications of fuzzy clustering frameworks such as entropy based clustering c-medoids ensure the implementation of the study and generalization of fuzzy recommendation systems. Finally, other theoretical analysis of IVIFRS operations can be conducted using t – norm. These future works will enrich the dissemination of fuzzy recommendation systems both for practical and academic perspectives.
