Evaluation of opinion visualization techniques

Abstract

In this article, we report the findings of a usability study of the visualizations used in opinion mining systems. The objectives of this study are, first, to rank the visualizations of the opinion mining systems and, second, to identify important visualization metrics. A questionnaire survey was designed to ask users to rate, on a Likert scale, their level of agreement or disagreement about the 11 selected visualizations against a set of information visualization metrics. The data were collected by conducting seminars and using a web-based online questionnaire (N = 146). The collected data were analyzed using descriptive statistics to rank the visualizations and an independent-samples t-test to investigate differences between the perceptions of the two groups of respondents (the participants of the seminars and of the online questionnaire). The results revealed that simple, eye-pleasing, easy-to-understand, user-friendly visualizations that require less pre-knowledge were rated higher than others. It is concluded that the participants of the online questionnaire mostly required more pre-knowledge to comprehend the visualizations than the participants of the seminars. The important information visualization metrics are eye pleasing, easy to understand, user-friendly, informative design, usefulness, and representation style. The results of this study could aid in the design and development of visualizations for opinion mining systems.

Keywords

Information visualization, opinion mining, opinion visualization, sentiment analysis, visual opinion mining

Introduction

Opinion mining processes a set of search results for a given item, generates a list of product features, and aggregates opinions about each feature.¹ The objectives of opinion mining include mining, summarizing, and visualizing people’s opinions about organizations, individuals, government, products, and services from online reviews. Mining, summarizing, and visually presenting online reviews manually is time consuming and tedious.² Different opinion mining systems have been proposed in the literature for this purpose, focusing on various aspects, for example, identifying and measuring opinion words.³,⁴ However, considerably less work has been done on finding effective ways to present these opinions to end users.⁵

There is a need for the careful design of visualization techniques to present customers’ opinions with sufficient visual cues and different levels of abstraction (summarization), as this information has a significant impact on building a successful business.² Visualization techniques empower customers to draw meaningful conclusions by giving a purposeful representation of the data. They provide an appropriate starting point for the interactive exploration of interesting patterns.⁶ People’s opinions expressed as free text present many challenges for visual analysis because of their heterogeneity, volume, and high dimensionality. The fundamental challenge is to present a complex multidimensional thematic space in easy-to-understand ways.⁷ It is also difficult to provide integrated views of data with side-by-side visual comparisons between the opinions of different customer groups.² Another noteworthy challenge is to present information in a way that is easy to understand at a glance, even without particular training.⁸

Different kinds of visualization techniques have been applied to present people’s opinions, including radial visualizations, bar charts, pie charts, and graphs. Every technique has its own level of abstraction, advantages, and disadvantages. Opinion visualizations are typically designed to meet the needs of data analysis experts; therefore, most of the existing opinion visualization techniques do not consider novice end users. Usability issues are critical for information visualization.⁹

Better usability of these visualization techniques is required when they are used by customers.¹⁰ Each technique requires a study in order to determine its usability and usefulness. There is also a need to consider users’ recommendations while designing visualizations of opinion mining systems, which is only possible when users evaluate the existing techniques. To the best of our knowledge, no prior study has been conducted to evaluate opinion visualization techniques from the users’ perspective.

This article presents the evaluation of innovative visualizations of some of the existing opinion mining systems. To be specific, this study aims (1) to rank the visualizations of the opinion mining systems, (2) to investigate differences between perceptions of the two groups of the respondents (the participants of the seminars and the online questionnaire), and (3) to identify important visualization metrics.

The rest of this article is organized as follows: section “Background study” discusses the noteworthy visualizations of existing opinion mining systems. This is then followed by section “Usability,” which presents existing work on usability studies. Section “Research methodology” discusses the research methodology used in this study. Section “Results and discussion” presents the results and discussion, and finally, section “Conclusion” concludes this article.

Background study

A systematic literature review process comprising searching and screening steps was used in the study. Pertinent research papers were searched from different digital libraries, including the ACM Digital Library, IEEE Xplore, ScienceDirect, Scopus, Emerald, SpringerLink, CiteSeerX, Web of Science, Google Scholar, and Google. The keywords used were “opinion mining,” “sentiment analysis,” “visual opinion mining,” “opinion mining system,” “opinion mining on online reviews,” “opinion mining on web,” “opinion extraction,” and “sentiment classification.” More than 50 relevant papers were gathered in the search step. Three levels of screening were employed using three criteria. In the first level screening, 30 papers were selected based on the research focus, that is, opinion mining (first criterion). At the second level, 16 papers that discussed opinion mining systems with visualizations were selected (second criterion). There were four main types of visualization, namely, radial, graph, hierarchical, and bar chart. Finally, 11 papers were selected based on intuitiveness, complexity, and level of abstraction for each of the visualization types (third criterion). Table 1 shows the details of the selected visualizations. These visualizations are discussed below.

Table 1.

Details of the selected visualizations.

No.	Authors	Visualization	Type of visualization
1.	Wu et al.²	Opinion wheel	Radial
2.	Gregory et al.⁵	Rose plot variation	Radial
3.	Chen et al.⁷	Coordinated graph	Graph
4.	Morinaga et al.¹¹	Positioning map	Graph
5.	Miao et al.¹²	Line graph and pie chart	Graph
6.	Xu et al.¹³	Comparative relation map	Graph
7.	Gamon et al.¹⁴	Tree map	Hierarchical
8.	Oelke et al.¹⁵	Visual summary	Hierarchical
9.	Gamon et al.⁸	Glowing bar	Bar chart
10.	Wanner et al.⁶	Bar chart with symbols	Bar chart
11.	Liu et al.¹⁶	Bar chart	Bar chart

Radial visualization

Radial visualizations, which arrange data in a circular or elliptical fashion, are an increasingly popular technique in visualization research.¹⁷ The opinion wheel² and the rose plot⁵ utilize radial visualization in opinion mining. The opinion wheel combines a positioning map inside an opinion triangle with a surrounding opinion ring (radial visualization), as shown in Figure 1. It provides an integrated view of multiple dimensions, such as temporal, demographic, and spatial information, for hotel customers’ feedback data.

Figure 1.

Opinion wheel shows customer opinion according to age.²

The opinion triangle, which is bounded by the opinion ring, is used to present customers’ opinions. The vertices of the triangle represent negative (N), positive (P), and uncertain (U) opinions. Each point inside the triangle represents a customer’s opinion, and its position determines the semantic orientation of the opinion. For example, a point closer to the positive vertex represents a positive opinion. The opinion ring consists of colored histograms of different data dimensions to present categorical information. The size and color of the histograms encode the number of customers in each age group, as shown in Figure 1. The strengths of the opinion wheel include a low level of abstraction and an effective multidimensional view of the data.
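To make the idea of position encoding semantic orientation more concrete, the following is a minimal sketch that places an opinion inside a triangle using barycentric coordinates. The barycentric mapping, the vertex layout, and the names `VERTICES`, `opinion_position`, and `nearest_vertex` are assumptions made for illustration only; Wu et al.² may compute point positions differently.

```python
import math

# Hypothetical vertex layout of an equilateral opinion triangle (assumed).
VERTICES = {
    "negative": (0.0, 0.0),
    "positive": (1.0, 0.0),
    "uncertain": (0.5, math.sqrt(3) / 2),
}


def opinion_position(negative, positive, uncertain):
    """Map three non-negative opinion weights to an (x, y) point in the triangle.

    The weights are normalized to sum to 1 and used as barycentric
    coordinates, so a larger weight pulls the point toward that vertex.
    """
    if min(negative, positive, uncertain) < 0:
        raise ValueError("weights must be non-negative")
    total = negative + positive + uncertain
    if total <= 0:
        raise ValueError("at least one weight must be positive")
    weights = {
        "negative": negative / total,
        "positive": positive / total,
        "uncertain": uncertain / total,
    }
    x = sum(weights[name] * VERTICES[name][0] for name in VERTICES)
    y = sum(weights[name] * VERTICES[name][1] for name in VERTICES)
    return x, y


def nearest_vertex(point):
    """Return the name of the vertex closest to the given point."""
    return min(VERTICES, key=lambda name: math.dist(point, VERTICES[name]))


if __name__ == "__main__":
    point = opinion_position(negative=0.1, positive=0.8, uncertain=0.1)
    print(point, nearest_vertex(point))
```

Under this assumed mapping, an opinion with a dominant positive weight lands near the positive vertex, which matches the reading of the triangle described above.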

A visual analytical tool was developed by employing a radial visualization to explore the sentiment contents in a large collection of documents.⁵ The authors modified the rose plot (originally used by Florence Nightingale in 1858¹⁸). The first modification was the use of colors with different shades (light and dark) to represent emotion categories, whereas the second modification introduced the unit circle in the rose plot to display the mean and deviation values of opinions by drawing the appropriate rose plots outside (larger than mean) or inside (smaller than mean).

In Gregory et al.’s study,⁵ each document was assigned a score according to the eight emotion categories consisting of four concept pairs (positive–negative, pleasure–pain, cooperate–conflict, and virtue–vice). The glyph on the right-hand side shows the higher value of negative and vice emotions than the mean value. The rose plot is a powerful visualization to analyze and compare a large collection of documents with respect to emotional categories. Gregory et al.⁵ used histograms to present the number of documents in each group.
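The relationship between the unit circle and the petals can be sketched as follows. This is only an illustrative reading of the modified rose plot: the scaling rule (a fixed radius step per standard deviation), the constant `STEP`, and the function name `petal_radii` are assumptions, not the procedure used by Gregory et al.⁵

```python
from statistics import mean, pstdev

# The eight emotion categories (four concept pairs) named in the text.
CATEGORIES = [
    "positive", "negative", "pleasure", "pain",
    "cooperate", "conflict", "virtue", "vice",
]

# Assumed scaling: radius change per standard deviation from the mean.
STEP = 0.25


def petal_radii(group_scores, collection_scores):
    """Return one petal radius per category for a group of documents.

    A radius of 1.0 lies on the unit circle (the collection mean); larger
    values are drawn outside the circle and smaller values inside it.
    """
    radii = {}
    for category in CATEGORIES:
        values = [doc[category] for doc in collection_scores]
        mu = mean(values)
        sigma = pstdev(values)
        z = 0.0 if sigma == 0 else (group_scores[category] - mu) / sigma
        radii[category] = max(0.0, 1.0 + STEP * z)
    return radii


if __name__ == "__main__":
    collection = [
        {c: 0.0 for c in CATEGORIES},
        {c: 2.0 for c in CATEGORIES},
    ]
    group = {c: 1.0 for c in CATEGORIES}
    group["negative"] = 2.0  # above the collection mean
    group["vice"] = 2.0      # above the collection mean
    for category, radius in petal_radii(group, collection).items():
        side = "outside" if radius > 1 else "inside" if radius < 1 else "on"
        print(f"{category:10s} {radius:.2f} ({side} the unit circle)")
```

With this convention, a group whose negative and vice scores exceed the collection mean produces petals that extend beyond the unit circle for those two categories.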

Graph

The graphs adapted for visualizations in opinion mining include coordinated graph,⁷ comparative relation map,¹³ positioning map,¹¹ line graph, and pie chart.¹² Chen et al.⁷ introduced a novel visualization approach for the analysis of conflicting opinions of customer reviews on the controversial bestseller novel “Da Vinci Code.” Terms (opinion words) extracted from positive and negative reviews were classified into different clusters using “TermWatch,” which is a visual interface in the IMPROVISE software. These terms were then presented using a coordinated graph (Figure 2).

Figure 2.

Coordinated graph representing positive and negative terms.⁷

The top and the bottom half of the graph present terms used in positive and negative reviews, respectively. The vertical bar thickness shows the number of terms for each month. The arcs connect months in which common terms appear and their thicknesses represent the number of common terms. The authors also used a spectrum graph to display the distribution of positive and negative reviews over time. The coordinated graph provides an effective way for the comparison of terms with respect to time.
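The quantities encoded by the bars and arcs can be derived from a simple mapping of months to terms. The sketch below is an assumption about how such values could be computed, not the TermWatch implementation; the function names `bar_thickness` and `arc_weights` and the sample terms are hypothetical.

```python
from itertools import combinations


def bar_thickness(terms_by_month):
    """Number of distinct terms per month (drives the vertical bar thickness)."""
    return {month: len(set(terms)) for month, terms in terms_by_month.items()}


def arc_weights(terms_by_month):
    """Number of shared terms for every pair of months that share at least one.

    Each key is a (earlier_month, later_month) pair; the value would drive
    the thickness of the arc connecting the two months.
    """
    arcs = {}
    for first, second in combinations(sorted(terms_by_month), 2):
        shared = set(terms_by_month[first]) & set(terms_by_month[second])
        if shared:
            arcs[(first, second)] = len(shared)
    return arcs


if __name__ == "__main__":
    # Hypothetical terms extracted from positive reviews, keyed by month.
    positive_terms = {
        "2003-04": {"plot", "fast", "thriller"},
        "2003-05": {"plot", "history"},
        "2003-06": {"church"},
    }
    print(bar_thickness(positive_terms))
    print(arc_weights(positive_terms))
```

Running the same two functions separately on the terms from positive and negative reviews would give the values for the top and bottom halves of the graph.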

Similarly, Xu et al.¹³ used a graph to compare the Nokia E71 with its competitors, that is, the Nokia E61, Blackberry Curve, iPhone, and Blackberry Bold 9000, on different features, such as size, function, and looks (Figure 3).

Figure 3.

Comparative relation map shows comparison between competing mobile phones.¹³

The red box shows the number of favorable statements for the Nokia E71, and the blue box shows the number of favorable statements for competing products. The comparative relation map was very helpful (1) to highlight the relative strengths and weaknesses of products, (2) to analyze threats from competitors and enterprise risks, (3) to support decision making and risk management, and (4) to design new products and marketing strategies.

There are many other examples of the use of graphs for opinion mining visualizations that utilize positioning map, line graph, and pie charts. Morinaga et al.¹¹ deployed two-dimensional positioning maps for the comparison of competing cellular phones. Five cellular phones with their corresponding four characteristics (keywords) were compared and plotted on a map. Finally, Miao et al.¹² presented a visualization system “Amazing” for mining and summarizing electronic products. In this system, a line graph was deployed to present the number of positive and negative comments about a product over time, and a pie chart was used to show the proportion of positive and negative reviews.

Hierarchical visualization

The third type of visualization technique is hierarchical visualization. The opinion visualizations that utilize the hierarchical technique include the tree map¹⁴ and the visual summary report.¹⁵ Gamon et al.¹⁴ introduced a prototype system, “Pulse,” with an intuitive visualization for mining customer reviews of cars. In this system, sentences were grouped together based on similar features and then displayed as boxes using a tree map, as shown in Figure 4. The size and color of the boxes indicate the number of sentences and the average sentiment of a feature, respectively. The tree map provides users with a high level of abstraction for common features, the most positive and negative features, and the overall sentiment associated with the features.
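The two encodings of the tree map (box size from sentence count, box color from average sentiment) amount to a grouped aggregation. The following is a minimal sketch of that aggregation only; it does not compute a tree map layout, and the sentiment scale of -1 to 1, the function name `treemap_boxes`, and the sample data are assumptions rather than details of the Pulse system.

```python
from collections import defaultdict


def treemap_boxes(sentences):
    """Aggregate (feature, sentiment) pairs into one box per feature.

    `sentiment` is assumed to lie in [-1, 1]. Each box records the number
    of sentences (box size) and the average sentiment (box color).
    Boxes are returned largest first.
    """
    grouped = defaultdict(list)
    for feature, sentiment in sentences:
        grouped[feature].append(sentiment)
    boxes = [
        {
            "feature": feature,
            "size": len(scores),
            "avg_sentiment": sum(scores) / len(scores),
        }
        for feature, scores in grouped.items()
    ]
    boxes.sort(key=lambda box: box["size"], reverse=True)
    return boxes


if __name__ == "__main__":
    # Hypothetical sentences from car reviews, already tagged with a feature.
    tagged = [("engine", 0.5), ("seats", 1.0), ("engine", -0.5)]
    for box in treemap_boxes(tagged):
        print(box)
```

A rendering layer could then map `size` to box area and `avg_sentiment` to a color scale.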

Figure 4.

Tree map representing common car’s features and overall sentiment associated with the features.¹⁴

Oelke et al.¹⁵ introduced a visual summary report for printers and book reviews. Visual summary report provides a quick analysis of printers’ reviews. It describes the frequent features of given products along with their corresponding sentiment based on a color scale. Blue and red colors with different shades were used to represent the positive and negative tendency of the sentiment of the product’s features, respectively, as shown in Figure 5. The size of the inner box represents how many people commented on a particular feature. The strength of visual summary is its scalability with respect to the number of features and products.

Figure 5.

Visual summary of printers’ reviews.¹⁵

Bar chart

The final group of visualization techniques involves the use of bar charts, that is, glowing bars,⁸ bars with different shapes,⁶ and a bar chart.¹⁶ Gamon et al.⁸ presented an analytic prototype system for news, “BLEWS,” which uses glowing bars to visualize the popularity of news articles and the liberal and conservative views about them. Two boxes of blue and red colors with varying sizes on both sides of an article headline were used to depict the number of liberal and conservative references, respectively, as shown in Figure 6. The size of the boxes depends on the number of references. The order of the articles from top to bottom reflects their popularity. Emotional charge (the degree of excitement and agitation of the author at the time of writing the text) was presented by a glow around the boxes.

Figure 6.

Glowing bars showing popularity and views about a news article.⁸

Similarly, Wanner et al.⁶ proposed an innovative visual tool for the sentiment analysis of news feeds. News articles were displayed on a daily basis; two horizontal lines represent one day and each colored object represents one news item, as shown in Figure 7. Symbols with different colors and shapes were used to show the presence of certain keywords in the news. The sentiment score of a news item was indicated by its vertical position. The strengths of this visualization are its zooming, filtering, details-on-demand, and similarity search operations, which can be applied to find interesting trends and to highlight particularities and emotional content in the news.

Figure 7.

Bars with symbols depicting various keywords in a news item.⁶

Finally, Liu et al.¹⁶ developed and presented a visualization system, “Opinion Observer,” to compare reviews of competing electronic products. “Opinion Observer” highlights the strengths and weaknesses of each product by utilizing a bar graph, as shown in Figure 8. The bars represent different features of the given products. The sizes of the bars above and below the x-axis represent the number of positive and negative comments about each feature, respectively. Different bar colors were used to differentiate between the given products. The strength of this visualization is its simple and easy-to-understand design.

Figure 8.

Bar graph comparing prominent features of competing products.¹⁶

Usability

In this section, we introduce usability and some of the existing work on usability of information visualization. The usability, expressiveness, and quality of visualizations are equally important to data usability in any system.¹⁹ According to the International Organization for Standardization, usability is “the extent to which intended users of a product achieve specified goals in an effective, efficient and satisfactory manner within a specified context of use.”²⁰,²¹ Usability can be examined from various aspects.²²,²³ Table 2 shows usability aspects defined in the literature, with the most common aspects being efficiency (time to complete tasks), effectiveness (percentage of tasks completed), and user satisfaction (how well the users liked using the system to complete these tasks). Usability can be measured by evaluating users’ experience of interacting with the system, which involves a focus on the interface.³⁰ Standard metrics exist today in many areas of information technology to measure performance, either of the system alone or of the combination of users and the system. For example, word error rate metric in speech recognition, precision and recall metrics in information retrieval, and efficiency, effectiveness, and user satisfaction metrics in human–computer performance are used to measure the performance of a system.³¹
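The three most common usability aspects can be computed directly from task logs and rating data. The sketch below follows the parenthetical definitions given above (time to complete tasks, percentage of tasks completed, and how well users liked the system); the record fields `completed` and `seconds`, the 5-point rating scale, and the function name `usability_summary` are assumptions for illustration.

```python
from statistics import mean


def usability_summary(task_records, satisfaction_ratings):
    """Summarize effectiveness, efficiency, and satisfaction for one system.

    task_records: list of dicts with a boolean "completed" flag and the
        time taken in "seconds" (hypothetical field names).
    satisfaction_ratings: ratings on an assumed 5-point scale.
    """
    if not task_records or not satisfaction_ratings:
        raise ValueError("task records and ratings must be non-empty")
    completed = [record for record in task_records if record["completed"]]
    return {
        # Effectiveness: percentage of tasks completed.
        "effectiveness_pct": 100.0 * len(completed) / len(task_records),
        # Efficiency: mean time to complete a task, over completed tasks only.
        "mean_seconds": mean(r["seconds"] for r in completed) if completed else None,
        # Satisfaction: mean rating reported by the users.
        "mean_satisfaction": mean(satisfaction_ratings),
    }


if __name__ == "__main__":
    records = [
        {"completed": True, "seconds": 30},
        {"completed": True, "seconds": 40},
        {"completed": False, "seconds": 90},
        {"completed": True, "seconds": 50},
    ]
    print(usability_summary(records, satisfaction_ratings=[4, 5, 3]))
```

Averaging time over completed tasks only is one design choice among several; a study could equally report time over all attempts or per-task medians.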

Table 2.

Usability aspects.

Authors	Usability aspects defined
Oulanov and Pajarillo²⁴	Efficiency, helpfulness, and adaptability
Brinck et al.²⁵	Functionally correct, efficient to use, easy to learn and remember, error tolerant, and subjectively pleasing
MIT Information Services and Technology Department²⁶	Navigation, language and content, functionality, architectural and visual clarity
Lee²⁷	Usefulness, effectiveness, satisfaction, supportiveness, and intuitiveness
Nielsen²⁸	Learnability, efficiency, memorability, low error rate or easy error recovery, and satisfaction
Booth²⁹	Usefulness, effectiveness, learnability, and attitude
Mayhew²² and Smith and Mosier²³	Consistency, user control, appropriate presentation, and error handling

Zviran et al.³⁰ investigated the effect of user-based design and website usability on user satisfaction for commercial websites by conducting a questionnaire survey. The data were analyzed using the System Usability Scale (SUS), and the results indicated that stock trading websites have the lowest user satisfaction compared with online shopping and customer self-service websites. Similarly, Jeong and Han³² analyzed the user interfaces of 775 mobile news websites in terms of screen space usage. The authors found that most of the screen space was wasted on advertisements. In Malaysia, the usability of the International Islamic University Malaysia (IIUM) website was investigated by conducting a case study with 10 participants.³³ The participants were required to fill in a checklist as a post-test questionnaire. The results showed that the interface conforms to at least 70% of usability properties such as visibility of system status, user control and freedom, consistency and standards, error prevention, and aesthetic and minimalist design. Kim and Kim³⁴ conducted a usability study to evaluate “dCollection,” a single consortium for institutional digital repositories in Korea, in terms of satisfaction, supportiveness, usefulness, and effectiveness. Two usability experiments (a laboratory test and a remote test) were conducted along with a focus group study. Based on the evaluation results, different ways of improving the usability of digital repositories were proposed, such as improving visual appearance and clustering and displaying similar documents together.

The influence of user satisfaction on customer loyalty and positive word-of-mouth in e-banking was studied by Casaló et al.³⁵ The results of this study showed that user satisfaction with a bank website in Australia had a positive effect on customer loyalty and positive word-of-mouth. Similarly, bank users were found to be more satisfied with tag-based interfaces than with conventional interfaces in both online and mobile contexts.³⁶ Another study explored the impact of the user interface on user satisfaction, effectiveness, and efficiency in learning the Adobe Flash CS4 software, and the results revealed that user satisfaction in learning this software was higher with a graphical user interface than with a command line.³⁷ Riel et al.³⁸ studied the effect of customer satisfaction on e-loyalty and found e-loyalty to be strongly influenced by customer satisfaction. Similarly, Dudek et al.³⁹ found that the speed, navigation, reliability, deliverability, and quality of results of a search engine influence users’ satisfaction.

Visualizations are effective only if users can interpret the underlying information. Additionally, the interaction mechanism of a visualization tool, which is used to manipulate the data and produce different views, contributes to the effectiveness of visualizations. Therefore, in addition to the typical usability measures of effectiveness, efficiency, and user satisfaction, a number of evaluation areas and metrics specifically geared toward information visualization have been proposed in the literature, including situation awareness, interaction, creativity, utility, cognition, and perception,³¹,⁴⁰,⁴¹ as shown in Table 3.

Table 3.

State-of-the-art information visualization evaluation areas and metrics.

Study	Assessment area		Suggested metrics
Zuk and Carpendale⁴²	Perceptual and cognitive		Ensure visual variable has sufficient length
			Do not expect a reading order from the color
			Color perception varies with size of colored item
			Local contrast affects color and gray perception
			Consider people with color blindness
			Pre-attentive benefits increase with the field or size variation
			Preserve data to graphic dimensionality
			Put the most data in the least space
			Remove extraneous link
			Provide multiple levels of details
			Integrate text wherever relevant
Zuk et al.⁴¹	Perceptual		Color, gestalt, aesthetics, pre-attentive
	Cognitive		Integration, reasoning, comprehension
	Usability		Consistency, feedback, error recovery
Amar and Stasko⁴³ (knowledge- and task-based framework)			Expose uncertainty
			Concretize relationship
			Determination of domain parameters
			Multivariate explanation
			Formulate cause and effect
			Confirm hypotheses
Shneiderman⁴⁴ (visual information–seeking mantra)			Overview, zoom and filter, details on demand
			Relate, extract, history
Wehrend and Lewis⁴⁵ (operational tasks)			Identify, locate, distinguish, categorize, cluster, distribution, rank, compare within and between relations, associate, correlate
Scholtz³¹	Situation awareness interaction		Perception, comprehension, projection
			Ability to view occluded information
			Ability to move up and down in the level of abstraction of the views
			High-level “undos”
			Data sharing, not view sharing
	Creativity		Quality of solutions
			Number of unique alternatives considered
			Degree of radicalism/conservatism of alternatives considered serendipitous solutions
			Time to come up with solutions, satisfaction with solutions
			Cost (person-time) to come up with solutions
			Cost of the solution versus the utility of the solution
			Ease of use of the support tool
			Buy-in to the use of the support tool
	Utility		Number of citations in the analytic product
			Analyst’s confidence in product recommendations
			Number of hypotheses investigated in product
Scholtz⁴⁶	Usefulness
	Efficiency
	Intuitiveness
Forsell and Johansson⁴⁷ (standardized heuristic)			Information coding, minimal actions, flexibility, orientation and help, spatial organization, consistency, recognition rather than recall, prompting, remove the extraneous, data set reduction
Brath⁴⁸	Data ink		Number of data points, data density
	Cognitive complexity		Number of simultaneous dimensions, maximum of the number of dimension from each separate task representation, appropriate representation and dimension score
	Occlusion		Occlusion rate
	Reference context		Percentage of identify data point
Freitas et al.¹⁹	Visual presentation	Cognitive complexity	Data density, data dimension, display of relevant information
		Spatial organization	Logical order, occlusion, display of details, reference context
		Information coding	Information mapping, realistic techniques
		State transitions	Image generation time, visual spatial orientation
	Interaction	Orientation and help	Control of additional detail, undo, representation of additional information
		Navigation and querying	Selection of objects, viewpoint manipulation, geometric manipulation, growing, searching and querying
		Data set reduction	Filtering, clustering, planning
Connell and Choong⁴⁹ (analyst-centered metrics)			Empowering analysis, improving analytic products, collaboration, ease of use, immediate feedback, error and critical incidents, minimal action
Plaisant et al.⁵⁰	Component level		Efficiency, effectiveness, satisfaction, scalability, speed, accuracy
	System level		Usability, learnability
	Environment level		Adoption, productivity, satisfaction
Bai et al.⁵¹	Visual representation model		Visualization elements, effectiveness of mapping, structure, showing comparison/emphasis, representation style, providing details
	Information presentation model		Orientation and help, query and navigation, data set reduction, presentation style, showing comparison/emphasis, structure, providing details
	Psychology of the observer		Personal preference, opinion of authorities
	Information quality		Usefulness, comprehensiveness, effectiveness, correctness, conciseness
	Visual impact		Eye pleasing, visual appeal, visually uncomfortable, stunning
	Overall design style		Clear design, informative design, intuitive design, balance design, classic design, dynamic design
	Overall performance		Easy to understand, user-friendly

Zuk and Carpendale⁴² developed perceptual and cognitive heuristics for the analysis of uncertainty visualizations. They integrated principles from Tufte,⁵² Bertin,⁵³ and Ware⁵⁴ to propose a framework that provides insights into the strengths and weaknesses of various aspects of visualizations. In another study, Zuk et al.⁴¹ used three different sets of heuristics (perceptual, cognitive, and usability) to evaluate a specific system by conducting a user study. The participants were asked to use these heuristics to evaluate the system. Their findings showed that the results depend heavily on the heuristics used and the types of evaluators chosen. A knowledge- and task-based framework for the design and evaluation of visualizations, based on “analytic gaps,” was proposed by Amar and Stasko.⁴³ These gaps define the obstacles faced by visualizations in facilitating high-level analytic tasks, such as decision making and learning. Shneiderman⁴⁴ defined seven tasks, “overview, zoom, filter, details on demand, relate, extract, history,” that a visualization technique should support. Based on these tasks, Wiss et al.⁵⁵ conducted experiments to measure task completion and the difficulties users encountered while interacting with Cam Trees, Information Cube, and Information Landscape visualizations on different data sets. The results showed that these visualizations behaved differently on different data sets, and a guideline was provided for the selection of a visualization for a specific data set. Similarly, Wehrend and Lewis⁴⁵ identified 11 operational tasks that one might perform with a visualization, including identify, locate, and distinguish. Amar and Stasko⁴³ and Shneiderman⁴⁴ provided high-level heuristics, which were more difficult to apply than the heuristics from Zuk and Carpendale.⁴²

Scholtz³¹ proposed evaluation areas, namely, situation awareness, interaction, collaboration, creativity, and utility, to be added to basic usability measures for evaluating visual analytic environments. Qualitative metrics for evaluating the utility of visual analytic environments were also developed by Scholtz⁴⁶ using heuristic reviews. This evaluation addressed many usability issues: usefulness, efficiency and intuitiveness of the visualizations, user interaction, and analytical process. First, a team of visualization experts and analysts defined a list of tasks and the data sets required to accomplish these tasks. Then, the developers of the systems acted as users in the study, performed the required tasks, and provided their evaluations (reviews) to reviewers. The reviewers identified the important heuristics, such as complexity of a visualization, lack of labels, and misleading color coding. In 2011, Scholtz⁵⁶ conducted another study based on the heuristics defined by Amar and Stasko,⁴³ Zuk and Carpendale,⁴² and Shneiderman⁴⁴ to identify which factors are important to analysts in evaluating a visual analytics system. A user study with three analysts was conducted in a group setting. The analysts performed the required tasks and were asked to comment on the utility and ease of use of the visualizations, the efficiency of interactions, and the aspects of the visualizations they liked and disliked. Based on the analysts’ comments and the results of previous studies, a guideline was proposed to evaluate visual analytics environments, which synthesized guidelines developed by researchers in various domains, such as websites and user interfaces.⁴⁶ An overview of different types of evaluation scenarios for understanding visualizations, for instance, evaluating user performance and user experience, is presented by Lam et al.⁵⁷ In this work, the authors summarize the most common evaluation goals, outputs, evaluation questions, and approaches for each evaluation scenario. This survey assists practitioners in setting the right evaluation goals, picking the right evaluation questions to ask, and considering a variety of methodological alternatives for the chosen goals and questions.

In well-established fields, such as user interface or website design, guidelines have been developed based on the results of empirical studies or the experience of experts; these can speed up the design and evaluation process and replace some of the need for observing actual users.⁵⁸ However, there is currently no agreed-upon set of metrics or guidelines in the literature for the evaluation of visualizations and interaction mechanisms. To overcome this issue, several efforts have been made to define sets of useful guidelines and metrics.⁴⁷,⁵⁶,⁵⁹ Brath⁵⁹ provided guidelines for interaction mechanisms and visualization to create interactive visualizations. The guidelines are divided into three parts: knowing the goal, interaction, and visualization. For the first part, knowing the goal, he recommended that a real problem and its supporting data must exist. The author also suggested that an interaction mechanism should be simple, effective, consistent, and interactive, allowing a variety of operations, such as slicing, filtering, drill down, and zooming. The guidelines for visualizations include the addition of a legend, scale, and annotation. Forsell and Johansson⁴⁷ took a step toward defining a standardized set of heuristics. They conducted a study to identify a compact set of heuristics that covers most visualization usability issues. In their experiments, evaluators rated how well a total of 63 previously published heuristics could explain a collection of 74 usability problems. Based on the evaluators’ ratings, they derived a new set of 10 heuristics, namely, information coding, minimal actions, flexibility, orientation and help, spatial organization, consistency, recognition rather than recall, prompting, remove the extraneous, and data set reduction.
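The selection procedure of Forsell and Johansson⁴⁷ is not detailed here. Purely as an illustration of how a compact set of heuristics could be chosen once it is known which heuristics account for which usability problems, the sketch below applies a greedy coverage strategy. The greedy approach, the function name `greedy_heuristic_set`, and the toy data are assumptions and should not be read as the authors’ method.

```python
def greedy_heuristic_set(explains, max_size):
    """Pick up to `max_size` heuristics that together cover the most problems.

    explains: dict mapping a heuristic name to the set of problem ids it
        accounts for well (hypothetical input derived from ratings).
    Returns (chosen heuristics in selection order, problems left uncovered).
    """
    uncovered = set()
    for problems in explains.values():
        uncovered |= set(problems)
    chosen = []
    while uncovered and len(chosen) < max_size:
        # Iterate in sorted order so that ties are broken alphabetically.
        best = max(sorted(explains), key=lambda h: len(set(explains[h]) & uncovered))
        gain = set(explains[best]) & uncovered
        if not gain:
            break
        chosen.append(best)
        uncovered -= gain
    return chosen, uncovered


if __name__ == "__main__":
    # Toy example with three heuristics and four problems.
    ratings = {
        "information coding": {1, 2, 3},
        "minimal actions": {3, 4},
        "consistency": {4},
    }
    print(greedy_heuristic_set(ratings, max_size=2))
```

Greedy coverage is a common approximation for this kind of selection problem, but it does not guarantee the smallest possible set.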

Freitas et al.¹⁹ defined four classes to evaluate information visualization techniques, namely, completeness, spatial organization, information coding, and state transition, and three classes, namely, orientation and help, navigation and querying, and data set reduction, for the analysis of interaction mechanisms. For each class, they defined a set of metrics to evaluate the usability of visualization, as shown in Table 3. Data visualization metrics have been proposed by Brath⁴⁸ to evaluate the efficiency of three-dimensional plots by measuring the number of data points, the number of dimensions, occlusion rate, the density of data points, the number of simultaneous dimensions, and the number of identifiable data points. Metrics for interaction mechanisms were not defined in this study. Analyst-centered information visualization metrics were proposed by Connell and Choong⁴⁹ to measure analysts’ needs and experiences. These metrics are based on understanding the behavior of novice and experienced analysts and their workplace requirements (Table 3). Plaisant et al.⁵⁰ defined three levels of visualization evaluation: component level, system level, and environment level. The component level focuses on algorithms, visual representations, interactive techniques, and interface designs. The system level includes interfaces, which combine and integrate multiple components. The third level is the environment level, which addresses evaluation issues related to adoption. The authors also listed metrics for each level (Table 3). Similarly, Bai et al.⁵¹ defined seven assessment areas, namely, visual representation model, information presentation model, psychology of the observer, information quality, visual impact, overall design style, and overall performance, for the evaluation of information visualizations. Each area is further divided into a set of evaluation categories and sub-categories (metrics). These metrics can be utilized to evaluate different areas of information visualizations.

Pillat et al.⁶⁰ evaluated two information visualization techniques, Parallel Coordinates and Radviz, through experiments with users in a laboratory in the presence of observers. The users pointed out some usability problems in the visualizations, such as difficulty in interpreting the Radviz layout. To identify the role of the eye tracking technique in usability evaluation, Pretorius et al.⁶¹ evaluated the AppVis 1.0 network management tool, and the results showed that a better evaluation of visualization can be achieved by integrating eye tracking into traditional evaluation methods. Kang et al.⁶² reported an evaluation study of the visual analytics system “Jigsaw” with 16 participants, which describes an investigative analysis exercise. The participants were given 50 documents with the goal of identifying a hidden terrorist plot. Qualitative data, such as observations, follow-up interview notes, and videos, and quantifiable data, such as the number of documents viewed and the number of queries performed, were analyzed; the analysis suggested that the system can benefit investigative analysis.

The literature indicates that usability studies have been conducted in many domains, such as websites,^30,32,33 software,³⁷ digital repositories,³⁴ user interface,^35,36 and information visualization,^60,61 using questionnaires, interviews, focus groups, and tests with users in remote or laboratory settings. However, there is still a lack of usability studies that evaluate information visualization techniques, especially in the opinion mining domain, with the exception of Wu et al.,² who evaluated the opinion wheel with data analysts. To the best of our knowledge, no usability study in the literature has investigated opinion visualization techniques from the customers’ point of view. It is important to consider customers’ recommendations for better usability and usefulness, as they are the main users of the systems. This study aims to fill this gap by ranking the visualizations of opinion mining systems and identifying important information visualization metrics.

Research methodology

In this section, the conceptual design and the research methodology are described.

Method

Instrument

A questionnaire survey consisting of 10 structured closed-ended questions was developed to collect the data. The 10 evaluation metrics (and the assessment areas) were adapted from Bai et al.⁵¹ (see Table 4). We selected six out of seven assessment areas because it was difficult to collect data for the seventh assessment area, which is about domain experts’ opinion. Each assessment area defined a set of evaluation categories and sub-categories (metrics). The questionnaire had two parts. Part A required the participants to fill in their demographic profiles, such as age and gender. Part B required the participants to state their level of agreement or disagreement about visualizations against each metric on a Likert scale that ranges from Strongly Disagree (1) to Strongly Agree (5) (see Table 4).

Table 4.

Questionnaire with metrics and assessment areas.

Assessment area	Metrics	Questions
Visual impact	Eye pleasing	Q1: The visualization is eye pleasing.
Overall performance	Easy to understand	Q2: The visualization is easy to understand.
	User-friendly	Q3: The visualization is user-friendly.
Overall design style	Informative	Q4: The visualization is informative.
	Intuitive	Q5: The visualization is intuitive.
Information quality	Usefulness	Q6: The visualization is useful.
	Comprehensiveness	Q7: The comprehensiveness of data is good.
Visual representation model	Comparison ability	Q8: The comparison of data is good.
	Representation style	Q9: The representation style of data is good.
Information presentation model	Pre-knowledge required	Q10: Pre-knowledge is required to understand the visualization.
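The structure of Part B can be sketched in code as follows. This is only an illustrative encoding of Table 4 and the 5-point scale described above; the names `Item`, `QUESTIONNAIRE`, and `validate_response` are hypothetical and are not part of the study's instrument.

```python
from dataclasses import dataclass

# Hypothetical encoding of Table 4: one item per question,
# tagged with its metric and assessment area.
@dataclass(frozen=True)
class Item:
    question: str
    metric: str
    area: str

QUESTIONNAIRE = [
    Item("Q1", "Eye pleasing", "Visual impact"),
    Item("Q2", "Easy to understand", "Overall performance"),
    Item("Q3", "User-friendly", "Overall performance"),
    Item("Q4", "Informative", "Overall design style"),
    Item("Q5", "Intuitive", "Overall design style"),
    Item("Q6", "Usefulness", "Information quality"),
    Item("Q7", "Comprehensiveness", "Information quality"),
    Item("Q8", "Comparison ability", "Visual representation model"),
    Item("Q9", "Representation style", "Visual representation model"),
    Item("Q10", "Pre-knowledge required", "Information presentation model"),
]

# Likert scale bounds: Strongly Disagree (1) to Strongly Agree (5).
LIKERT_MIN, LIKERT_MAX = 1, 5

def validate_response(scores):
    """Map one participant's ten answers for one visualization to metrics."""
    if len(scores) != len(QUESTIONNAIRE):
        raise ValueError("expected one score per question")
    for score in scores:
        if not LIKERT_MIN <= score <= LIKERT_MAX:
            raise ValueError(f"score {score} is outside the Likert range")
    return {item.metric: score for item, score in zip(QUESTIONNAIRE, scores)}
```

Each participant would produce one such record per visualization, which keeps the metric labels attached to the raw scores for later aggregation.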

Instrument refinement

The questionnaire was pre-tested in a pilot study to judge its feasibility. The pilot study was performed with the help of a focus group to gauge the participants’ understanding of the questionnaire. There were 15 participants in the focus group study, 5 in each group. The participants were academicians from COMSATS Institute of Information Technology, Pakistan. The participants discussed the questionnaire with each other and provided their level of understanding, suggestions, and comments. Initially, 14 metrics were selected; however, after the pilot study, 4 metrics, namely, visual appeal, visually uncomfortable, stunning, and conciseness, were removed. The first 3 of these can be represented by eye pleasing, and conciseness is closely related to the comprehensiveness metric. The refined questionnaire was then discussed again with two participants of the focus group study to finalize its contents, and the finalized questionnaire was used to collect data.

Data collection

Seminars and a web-based online questionnaire were used to collect the data.

Seminar

The first author of this article (A.S.) contacted the coordinator of the Computer Science Department of COMSATS Institute of Information Technology, Pakistan, to arrange seminars on opinion visualization techniques. The coordinator made arrangements for the seminars and informed the author, students, and academicians about the venue and the time of the seminars. The target participants of these seminars were students and academicians with a computer science background, especially students who had taken the “Human–Computer Interaction” course. This course introduced some of the prominent visualization techniques, such as tree map. As a result, the students who had taken the course had a better knowledge of visualization techniques. The reason for targeting computer science personnel is that they are the largest group of Internet users and are more likely to consult and use online opinion information than other Internet user groups.

Three seminars were conducted to present information about state-of-the-art opinion visualization techniques and to hold a dynamic discussion about the visualization techniques with the target participants. The first author of this article (A.S.) gave a 10-min presentation at the beginning of each seminar to present the objectives of the study, a brief introduction to the selected opinion visualization techniques, and instructions on filling in the questionnaire, so that the participants had an adequate understanding to complete it. Then, a question–answer session was held to clarify the participants’ understanding of the visualization techniques. In the session, the participants mostly asked about the meaning of a symbol or metaphor in the visualizations, such as the interpretation of symbols in Figure 7. The session lasted approximately 30 min. After the session, the opinion visualization techniques were displayed, and the participants were requested to fill in the questionnaire. After that, an interactive discussion was held between the participants and the presenter in which the participants stated their preferences about opinion visualizations. The preferences were noted down. The first seminar took approximately 60 min. The same procedure was repeated for the second and third seminars.

Online questionnaire

To increase the number of responses, an online questionnaire was also created, and the link was distributed via e-mail to computer science students of different universities in Malaysia. We added a video to the online questionnaire that briefly introduced the selected opinion visualizations. One limitation of the online questionnaire is the lack of face-to-face communication, especially a question–answer session to clarify concepts. To overcome this limitation, we added to the online questionnaire a description of the concepts, metaphors, and symbols that the seminar participants had asked about in the question–answer session. It took approximately 5 weeks to collect the data.

Participants

A total of 146 users participated in the data collection. The seminars had 110 participants (22 females and 88 males), and the online questionnaire had 36 participants (17 females and 19 males). Table 5 shows the details of the participants (M: 25.57, standard deviation (SD): 5.55). Most of the participants belonged to the 21- to 30-years age group, as this group uses the Internet more than other age groups. A large number of participants had prior experience of getting decision-oriented information from online reviews.

Table 5.

Details of the participants.

	Categories	Number
Gender	Male	116
	Female	30
Age (years)	<20	18
	21–30	113
	31–40	11
	>40	4

Results and discussion

The collected data were analyzed using SPSS 18 (Statistical Package for the Social Sciences) to achieve the objectives of this study. For each metric, the mean across all visualizations (mean+; Table 6) was also calculated to rank the visualizations and to identify important information visualization metrics. An independent sample t-test was conducted to investigate differences between the perceptions of the participants of the seminars and the online questionnaire.

Table 6.

Visualizations		Eye Pleasing	Easy to understand	User-friendly	Informative design	Intuitive design	Usefulness	Comprehensiveness	Comparison ability	Representation style	Pre-knowledge required
Mean+		3.08	3.08	3.03	3.19	2.77	3.09	2.99	2.95	3.01	2.94
Bar chart	M (SD)	3.77* (1.19)	4.02* (1.10)	3.84* (1.24)	3.91* (1.04)	3.24* (1.19)	3.78* (1.10)	3.51* (1.21)	3.53* (1.17)	3.60* (1.15)	2.67 (1.44)
Glowing bar	M (SD)	3.88* (1.09)	3.73* (1.26)	3.65* (1.18)	3.69* (1.14)	3.10* (1.10)	3.63* (1.08)	3.42* (1.15)	3.31* (1.17)	3.43* (1.24)	2.79 (1.36)
Tree map	M (SD)	3.53* (1.33)	3.45* (1.28)	3.52* (1.16)	3.60* (1.13)	3.10* (1.12)	3.36* (1.21)	3.26* (1.24)	3.16* (1.19)	3.41* (1.24)	2.92 (1.28)
Line graph and pie chart	M (SD)	3.25* (1.12)	3.60* (1.26)	3.51* (1.18)	3.39* (1.13)	2.93* (1.34)	3.36* (1.15)	3.18* (1.21)	3.03* (1.15)	3.08* (1.06)	2.44 (1.33)
Visual summary	M (SD)	3.11* (1.18)	2.91 (1.17)	2.86 (1.18)	3.10 (1.03)	2.73 (1.15)	3.02 (1.08)	3.01* (1.02)	2.95* (1.13)	2.98 (1.23)	3.01* (1.31)
Comparative relation map	M (SD)	3.10* (1.11)	3.02 (1.13)	2.84 (1.16)	2.97 (1.09)	2.71 (1.06)	3.01 (1.11)	2.97 (1.07)	2.94 (1.16)	2.95 (1.25)	2.99* (1.27)
Rose plot	M (SD)	2.77 (1.10)	2.83 (1.23)	2.71 (1.25)	2.96 (1.09)	2.66 (1.02)	2.93 (1.09)	2.82 (1.13)	2.80 (1.21)	3.03* (1.21)	3.07* (1.40)
Bar chart with symbols	M (SD)	3.01 (1.24)	3.09* (1.23)	3.01 (1.23)	2.99 (1.12)	2.68 (1.17)	2.97 (1.13)	2.83 (1.11)	2.82 (1.13)	2.64 (1.21)	2.86 (1.26)
Opinion wheel	M (SD)	2.60 (1.18)	2.25 (1.36)	2.32 (1.20)	2.79 (1.11)	2.62 (1.14)	2.51 (1.07)	2.73 (1.12)	2.61 (1.21)	2.77 (1.21)	3.53* (1.57)
Coordinated graph	M (SD)	1.99 (1.15)	1.84 (1.15)	1.78 (0.99)	2.34 (1.22)	2.11 (1.12)	2.27 (1.18)	2.22 (1.18)	2.39 (1.18)	2.25 (1.22)	3.42* (1.54)
Positioning map	M (SD)	2.58 (1.19)	2.81 (1.30)	2.85 (1.15)	3.01 (1.17)	2.40 (1.12)	2.84 (1.25)	2.67 (1.10)	2.66 (1.18)	2.79 (1.18)	2.80 (1.40)

SD: standard deviation.

Ranking of visualizations

In order to achieve the first objective, that is, ranking of the opinion visualizations, descriptive statistics, such as mean and SD, of all metrics for each of the visualizations were analyzed. Table 6 shows the ranking of the visualizations. The contents in gray shade show the metrics that scored higher than the mean+ values. These metrics are also marked with the symbol *. A visualization ranks higher than others based on its number of gray-shaded metrics. Therefore, the top four visualizations are bar chart, followed by glowing bar, tree map, and line graph and pie chart. Positioning map is at the lowest rank with no gray-shaded metric.
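The counting rule described above can be illustrated with a short sketch. It uses the mean+ row and three rows copied from Table 6 (bar chart, rose plot, and positioning map); the function name `count_above_mean_plus` is hypothetical, and treating "higher than mean+" as a strict comparison on the rounded table values is an assumption.

```python
# Mean+ row of Table 6, in the table's column order.
MEAN_PLUS = [3.08, 3.08, 3.03, 3.19, 2.77, 3.09, 2.99, 2.95, 3.01, 2.94]

# Per-metric means for three visualizations, copied from Table 6.
MEANS = {
    "Bar chart":       [3.77, 4.02, 3.84, 3.91, 3.24, 3.78, 3.51, 3.53, 3.60, 2.67],
    "Rose plot":       [2.77, 2.83, 2.71, 2.96, 2.66, 2.93, 2.82, 2.80, 3.03, 3.07],
    "Positioning map": [2.58, 2.81, 2.85, 3.01, 2.40, 2.84, 2.67, 2.66, 2.79, 2.80],
}

def count_above_mean_plus(values, mean_plus):
    """Count the metrics whose mean is strictly greater than mean+."""
    return sum(1 for v, m in zip(values, mean_plus) if v > m)

counts = {name: count_above_mean_plus(v, MEAN_PLUS) for name, v in MEANS.items()}

# Rank by the number of above-mean+ metrics, highest first.
ranking = sorted(counts, key=counts.get, reverse=True)
for name in ranking:
    print(name, counts[name])
```

Because the table values are rounded to two decimals, a strict comparison can disagree with the published markers when a value equals mean+ after rounding, so the sketch is best read as an illustration of the rule rather than a reproduction of the shading.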

The reason behind the higher ranking of bar chart and glowing bar is their simple, effective, and easy to use design,⁶³ with little or no pre-knowledge requirement. Other factors that contributed strongly to the ranking are (1) highest visual appeal and user-friendliness, (2) easy to understand and highly informative design, and (3) highest comparison ability and good representation style (see Table 6). One more advantage of bar chart and glowing bar is the ease of locating specific information directly.⁶⁴ Despite the simplicity of their design, bar charts are not suitable for presenting multidimensional data with a low level of abstraction and are also limited in the number of data values they can show.^9,65

Tree map efficiently utilizes display space⁶⁶ to present multiple features and their corresponding semantics at the same time with an appealing and eye pleasing design. The tree map layout makes it easy to identify and compare key features and sentiment.⁶⁶ Line graph and pie chart are common, simple, effective, and easy-to-use visualizations.^63,64 The line graph shows users the trend of customers’ opinions and helps predict the tendency of customers’ opinions about a product. It also supports a quick analysis of the data to find the minimum and maximum of positive and negative users’ opinions. As opinion data are highly dimensional, the line graph is not well suited to opinion visualization because it is limited in the number of dimensions and data sets it can show.⁶³ A pie chart provides an easy way to compare the number of positive and negative comments.⁹ It is also interesting to note that all the top visualizations have a low pre-knowledge requirement, as they are simple, common, less dimensional, and user-friendly.

The next visualization is the visual summary, which provides a quick and scalable way to compare competing products based on different features with corresponding semantics at a glance. The strength of the visual summary is its color scale, which presents levels of semantic tendency. However, comparison is difficult because the inner rectangles do not differ significantly in size, as pointed out by the participants of the seminars. Furthermore, it is difficult to locate a specific piece of information. The comparative relation map is an eye pleasing and multidimensional visualization that facilitates comparison among competing products intuitively; however, it is difficult to understand because high pre-knowledge is required to interpret it, as pointed out by the participants of the seminars. The next visualization is the rose plot, which is aesthetically appealing and has a compact layout and an easy to interpret design.^2,5,17 Its design facilitates comparison in a quick and easy to understand way, but it is limited in the number of data elements it can show: each element requires a substantial amount of screen space, so for many items the petals become small and difficult to distinguish.⁶⁷

Bar chart with symbols is difficult to understand because of its high pre-knowledge requirement, as pointed out by the participants of the seminars. Opinion wheel provides a good way to present high-dimensional data with a low level of abstraction.² However, it is difficult to locate a specific piece of information in it because of its high dimensionality and low level of abstraction, and users prefer visualizations that do not present too much information.² It is rated highest in pre-knowledge requirement and low in visual appeal and understandability. Most of the seminar participants failed to understand the opinion wheel because of its high pre-knowledge requirement. Opinion wheel is designed for data analysts, and this is one of the reasons behind its low ranking. Coordinated graph is the visualization with (1) the lowest visual appeal, (2) the highest difficulty of understanding, (3) the lowest user-friendliness and informative design, and (4) a high pre-knowledge requirement. In the open discussion, a majority of participants pointed out its visually uncomfortable design and the difficulty of identifying a specific piece of information. The last visualization is the positioning map, which is a widely used, simple, and easy to understand visualization; however, it lacks visual expression and some of the flexibility offered by newer visualization techniques to present multidimensional data.^68,69 Positioning map also has significant overlap when presenting many data points⁶⁹ and hence is less suitable for opinion data.

It is observed that common, simple, eye pleasing, easy to understand, less dimensional, and highly summarized visualizations ranked higher. The usefulness of a visualization technique in terms of presenting decision-oriented information is a vital metric in the usability of the visualizations. The level of abstraction is another important factor because users are not concerned with the details of individual reviews. Users mostly require a higher level of abstraction presented in easy to understand ways. Another significant metric is the knowledge required to understand the visualization. Users prefer visualizations that are understandable without any pre-knowledge or training. It is concluded from the results that most of the selected visualizations required pre-knowledge to be understood. As the dimensionality of the data within a visualization grows, more pre-knowledge is required to understand it. If a user fails to understand a visualization, then it is not considered useful. In the comparative analysis of different products and their features, the comparison ability of a visualization played an important role.

Investigation of differences between perceptions of the participants of seminars and online questionnaire

An independent sample t-test was conducted to investigate differences between perceptions of the participants of seminars and online questionnaire.
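As a rough illustration of this test, the following sketch computes an independent sample t statistic from summary statistics alone. It assumes the pooled-variance (equal variances) form of the test and group sizes of 110 (seminars) and 36 (online questionnaire) taken from the Participants section; the study does not state which variant SPSS reported or the per-item response counts, so the function name `pooled_t` and these inputs are assumptions. The example values are the opinion wheel "Eye pleasing" means and SDs from Table 7.

```python
import math

def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """Student's t statistic and degrees of freedom from summary statistics.

    Assumes equal population variances (pooled-variance form).
    """
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / df
    std_err = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / std_err, df

# Seminar group first, online questionnaire group second.
t_value, df = pooled_t(2.30, 1.16, 110, 3.28, 1.00, 36)
print(f"t = {t_value:.3f}, df = {df}")
```

With the seminar group entered first, a negative t value indicates a higher mean in the online questionnaire group. Values recomputed this way from rounded means and SDs and assumed group sizes should not be expected to match the reported t values exactly, and the sketch does not compute p values.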

The significant differences between the perceptions of the two groups for radial visualizations are shown in Table 7. The participants of the online questionnaire rated the visual appeal, intuitiveness, usefulness, comprehensiveness, comparison ability, and representation style of the opinion wheel higher than the participants of the seminars did. However, they required more pre-knowledge to comprehend the opinion wheel than the participants of the seminars. Similarly, the participants of the online questionnaire reported higher visual appeal, intuitiveness, usefulness, comparison ability, and pre-knowledge requirement for the rose plot than the participants of the seminars.

Table 7.

Significant differences between the perception of the participants of seminars and online questionnaire for radial visualizations.

Visualizations	Metric	Data collection method	Mean	SD	t value	p value
Opinion wheel	Eye pleasing	Seminar	2.30	1.16	−4.160	.000
		Online questionnaire	3.28	1.00
	Intuitive design	Seminar	2.37	1.10	−4.853	.000
		Online questionnaire	3.36	0.93
	Usefulness	Seminar	2.34	1.05	−3.664	.000
		Online questionnaire	3.06	0.92
	Comprehensiveness	Seminar	2.56	1.11	−3.145	.002
		Online questionnaire	3.22	1.05
	Comparison	Seminar	2.36	1.15	−4.599	.000
		Online questionnaire	3.36	1.07
	Representation style	Seminar	2.65	1.29	−2.660	.009
		Online questionnaire	3.14	.83
	Pre-knowledge required	Seminar	3.37	1.62	−2.438	.017
		Online questionnaire	4.03	1.32
Rose plot	Eye pleasing	Seminar	2.65	1.13	−2.369	.019
		Online questionnaire	3.14	.93
	Intuitive design	Seminar	2.50	1.01	−3.377	.001
		Online questionnaire	3.14	.90
	Usefulness	Seminar	2.82	1.14	−2.219	.028
		Online questionnaire	3.28	.849
	Comparison	Seminar	2.64	1.22	−2.964	.004
		Online questionnaire	3.31	1.04
	Pre-knowledge required	Seminar	2.87	1.43	−3.429	.001
		Online questionnaire	3.67	1.12

SD: standard deviation.

The significant differences between the perceptions of the two groups for graphs are shown in Table 8. The intuitiveness, usefulness, comprehensiveness, and representation style of the coordinated graph were appreciated more by the participants of the online questionnaire than by the participants of the seminars. The participants of the online questionnaire also reported a higher pre-knowledge requirement for the coordinated graph than the other group. Likewise, the participants of the online questionnaire agreed significantly more than the participants of the seminars on the visual appeal, understandability, user-friendliness, informativeness, intuitiveness, usefulness, comprehensiveness, comparison, representation style, and pre-knowledge requirement of the positioning map. Similarly, the participants of the online questionnaire rated the representation style of the comparative relation map higher and reported a higher pre-knowledge requirement than the participants of the seminars.

Table 8.

Significant differences between the perception of the participants of seminars and online questionnaire for graphs.

Visualizations	Metric	Data collection method	Mean	SD	t value	p value
Coordinated graph	Intuitive design	Seminar	1.98	1.09	−2.448	.016
		Online questionnaire	2.50	1.13
	Usefulness	Seminar	2.12	1.16	−2.848	.005
		Online questionnaire	2.75	1.13
	Comprehensiveness	Seminar	2.05	1.17	−3.215	.002
		Online questionnaire	2.75	1.05
	Representation style	Seminar	2.15	1.30	−2.142	.035
		Online questionnaire	2.56	.877
	Pre-knowledge required	Seminar	3.22	1.62	−3.408	.001
		Online questionnaire	4.03	1.08
Positioning map	Eye pleasing	Seminar	2.43	1.19	−2.695	.008
		Online questionnaire	3.03	1.06
	Easy to understand	Seminar	2.58	1.30	−4.272	.000
		Online questionnaire	3.50	1.06
	User-friendly	Seminar	2.66	1.14	−3.554	.001
		Online questionnaire	3.42	.996
	Informative design	Seminar	2.88	1.23	−2.715	.008
		Online questionnaire	3.39	.87
	Intuitive design	Seminar	2.26	1.12	−2.575	.011
		Online questionnaire	2.81	1.01
	Usefulness	Seminar	2.65	1.28	−4.106	.000
		Online questionnaire	3.44	.909
	Comprehensiveness	Seminar	2.53	1.11	−2.824	.005
		Online questionnaire	3.11	.979
	Comparison	Seminar	2.49	1.16	−3.076	.003
		Online questionnaire	3.17	1.08
	Representation style	Seminar	2.66	1.19	−2.260	.025
		Online questionnaire	3.17	1.06
	Pre-knowledge required	Seminar	2.63	1.42	−2.932	.005
		Online questionnaire	3.33	1.20
Comparative relation map	Representation style	Seminar	2.84	1.30	−2.119	.037
		Online questionnaire	3.28	1.00
	Pre-knowledge required	Seminar	2.79	1.27	−3.854	.000
		Online questionnaire	3.61	1.05

SD: standard deviation.

The significant differences between the perceptions of the participants of the seminars and the online questionnaire for hierarchical visualizations are shown in Table 9. The tree map appeared more user-friendly to the participants of the seminars, and they required less pre-knowledge to comprehend it, as most of them were familiar with tree map visualization. In contrast, the participants of the online questionnaire favored the visual summary in terms of the eye pleasing, user-friendly, intuitive design, usefulness, comprehensiveness, comparison, and representation style metrics. They also reported a higher pre-knowledge requirement than the other group.

Table 9.

Significant differences between the perception of the participants of seminars and online questionnaire for hierarchical visualizations.

Visualizations	Metric	Data collection method	Mean	SD	t value	p value
Tree map	User-friendly	Seminar	3.63	1.20	1.955	.052
		Online questionnaire	3.19	1.01
	Pre-knowledge required	Seminar	2.69	1.28	−3.931	.000
		Online questionnaire	3.61	1.02
Visual summary	Eye pleasing	Seminar	2.99	1.21	−2.149	.033
		Online questionnaire	3.47	1.03
	User-friendly	Seminar	2.75	1.22	−2.133	.035
		Online questionnaire	3.22	.989
	Intuitive design	Seminar	2.57	1.22	−3.890	.000
		Online questionnaire	3.22	.722
	Usefulness	Seminar	2.89	1.16	−3.398	.001
		Online questionnaire	3.42	.649
	Comprehensiveness	Seminar	2.80	1.03	−5.778	.000
		Online questionnaire	3.64	.639
	Comparison	Seminar	2.76	1.17	−4.434	.000
		Online questionnaire	3.50	.737
	Representation style	Seminar	2.83	1.28	−3.170	.002
		Online questionnaire	3.44	.909
	Pre-knowledge required	Seminar	2.79	1.34	−4.446	.000
		Online questionnaire	3.69	.95

SD: standard deviation.

The significant differences between the perceptions of the participants of the seminars and the online questionnaire for bar chart are shown in Table 10. The participants of the online questionnaire favored the comparison ability of the bar chart more than the participants of the seminars. The perceptions of the two groups varied significantly on the easy to understand, user-friendly, informative design, usefulness, comprehensiveness, comparison, and representation style metrics of the line graph and pie chart, with the participants of the online questionnaire showing more agreement. There is only one significant difference between the two groups for the bar chart with symbols: the participants of the online questionnaire rated its comprehensiveness higher than the participants of the seminars. In contrast, the participants of the seminars reported higher visual appeal, understandability, and informativeness for the glowing bars.

Table 10.

Significant differences between the perception of participants of seminars and online questionnaire for bar chart.

Visualizations	Metric	Data collection method	Mean	SD	t value	p value
Bar chart	Comparison	Seminar	3.42	1.21	−2.357	.021
		Online questionnaire	3.89	.979
Line graph and pie chart	Easy to understand	Seminar	3.40	1.32	−4.184	.000
		Online questionnaire	4.19	.856
	User-friendly	Seminar	3.30	1.19	−4.372	.000
		Online questionnaire	4.14	.931
	Informative design	Seminar	3.26	1.19	−2.874	.005
		Online questionnaire	3.78	.832
	Usefulness	Seminar	3.17	1.20	−4.666	.000
		Online questionnaire	3.94	.715
	Comprehensiveness	Seminar	3.03	1.28	−3.307	.001
		Online questionnaire	3.64	.833
	Comparison	Seminar	2.84	1.19	−4.598	.000
		Online questionnaire	3.64	.798
	Representation style	Seminar	2.92	1.13	−4.321	.000
		Online questionnaire	3.56	.607
Bar chart with symbols	Comprehensiveness	Seminar	2.73	1.17	−2.259	.027
		Online questionnaire	3.14	.867
Glowing bars	Eye pleasing	Seminar	4.10	1.04	4.477	.000
		Online questionnaire	3.22	.959
	Easy to understand	Seminar	3.85	1.28	2.028	.044
		Online questionnaire	3.36	1.13
	Informative design	Seminar	3.80	1.16	2.023	.045
		Online questionnaire	3.36	1.02

SD: standard deviation.

The largest number of differences between the groups was found for the positioning map (10 metrics), followed by the visual summary (8 metrics), opinion wheel (7 metrics), line graph and pie chart (7 metrics), rose plot (5 metrics), and coordinated graph (5 metrics). Fewer differences were found for glowing bar (3 metrics), tree map (2 metrics), comparative relation map (2 metrics), bar chart (1 metric), and bar chart with symbols (1 metric). The participants of the online questionnaire mostly required more pre-knowledge to interpret the visualizations because of the lack of face-to-face communication, as they were unable to clarify the underlying concepts of the visualizations.

Identification of important visualization metrics

In order to achieve the second objective, that is, the identification of important information visualization metrics, we used descriptive statistics, that is, the mean+ value. We considered a metric important if its mean+ value was greater than 3. Thus, the important metrics are eye pleasing, easy to understand, user-friendly, informative design, usefulness, and representation style (Table 6).
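The threshold rule can be checked directly against the mean+ row of Table 6. The sketch below is a minimal illustration; the names `MEAN_PLUS`, `THRESHOLD`, and `important` are chosen here for the example only.

```python
# Mean+ values per metric, copied from Table 6.
MEAN_PLUS = {
    "Eye pleasing": 3.08,
    "Easy to understand": 3.08,
    "User-friendly": 3.03,
    "Informative design": 3.19,
    "Intuitive design": 2.77,
    "Usefulness": 3.09,
    "Comprehensiveness": 2.99,
    "Comparison ability": 2.95,
    "Representation style": 3.01,
    "Pre-knowledge required": 2.94,
}

# A metric is treated as important if its mean+ is greater than 3.
THRESHOLD = 3.0

important = [metric for metric, value in MEAN_PLUS.items() if value > THRESHOLD]
print(important)
```

Applied to these values, the rule selects the six metrics named in the text, and intuitive design has the lowest mean+ of the ten.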

It is observed that users prefer eye pleasing, easy to understand, and user-friendly visualizations. Another important factor is how much decision-oriented information a visualization presents. The usefulness and representation style of a visualization are other vital metrics in the usability of visualizations. The intuitive design metric scored the lowest mean+ value among all metrics. The reason might be that users consider how informative and easy to understand visualizations are rather than how intuitive they are. The participants focused more on the important metrics while evaluating the visualizations. These metrics played an important role in the ranking of the visualizations and in user satisfaction and should be considered while designing new opinion visualizations. As a result, users’ satisfaction with these systems will increase, and users can take full benefit from these systems.

Conclusion

The objectives of this study were (1) to rank the visualizations of the opinion mining systems, (2) to investigate the differences between the perceptions of the two groups of respondents (the participants of the seminars and the online questionnaire), and (3) to identify important visualization metrics. A questionnaire survey was developed, and data were collected via the online questionnaire and by conducting seminars. It is concluded that visualizations that are simple, easy to understand, less dimensional, and low in pre-knowledge requirement, with a good representation style, were rated higher than others. The top four visualizations are bar chart, glowing bar, tree map, and line graph and pie chart. The results revealed that the participants of the online questionnaire required more pre-knowledge to comprehend the visualizations than the participants of the seminars. It was found that the important metrics are eye pleasing, easy to understand, user-friendly, informative design, usefulness, and representation style. The results of this study could aid in the design and development of visualizations for opinion mining systems.

Footnotes

Funding

This study was supported by University of Malaya (RP002B-13ICT).

References

Dave

Way

Lawrence

. Mining the peanut gallery: opinion extraction and semantic classification of product reviews. In: Proceedings of the 12th international conference on World Wide Web (WWW ’03), 2003, pp. 519–528. New York: ACM Press, http://doi.acm.org/10.1145/775152.775226

Wei

Liu

. OpinionSeer: interactive visualization of hotel customer feedback. IEEE T Vis Comput Gr 2010; 16(6): 1109–1118.

Turney

Littman

. Measuring praise and criticism: inference of semantic orientation from association. ACM Trans Inform Syst 2003; 21(4): 315–346.

Pang

Lee

. A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of 42nd meeting of the Association for Computational Linguistics (ACL ‘04), Barcelona, 21–26 July 2004, pp. 271–278. Association for Computational Linguistics.

Gregory

Chinchor

Whitney

. User-directed sentiment analysis: visualizing the affective content of documents. In: Proceedings of the workshop on sentiment and subjectivity in text, Sydney, NSW, Australia, July 2006, pp. 23–30. Stroudsburg, PA: ACL.

Wanner

Rohrdantz

Mansmann

. Visual sentiment analysis of RSS news feeds featuring the US presidential election in 2008. In: Workshop on visual interfaces to the social and the semantic web (VISSW 2009), Sanibel Island, FL, 8 February 2009.

Chen

Ibekwe-sanjuan

Sanjuan

. Visual analysis of conflicting opinions. In: Proceedings of the IEEE symposium on visual analytics and technology, Baltimore, MD, 31 October–2 November 2006, pp. 59–66. IEEE.

Gamon

Basu

Belenko

. BLEWS: using blogs to provide context for news articles. In: Proceedings of the international conference on weblogs and social media, Seattle, WA, 30 March–2 April 2008.

Khan

. Data and information visualization methods, and interactive mechanisms: a survey. Int J Comput Appl 2011; 34(1): 1–14.

10.

Osimo

Mureddu

. Research challenge on opinion mining and sentiment analysis. In: Using open data: policy modeling, citizen empowerment, data journalism, 2012, http://www.w3.org/2012/06/pmod/opinionmining.pdf

11.

Morinaga

Yamanishi

Tateishi

. Mining product reputations on the Web. In: Proceedings of the eighth ACM SIGKDD international conference on knowledge discovery and data mining, Edmonton, AB, Canada, 23–25 July 2002, pp. 341–349. New York: ACM Press.

12. Miao, Dai. AMAZING: a sentiment mining and retrieval system. Expert Syst Appl 2009; 36(3): 7192–7198.

13. Liao. Mining comparative opinions from customer reviews for competitive intelligence. Decis Support Syst 2011; 50(4): 743–754.

14. Gamon, Aue, Corston-Oliver. Pulse: mining customer opinions from free text. In: International symposium on intelligent data analysis, 2005, pp. 121–132, http://www.springerlink.com/index/94q1nrhfc8a4e8tn.pdf (accessed 12 December 2011).

15. Oelke, Hao, Rohrdantz. Visual opinion analysis of customer feedback data. In: Proceedings of the IEEE symposium on visual analytics science and technology, Atlantic City, NJ, 11–16 October 2009, pp. 187–194. IEEE.

16. Liu, Cheng. Opinion observer: analyzing and comparing opinions on the Web. In: Proceedings of the 14th international conference on World Wide Web (WWW ’05), 2005, pp. 342–351. New York: ACM Press, http://doi.acm.org/10.1145/1060745.1060797

17. Draper, Livnat, Riesenfeld. A survey of radial methods for information visualization. IEEE T Vis Comput Gr 2009; 15(5): 759–776.

18. Nightingale. Notes on matters affecting the health, efficiency, and hospital administration of the British army: founded chiefly on the experience of the late war. London: Harrison and Sons, 1858.

19. Freitas CMDS, Luzzardi PRG, Cava. On evaluating information visualization techniques. In: Proceedings of advanced visual interfaces, Trento, 2002, pp. 2–3.

20. ISO 9241-10:1996. Ergonomic requirements for office work with visual display terminals (VDTs). Part 10: dialogue principles.

21. ISO 9241-11:1994. Ergonomic requirements for office work with visual display terminals (VDTs). Part 11: guidance on usability. Draft international standard.

22. Mayhew. Principles and guidelines in software user interface design. Upper Saddle River, NJ: Prentice Hall, 1992.

23. Smith, Mosier. Design guidelines for user-system interface software. Hanscom Air Force Base, MA: USAF Electronic Systems Division, 1984.

24. Oulanov, Pajarillo EJY. CUNY+ web: usability study of the web-based GUI version of the bibliographic database of the City University of New York (CUNY). Electron Libr 2002; 20(6): 481–487.

25. Brinck, Gergle, Wood. Usability for the web: designing web sites that work. San Francisco, CA: Morgan Kaufmann, 2002.

26. MIT Information Services and Technology Department. Usability guidelines, 2 December 2004.

27. Lee. A study on the improvement plan by analyzing user interaction pattern with the RISS. Technical report KR2004-17, 2004. Seoul, South Korea: KERIS.

28. Nielsen. Usability engineering. Cambridge, MA: Academic Press, 1993.

29. Booth. An introduction to human-computer interaction. London: Lawrence Erlbaum Associates, 1989.

30. Zviran, Glezer, Avni. User satisfaction from commercial web sites: the effect of design and use. Inf Manag 2006; 43(2): 157–178.

31. Scholtz. Beyond usability: evaluation aspects of visual analytic environments. In: Proceedings of the IEEE symposium on visual analytics science and technology, Baltimore, MD, 31 October–2 November 2006, pp. 145–150. New York: IEEE.

32. Jeong, Han. Usability study on newspaper mobile websites. OCLC Syst Serv 2012; 28(4): 180–198.

33. Yushiana, Rani. Heuristic evaluation of interface usability for a web-based OPAC. Libr Hi Tech 2007; 25(4): 538–549.

34. Kim. Usability study of digital institutional repositories. Electron Libr 2008; 26(6): 863–881.

35. Casaló, Flavián, Guinalíu. The role of satisfaction and website usability in developing customer loyalty and positive word-of-mouth in the e-banking services. Int J Bank Market 2008; 26(6): 399–417.

36. Ravendran, MacColl, Docherty. Usability evaluation of a tag-based interface. J Usability Stud 2012; 7(4): 143–160.

37. Feizi, Wong. Usability of user interface styles for learning a graphical software application. In: 2012 International conference on computer & information sciences, Kuala Lumpur, Malaysia, 12–14 June 2012, pp. 1089–1094. New York: IEEE.

38. Van Riel ACR, Liljander, Jurriëns. Exploring consumer evaluations of e-services: a portal site. Int J Serv Ind Manag 2012; 12(4): 359–377.

39. Dudek, Mastora, Landoni. Is Google the answer? A study into usability of search engines. Libr Rev 2007; 56(3): 224–233.

40. Freitas CMDS, Luzzardi PRG, Cava. Evaluating usability of information visualization techniques. In: CHI 2002 – 5th workshop on human factors in computer systems, 2002.

41. Zuk, Schlesier, Neumann. Heuristics for information visualization evaluation. In: Proceedings of the 2006 AVI workshop on beyond time and errors: novel evaluation methods for information visualization, Venice, 23 May 2006.

42. Zuk, Carpendale. Theoretical analysis of uncertainty visualizations. In: Proceedings of the SPIE-IS&T electronic imaging (eds Erbacher, Roberts, Gröhn), San Jose, CA, 15 January 2006, vol. 6060. Bellingham, WA: SPIE.

43. Amar, Stasko. A knowledge task-based framework for design and evaluation of information visualizations. In: Proceedings of the IEEE symposium on information visualization (INFOVIS ’04), Austin, TX, 10–12 October 2004, pp. 143–150. New York: IEEE.

44. Shneiderman. The eyes have it: a task by data type taxonomy for information visualizations. In: Proceedings of the IEEE symposium on visual languages, Boulder, CO, 3–6 September 1996, pp. 336–343. New York: IEEE.

45. Wehrend, Lewis. A problem-oriented classification of visualization techniques. In: Proceedings of the first IEEE conference on visualization, San Francisco, CA, 23–26 October 1990, pp. 139–143. New York: IEEE.

46. Scholtz. Developing qualitative metrics for visual analytic environments. In: Proceedings of the 3rd BELIV ’10 workshop: beyond time and errors: novel evaluation methods for information visualization, Atlanta, GA, 10–15 April 2010, pp. 1–7. New York: ACM Press.

47. Forsell, Johansson. An heuristic set for evaluation in information visualization. In: Proceedings of the international conference on advanced visual interfaces (AVI ’10), Rome, 25–29 May 2010, pp. 199–206. New York: ACM Press.

48. Brath. Metrics for effective information visualization. In: Proceedings of the IEEE symposium on information visualization, Phoenix, AZ, 18–25 October 1997, pp. 108–111. New York: IEEE Computer Society.

49. O'Connell TA, Choong. Metrics for measuring human interaction with interactive visualizations for information analysis. In: Proceedings of ACM CHI 2008 conference on human factors in computing systems, Florence, 5–10 April 2008, pp. 1493–1496. New York: ACM Press.

50. Plaisant, Fekete J-D, Grinstein. Promoting insight-based evaluation of visualizations: from contest to benchmark repository. IEEE T Vis Comput Gr 2007; 14(1): 120–134.

51. Bai, White, Sundaram. Purposeful visualization. In: Proceedings of the 44th Hawaii international conference on system sciences, Kauai, HI, 4–7 January 2011, pp. 1–10. New York: IEEE Computer Society.

52. Tufte. The visual display of quantitative information, 2001. DOI: 10.1016/S0140-6736(05)70412-8.

53. Bertin. Semiology of graphics. Madison, WI: The University of Wisconsin Press, 1983.

54. Ware. Information visualization: perception for design. 2nd ed. San Francisco, CA: Morgan Kaufmann Publishers, 2004.

55. Wiss, Carr, Jonsson. Evaluating three-dimensional information visualization designs: a case study of three designs. In: Proceedings of the IEEE conference on information visualization, London, 29–31 July 1998, pp. 137–144. New York: IEEE Computer Society.

56. Scholtz. Developing guidelines for assessing visual analytics environments. Inf Vis 2011; 10(3): 212–231.

57. Lam, Bertini, Isenberg. Empirical studies in information visualization: seven scenarios. IEEE T Vis Comput Gr 2012; 18: 1520–1536.

58. Scholtz, Plaisant, Whiting. Evaluation of visual analytics environments: the road to the visual analytics science and technology challenge evaluation methodology. Inf Vis 2014; 13(4): 326–335.

59. Brath. 3D interactive information visualization: guidelines from experience and analysis of applications. In: Proceedings of HCI international ’97, San Francisco, CA, 24–29 August 1997, pp. 1–5. Elsevier.

60. Pillat, Valiati ERA, Freitas CMDS. Experimental study on evaluation of multidimensional information visualization techniques. In: Proceedings of the 2005 Latin American conference on human-computer interaction (CLIHC ’05), Cuernavaca, Mexico, 23–26 October 2005, pp. 20–30. New York: ACM Press.

61. Pretorius, Calitz, Greunen. The added value of eye tracking in the usability evaluation of a network management tool. In: Proceedings of the 2005 annual research conference of the South African institute of computer scientists and information technologists on IT research in developing countries (SAICSIT ’05), White River, South Africa, 2005, pp. 1–10. New York: ACM Press.

62. Kang Y-A, Görg, Stasko. How can visual analytics assist investigative analysis? Design implications from an evaluation. IEEE T Vis Comput Gr 2010; 17(5): 570–583.

63. Keim, Hao, Dayal. Hierarchical pixel bar charts. IEEE T Vis Comput Gr 2002; 8(3): 255–269.

64. Vliegen, van Wijk, van der Linden E-J. Visualizing business data with generalized treemaps. IEEE T Vis Comput Gr 2006; 12(5): 789–796.

65. Keim, Hao, Dayal. Value-cell bar charts for visualizing large transaction data sets. IEEE T Vis Comput Gr 2007; 13(4): 822–833.

66. Kratt, Strobelt, Deussen. Improving stability and compactness in street layout visualizations. In: Proceedings of the vision, modeling, and visualization workshop 2011, Berlin, 4–6 October 2011, pp. 285–292. Mitti, Germany: Eurographics Association.

67. Stasko, Zhang. Focus+context display and navigation techniques for enhancing radial, space-filling hierarchy visualizations. In: IEEE symposium on information visualization, Salt Lake City, UT, 9–10 October 2000, pp. 57–65. New York: IEEE.

68. Elmqvist, Dragicevic, Fekete J-D. Rolling the dice: multidimensional visual exploration using scatterplot matrix navigation. IEEE T Vis Comput Gr 2008; 14(6): 1141–1148.

69. Hao, Dayal, Sharma. Visual analytics of large multi-dimensional data using variable binned scatter plots. In: Proceedings of visualization and data analysis (eds Park, Hao, Wong), San Jose, CA, 17 January 2010, vol. 7530. Bellingham, WA: SPIE.