Abstract
BACKGROUND:
Marketing clothes online can be challenging for those who buy and sell them. They are products that people still feel the need to have direct contact with. Some of their attributes are not easily communicated digitally, such as the fit and the feel of the fabric. Besides, the lack of standardization of sizes practiced by the Brazilian clothing industry raises several doubts to the user.
OBJECTIVE:
The objective of this paper is to present the results of an investigation concerning the user experience (UX) resulting from the interaction with female e-commerce websites.
METHODS:
The methods were organized into two stages: exploratory and descriptive. The former consisted of an exploratory interview (19 interviewees), an online questionnaire (205 respondents), and semi-structured interviews with 5 fashion specialists. In the latter, 60 users were divided into two groups of 30 participants to undertake a usability test on an e-commerce website (dafiti.com.br) to evaluate the UX when buying clothes. Group A used a desktop/laptop and Group B used a smartphone. The users were then debriefed and answered the AttrakDiff2™ questionnaire.
RESULTS:
Users lacked technical knowledge on clothing attributes and were extremely wary about the size and fit information provided (60% in Group A and 83% in Group B were not sure about fit and size information). Surprisingly, the AttrakDiff2™ showed no significant difference between the mobile or the desktop/laptop UX (Group A: M = 4.0 SD = 0.1; Group B: M = 4.1 SD = 0.2).
CONCLUSIONS:
The most significant problem faced by the users concerned the fit and sizing information, as these attributes are usually poorly communicated digitally. In addition to being a usability problem, by inducing users to errors, its greatest impact was on the UX.
Introduction
Since the introduction of e-commerce in the 1990s, retail has been subject to constant changes as a result of the new market dynamics imposed by the emergence of new technologies. The online marketing of apparel may be a challenge due to a few attributes concerning the category, especially those related to fit and sizing. The objective of this study was to investigate the obstacles faced by users of apparel e-commerce websites.
As a result of her research, Bozzi [1] learned that one of the main problems when buying clothes online in Brazil is the lack of confidence in the information presented. There is a barrier between the physical and digital environments caused by the lack of direct contact, preventing the user from doing a complete inspection of the product. To investigate this matter, the study was organized into two stages: exploratory and descriptive. (A) Exploratory: an exploratory interview, an online questionnaire, and an interview with pattern cutting specialists; (B) Descriptive: a usability test and AttrakDiff2™ questionnaire.
This paper is organized as follows: the first part presents the current e-commerce scenario and an overview of the fashion and accessories category. Subsequently, the theoretical background, methodology, results, and finally the discussion and conclusions are presented.
The e-commerce scenario
According to The Nielsen Company [2] report, there are currently 4 billion people connected to the Internet (53% of the world’s population), and 1.66 billion are e-shoppers [3]. Apparel is the top seller, 61% of consumers reported having already purchased at least one product from this category, followed by 59% travel, 49% books/music/stationery, 47% IT & mobile, and 45% event tickets [2]. Advances in technology, logistics, and payment methods, the increasing access to mobile Internet, and the consumer’s demand for convenience, have created a global online shopping landscape that currently represents US$ 2.3 trillion [2].
The number of users in Brazil has been increasing significantly. In 2016, 116 million people, 64.7% of the population, had access to the Internet, according to the Pesquisa Nacional de Amostra Domiciliar Contínua 2016 [PNADC] –Continuous National Household Sample Survey [4].
Among those who have access to the Internet, 55.15 million are e-shoppers [5]. In 2017, e-commerce generated R$ 47.7 billion in revenue as a result of 111.2 million orders. In the first half of 2018, 27.4 million consumers in Brazil made at least one purchase online. This number represents an increase of 7.6% compared to the first half of 2017, meaning 4.5 million people bought online for the first time. The Fashion and Accessories category was the leader in order volume, representing 14% of the total. The Southeast Region is accountable for the highest number of online orders, representing 67.1% of the total [5].
Brazilian e-shoppers are 50.6% women and 49.4% men and 42.2 years old on average [5]. Classes C, D, and E account for 82.2% of online purchases. The concept of family income by social stratum of Critério Brasil was developed by the Brazilian Association of Research Companies [ABEP] [6] based on the PNADC developed by IBGE [4] as shown in Table 1.
Family income per social stratum according to Critério Brasil
Family income per social stratum according to Critério Brasil
Source: ABEP. Available from: http://www.abep.org/criterio-brasil
In the first half of 2018, the Fashion and Accessories category ranked second in the number of orders [5]. It has been the leading category since 2013 [7]. The category is in the seventh place in terms of revenue. This indicates that the monetary value invested by consumers is low, suggesting consumers’ unwillingness to make a high financial investment when buying clothes online.
Digital and non-digital attributes
When buying clothes online, users cannot try them non have direct contact with the product prior to purchase, and this is considered a discouraging factor for 84% of the users, according to SEBRAE-RJ [8]. Some so-called non-digital attributes, such as the texture and fit of the fabric, are poorly communicated online. Lal et al. [9] explain that there are two categories of product attributes: digital and non-digital. The digital ones are easily communicated through the Internet; as the price and dimensions of a book, there is no loss of information. These attributes may also include information previously acquired by users through direct contact with the product during a visit to a brick-and-mortar store.
The non-digital attributes include features that can only be assessed through direct contact with the product. In clothing, the touch and feel, key characteristics for product evaluation, can only be verified when there is haptic contact. Therefore, depending on the channel used, consumers may not have enough information before making a purchase.
Consumers’ impossibility to have prior direct contact with products possessing non-digital attributes can: (1) Act as a deterrent to the purchase; and (2) Increase operational costs derived from returns when there is a discrepancy between expectation and reality [9].
Usability
A website should meet the needs of the user, not the opposite. ISO 9241-11 [10] defines usability as: “the extent to which a system, product or service can be used by specified users to achieve specified goals with effectiveness, efficiency, and satisfaction in a specified context of use”. The International Standards Organization [ISO] definition does not treat usability as a product attribute, but as a dialogue with the users, with the task purpose, and with the environment where the task is taking place [11, 12]. Poor usability or its absence can cause problems with consequences that may both annoy or frustrate a user and negatively impact their experience. According to Moraes [13], as a concept, usability deals with the adequacy between the product and the tasks for which it is intended to perform, the suitability with the user, and with the context in which it will be used. Therefore, when usability is considered appropriately, it will contribute to the user’s completion of the intended task with efficiency, effectiveness, and satisfaction.
UX
The buying process does not always involve purely rational decisions; cultural values, the environment in which the individual is present, and the perception of risk are factors that can be determinant [14]. In this sense, usability-only approaches are limited, as they tend to treat products as mere tools with which users perform tasks [15]. Usability alone does not account for all aspects of human interaction with a system, interaction is not limited to efficiency, effectiveness, and satisfaction. Despite the update, at the time this study was conducted usability did not take into account the emotional aspects of the human-system interaction. Thus, it was essential to measure other variables: the user experience as a whole. ISO-9241 [16] defines the user experience (UX) as: “user perceptions and responses resulting from the use and/or anticipation of the use of a product, system or service.” UX is the result of this interaction that has a beginning and an end, producing in the user memories of this experience [17]. To measure the effectiveness, efficiency, satisfaction, and also the user’s perceptions and responses resulting from the human-system interaction, both usability and user experience were considered. This equation aims at a better understanding of the impacts that usability (or its absence) may cause the user experience when users make (or attempt to make) purchases online.
Methodology
As mentioned previously, the study was divided into two stages. The first stage consisted of an exploratory interview with a sample of the target population, a semi-structured interview with pattern cutting specialists, and an online questionnaire. Exploratory research is useful when there is little knowledge about the topic being addressed. The exploratory study seeks to know or construct important questions for the conclusion of the research [18].
The second stage, the descriptive stage, consisted of the application of a usability test followed by a debriefing session and the application of the AttrakDiff2™ questionnaire. Descriptive research is an intermediate study between exploratory and explanatory research, that is, it is not as preliminary as the first or as in-depth as the second. In this context, describing means identifying, reporting, comparing, among other aspects. Marconi et al. [19] emphasize that the descriptive research aims to observe the facts, register, analyze, classify and interpret them, without the researcher’s interference.
Exploratory stage
To understand the opinion of pattern cutting specialists, a semi-structured interview was conducted to elucidate questions about the research topic and to use this information to later elaborate a usability test.
From the information gathered in the first part of the exploratory stage, we came to the research questions: (1) What are the difficulties a user faces when trying to buy clothes online? (2) What information about a product has to be presented to overcome the lack of direct contact before the purchase? (3) To what extent is usability being considered as a guideline, taking into account the characteristics and singularities of the devices being used? (4) How does usability impact user experience?
Exploratory interview
An open interview was conducted to understand users’ perceptions when buying clothes online; it was also used as the basis for the elaboration of an online questionnaire. The interviews were conducted using the WhatsApp application; the questions and immediately answered by the interviewees, there were 19 respondents. The interview was divided into two parts: demographic and qualitative questions.
Online questionnaire
Google Forms was used to elaborate the online questionnaire, it contained open questions to gather more information on how users interact with apparel e-commerce website interfaces. The pilot questionnaire was sent on 03/23/16 to 12 people (5 men and 7 women) and concluded on 04/04/2016, 10 people responded. The final questionnaire was made available from 05/05 to 02/07/2016 and was separated into seven sections.
Interview with specialists
To obtain specialists’ views about the issues addressed in this research, semi-structured interviews were conducted. The interviews took place between April 7th and May 24th, 2016, and 6 specialists were interviewed.
Interview content analysis
The interview transcripts were grouped into 8 context units, which were determined according to the Content Analysis methodology [20]. The frequency of which each unit was then determined, the results of which can be found in Table 2.
Content analysis of interviews with specialists
Content analysis of interviews with specialists
Source: Bozzi, (1)
The descriptive stage consisted of a usability test followed by a debriefing session and the application of the AttrakDiff2™ questionnaire.
The usability tests
The usability test objectives were determined based on the results of the online questionnaire and the issues raised in the literature review and are outlined below.
Objective 1: Evaluate the clarity of the information presented in the size table(s). The results of the exploratory interview showed that the lack of trust in the information presented on websites is the main reason that prevents consumers from buying clothes online. The uncertainty about the fit and size and the perceived difficulty to exchange or return a product are impeding factors. Sizing was intensively addressed by the online questionnaire respondents, 81% stated that they do not buy online because they do not know if the product size is right, and 87% because they are unsure about the fit.
Objectives 2 and 3 were grouped because they deal with searching for product information: (2) To evaluate the findability (ease of finding) of the size table(s) and; (3) To examine how the product description (or its absence) influences the user’s purchasing decision. The search for information is part of the decision-making process [14, 21] when consumers intend to buy something, they analyze the brand and the characteristics of the product [21]. The degree of knowledge that the consumer has about a product directly influences their buying behavior [22, 23] as well as the intangibility the online environment poses and the lack of trust can discourage them. On the other hand, if they are well informed, they are more likely to make a purchase [24].
Objective 4: To understand the influence of price on the purchase decision. As there is no direct contact with the product prior to purchase, consumers have a greater perception of risk when buying online. Kim et al. [25] discuss that the perception of risk is directly related to the financial investment, the higher the investment, the higher is the perception of risk. This assumption was reinforced by the results of the exploratory interview and the online questionnaire.
Objective 5: To understand the influence of the complexity of the product. The type of product (shirt, shirt, pants, etc.) affects the fit. Items that are more adjusted to the body require a larger number of measurements to be constructed, and therefore need to have a larger range of sizes to fit the target audience. On the other hand, looser items can fit the same group of consumers with a smaller amount of size options [26].
The fabric of the garment has also to be taken into consideration. For example, knitted fabrics are more elastic and usually provide good fit and comfort [27]. Therefore, when developing the usability test task to fulfill this goal, users were asked to buy a knitted fabric blouse and a buttoned-up shirt. T-shirts and blouses are usually looser and more easily to fit the body, shirts are tighter and generally are produced using woven fabric and require a wider range of size options to fit a range of consumers.
Objective 6: Evaluate the findability of the exchange and return policy. Characteristics such as fabric texture and fit are poorly transmitted digitally; they are non-digital attributes [9]; clothing requires multi-sensory (tactile and visual) contact to be fully evaluated by consumers. This lack of direct contact before a purchase can lead to a lower level of consumer satisfaction and a greater perception of risk [28]. Therefore, consumers develop mechanisms to mitigate the impact of perceived risk and its negative consequences, one of which is to learn more about the guarantees offered by the brands [14, 21]. A clear and easy return and exchange policy, for example, can help improve the user experience and increase their trust.
The tasks used to carry out the usability tests were developed based on the objectives cited above. Six pilot tests were conducted between 01/20/17 and 03/20/2017, 3 on a desktop/laptop, and 3 on a smartphone.
The tests were all performed in the field; in places familiar to the volunteers such as the university campus, homes, or workplaces. The volunteers were asked about their preferred device to buy online, on a desktop/laptop, or a smartphone. They performed the test using their smartphone, or on a desktop/laptop with which they were familiar. This criterion was adopted so that the volunteers did not have to learn to use the device.
The four tasks were performed on the Dafiti website and were presented in writing. The volunteers were asked to perform one task at a time in the listed order. Once the volunteers completed the tasks, debriefing sessions were conducted to assess their perceptions regarding the usability test, and then they were asked to answer the AttrakDiff2™ questionnaire.
Choosing the website to conduct the usability tests
The website chosen to perform the tasks was dafiti.com.br. According to the Alexa.com and the Conversion rankings (June 2017, editions), dafiti.com.br was the top seller in Brazil. It was also the most cited website by the online questionnaire respondents.
Dafiti is a Brazilian online marketplace, the leader in Latin America with branches in Argentina, Chile, and Colombia. It sells around 110 thousand products which are divided into six categories [29]. Each product has a textual description with its name, technical description (fabric type, item code, etc.), seller identification, and photos complement the description. However, no standard is adopted, and some products had incomplete descriptions.
It markets products from an extensive range of brands; consequently, the user is encouraged to reflect on clothing size across brands. The users are also required to know their measurements when using FIT FINDER, or when referring to the various size tables. FIT FINDER is an embedded application to support users in choosing the appropriate size of clothing and accessories. Users are asked to supply information such as age, height, weight, body shape (waist, hip, bust), and their preferred brands, the application can then suggest a size. Several websites, such as Asos.com, use similar tools. Dafiti does not have physical stores; consequently, users are unable to have direct contact with the products prior to purchase.
AttrakDiff2™ questionnaire
The quality of products can be categorized as pragmatic and hedonic. The first refers to a product’s potential to support the goals of ‘doing’ [30]; as buying a sweater. The second is a product’s or service’s potential to support the pleasure of use and ownership, i.e., it is related to the goals of ‘being’, of how to be admired and stimulated.
Usability is a widely accepted concept [31]. ISO-9241-11:2018 [10] defines it as “the extent to which a system, product, or service can be used by specified users to achieve specified goals with effectiveness, efficiency, and satisfaction in a specified context of use”. Where efficiency is defined as the “resources used in relation to the results achieved”, effectiveness is the “accuracy and completeness with which users achieve specified goals”.
The norm has been recently updated and the main changes compared to the previous edition are as follows: The scope has been extended to include systems and services; A wider range of goals is considered, including personal outcomes and organizational outcomes; Efficiency has been defined in relation to the results achieved rather than in relation to accuracy and completeness with which users achieve goals; Satisfaction has been clarified to include a wider range of issues.
Probably the deepest change was the definition of satisfaction, it is now defined as the “extent to which the user’s physical, cognitive and emotional responses that result from the use of a system, product or service meet the user’s needs and expectations” it takes into account some aspects of the user experience, and satisfaction includes the extent to which the user experience that results from actual use meets the user’s needs and expectations. It also considers that the anticipated use can influence satisfaction with the actual use. The previous definition of satisfaction only mentioned the absence of discomfort. At the time this study was completed, usability was concerned mostly with tasks, objectives, and cognitive processing of information, it did not take into account the hedonic aspect of interaction with a product, which is closely related to the UX.
Therefore, AttrakDiff2™ was chosen to measure the UX, it is a questionnaire developed by Hassenzahl et al. [32], to address emotional aspects, caused by the interaction with a product. It is a semantic differential questionnaire composed of 28 items. Seven items are meant to capture the pragmatic quality (PQ), fourteen items the hedonic quality that is divided into identity (HQI) and stimulation (HQS), and 7 the attraction (ATT) (Tables 3 and 4). A 7-point Likert scale is used to evaluate the impressions caused by the user’s interaction with the product.
Portuguese version of AttrakDiff2 and results of desktop/laptop tests
Portuguese version of AttrakDiff2 and results of desktop/laptop tests
Source: Bozzi, [1].
Portuguese version of AttrakDiff2 and results of smartphone tests
Source: Bozzi, [1].
The questionnaire was developed in German and has an official English version. Attrakdiff2™ was translated from English to Portuguese by the researcher and validated by two experts in Portuguese-English/English-Portuguese translation. Once validated, 3 versions of the questionnaire were generated and were then submitted to a panel of LEUI (Laboratory of Ergonomics and Usability of Interfaces - PUC-Rio) researchers, to define the final version (Tables 3 and 4).
A task analysis is the process of describing tasks and is used to guide projects as it helps to anticipate the user’s performance [33]. This process decomposes tasks in a series of steps to make predictions about how difficult or easy they are and how much effort will likely be required to perform them, resulting in a list of steps needed to accomplish a specific goal. The basic measure of task complexity is the number of steps needed to complete it.
In HCI (human-computer interaction), task analysis can be used to: analyze the current situation of a system, redesign its project, or evaluate the outcomes of an intervention. When analyzing an existing computational system, task analysis can be very realistic, describing the user’s behavior in detail [34].
In summary, task analysis is a critical way to understand how users currently do something, as well as to identify obstacles and opportunities for improvement. The way people perform tasks usually bears little resemblance to how they would like to do them or how they might be most effective [35]. Therefore it is necessary to observe them in action.
Once all the usability tests were completed, all the videos were analyzed, and the tasks performed by each volunteer decomposed into steps. It was then possible to quantify the steps each volunteer took to reach the goals of the usability test tasks (Table 5). The tasks were organized into flowcharts, which served as a basis to evaluate the volunteer’s performance.
Number of steps users needed to complete each task
Number of steps users needed to complete each task
Source: Bozzi, [1].
The exploratory interview
19 people were interviewed, the answers to the two qualitative questions (“Report your experience when shopping online” and “What does an e-commerce website need to present to make confident to order?”) were grouped into 3 items: Trust, Price, and Delivery. The impact of these 3 factors is determinant to provide a positive user experience.
It was concluded from the interviewees’ statements that trust is their main concern when buying clothes online [36]. It is a subjective factor in the user’s interaction with the website and is associated with the hedonic qualities of a product. The lack of trust was related to insufficient product information, which, in some cases, prevents the task fulfillment and results in negative user experience.
There is a lack of standardization in the Brazilian sizing systems, and because of the impossibility of trying on the product to check the fit and size, consumers are not sure they will be happy with their order. However, studies show that users are willing to overcome this fear when purchasing cheaper products [25].
Online questionnaire
The majority of the respondents were women (68.5%), 205 people answered the questionnaire, aged from 18 to 69 years old. Out of all respondents, 48% had a graduate degree, 99.5% stated they had already purchased online (51% women and 49% men), and 0.5% had never bought online.
Convenience and the ease of comparing prices were the top reasons to buy online, 83% and 75%, respectively, 52% buy online because of lower prices.
The top-selling category was books/magazines and subscriptions, 66%, the fashion, and accessories category appeared in 3rd place, 52%. More than half of the respondents (65%) declared they had already bought clothes online, and 75% had made online purchases using a mobile phone.
Of the 45% who do not buy clothes online, the main reasons that prevent them from doing so are: not knowing if the clothes will be the right size (81%) and uncertainty about the fit (87%).
Interviews with pattern cutting specialists
Among the most mentioned context units in the interviews with the specialists were “Standardization” and “Information” representing, more than 50% of the total number of occurrences recorded using the Content Analysis method [20]. Among the other units, only “Industry” obtained more than 10% of the occurrences. It was considered relevant to only describe in detail the two most cited context units.
The “Standardization” context unit resulted in 13 registry units, there was special attention to two: “References for clothing construction” and “Need for standardization”. The first unit is related to the different references on which the Brazilian sizing systems are based. There was almost a consensus, according to the specialists, as the anthropometric studies took long to start being developed in Brazil, several foreign sources were used resulting in a variety of tables with different measurements and without considering the specificities of the Brazilians’ bodies. “If you go to the market today to look for a pattern cutting bibliography translated into Portuguese you will find a multitude of books in which the material is not standardized, there are differences in the names of the measurements, in the way they are measured.” “Regarding measurements, people here in Brazil had no reference, so they searched for references in several different places resulting in an absence of standard”. Besides, a literature review carried out by Bravo et al. revealed that anthropometric studies do not follow standard data collection procedures nor use the same measuring tools. For this reason, the results might affect precision reliability and accuracy [37].
The second registration unit, “Need for standardization” revealed different views among the specialists, some advocate for the standardization of the sizing tables so that the companies will follow the same numbering. Others did not think it necessary to have a sizing standard and that each brand should create its size table. Some believed that standardization is not possible.
The “Information” context unit generated 6 registration units, “Information and transparency for the consumer” accounted for almost 55% of the occurrences. In the view of some specialists, a brand may not have a standard size table, but it needs to tell its users what measurements are being used. “Getting to know the public” also proved to be an important subject for the specialists, representing 14.3% of the occurrences.
Among the other registration units, “The importance of benefiting the industry and “Vanity sizing 1 ”, were given special attention.
Descriptive stage
In the following sections, the results of the descriptive research phase will be described.
Sample profile
60 volunteers were recruited, 30 performed the test on a desktop/laptop (Group A), and 30 on a smartphone (Group B). Their ages ranged from 19 to 68 years old, average 33.3, 77% had an iOS smartphone, and 23% Android. 30% of respondents were graduates, 17% of respondents in group A were undergraduates, and 23% in group B, 23%.
Online purchase frequency
37% of group A reported buying online more than twice a month, 23% twice a month, 17% once a month, 13% once every 2 months, and 10% once every 4 months. In group B: 30% reported buying online more than twice a month, 23% twice a month, 23% once a month, 13% once every 2 months, 7% once every 4 months, and 3% once every 6 months. None of the volunteers said they did not shop online.
In group A, 83% of the volunteers had already purchased clothes online and 67% had already visited dafiti.com.br. In group B, 70% had already purchased clothes online and 63% had already visited dafiti.com.br.
Usability test
The usability tests generated both quantitative and qualitative data. Among the quantitative results were: the completion or non-completion of the tasks, the time to perform the tasks, what type of sizing table was consulted, and the number of steps to perform the task. During the tests, it was noted that the time for completing the tasks was not necessarily an indication of ease or difficulty, the delay in completing the tasks was often due to deeper and more careful navigation when choosing the products in tasks 1 and 2. The average navigation time on clothing and accessory websites is almost 4 times higher than in other categories [38].
The qualitative data derived from the volunteers’ comments during the execution of the test and the answers given in the debriefing session. The volunteers were not asked to make comments during the tests but were encouraged if they felt they needed to do so; the most relevant ones will be posteriorly described. The quantitative results of the descriptive research stage will be presented as follows.
In group A, 27 people completed task 1, 26 task 2, 27 task 3, and 24 task 4. Task 4 had the highest number of dropouts, and one person did not complete the task. The task was considered “not completed” when the volunteer declared the task was finished but did not reach the intended goal.
In group B, as in group A, the majority completed all four tasks. 28 people completed task 1, 27 task 2, 27 task 3, and 26 task 4. Group B had a larger number of people who did not complete the task. In both groups, A and B, there were similar numbers and there was no significant difference between them.
In task 4, “To be sure of your size, consult the size guide,” the volunteers were asked to look for a size guide to check their measurements and see if they had made the right choice in tasks 1 and 2. There were three ways of checking the measurements: FIT FINDER, the Size Guide, and the Complete Size Guide. FIT FINDER was the most used, by 43% of Groups A and B. In some cases, when accessing the Product Page, the Size Guide link was visible, but seconds later, while loading the page, this link was transformed in the link to access the FIT FINDER. Some volunteers did not notice this change and others were a bit confused.
On the mobile and the complete website, the Complete Size Guide could be accessed from the Size Guide page. On the desktop/laptop website, it could also be accessed by clicking on the Fashion Glossary link located in the website footer.
In group A, 13% of the volunteers and 20% of Group B did not find any of the three.
AttrakDiff2™ questionnaire
After the usability tests, the volunteers were asked to answer the AttrakDiff2™ questionnaire (Tables 3 and 4). Each pair of words has the objective of measuring the volunteer’s perception concerning certain qualities of the product, divided into pragmatic quality (PQ), hedonic quality identity (HQI), hedonic quality stimulation (HQS), and attraction (ATT). The responses were grouped according to the quality measured (Table 6), an average score for each word pair was calculated and then an average for each quality (PQ: M = 4.0 SD = 0.45, HQI: M = 4.1, SD = 0.47, HQS: M = 4.2, SD = 0.5, ATT: M = 3.8, SD = 0.42). The results of the two groups were also analyzed separately to verify if there was a difference in the desktop/laptop and smartphone user experience. There was no significant difference between the tests performed by groups A and B regarding UX (Group A: PQ: M = 4.0 SD = 0.57, HQI: M = 4.1, SD = 0.44, HQS: M = 4.1, SD = 0.52, ATT: M = 3.9, SD = 0.39. Group B: PQ: M = 4.1 SD = 0.3, HQI: M = 4.2, SD = 0.5, HQS: M = 4.2, SD = 0.49, ATT: M = 3.8, SD = 0.45). The lowest scores were attributed to the attraction item.
Summary of AttrakDiff2 results
Summary of AttrakDiff2 results
Source: Bozzi, [1].
After completing the tasks on the Dafiti website the volunteers were debriefed. There were 3 yes/no questions and the volunteers were asked to justify their answers. Following, the users’ most relevant answers to each question: Was the information in the table/guide clear? Yes □ No □ Why?
In group A, 40% answered that the information presented in the size table was not clear, in group B the percentage dropped to 33%. 25% of the volunteers in group A and 50% of B, did not find the tables. Among other reasons, was the absence of necessary, or expected information in the tables. “It could indicate how to take measurements, perhaps with a drawing of the body, because that part of the measurements always confuses me,” “I think weight and height are factors that can vary a lot, they do not necessarily indicate size. I prefer when there are measurements in cm.”
Other reasons for the negative answers were: the subjective perception of the measurements, “Because they are subjective measurements and what the other sees is different from my image in the mirror”. There was a volunteer who was not able to use the measurement system because of her height, there was no size available for her, “I do not know. I could not access it. Maybe it was because of my height (1.83 m)”. And there were still those who were confused about using the FIT FINDER system.
Depending on the product the volunteers selected; a different measuring method was presented to them, either the FIT FINDER or a link to the Size Guide.
60% of the volunteers in group A and 67% from group B answered positively to the question. For both groups, the appropriate information was the main reason for the positive perception: “Because I had the necessary information”, “Because the informed measurements were enough to know the size of the product”.
Some volunteers thought the information was clear but were not sure about their measurements: “But the result was wrong”, “The data were clear, the measurements, but I felt insecure about the accurateness of the results”. Besides the responses of who did not find the information in the size tables: “But I believe that the measurement option may not be so practical since I need time and a measuring tape.”
Do you trust the fit information? 60% of the volunteers in group A and 83% in group B responded negatively to the question. The main reason for the negative response in group B was the lack of identification with the photos, 40% of the answers, “Because the photos are taken on models, their stereotype is very different from real buyers”, “Not much, because in general, the photos of the models do not resemble my physical type.”
40% of the volunteers in group A, and 17% in group B were satisfied with the information about the fit of the products. Among the reasons was the identification with the photos, “Because the photos are very informative about the material and fit”, “With the image available it was possible to have an idea, but it would be better to have the clothes dressed on a person”.
It is also worth highlighting the responses of those who trusted the information provided by the website, “I know myself, and I feel secure about the information on the website”, “Because the suggested size usually matches the sizes I usually buy”.
Based on the information available, are you sure the chosen size is correct? From group A, 47% of the volunteers and 60% from group B gave negative answers to this question. The volunteers did not have enough information to be sure that the chosen size would fit them, “Because there is no way to know how it will look on your body from the given information,” “Not exactly. Because I have already made other purchases on the website that went wrong, precisely because they did not have enough information about the material/fabric of the clothes etc.”
In addition to not having enough information, other volunteers did not trust what was provided, “As much as I find my measurements in the (size) table, I always have doubts about manufacturing variations”, “Because measurements may be wrong, or information may be missing.”
Other reasons less frequently mentioned were: the perception of the difference in sizes among brands and the necessity of having to try on the product to be sure. “Because size varies a lot from brand to brand,” “I would only be sure after checking it when the product arrives at my home.”
From group A, 53% of volunteers and 40% of group B gave affirmative answers to the question. 63% of group A answered yes to this question because they felt trusted the information provided: “Because I found the size guide detailed enough”, “Because I trusted the information”. Only 17% of group B gave similar reasons.
In group B, the most frequently cited justification for the affirmative response was the use of prior purchase experience when choosing the size, 58% stated that they relied on their purchasing history.
The other debriefing questions are highlighted below.
The information on the website was: SUFFICIENT –INSUFFICIENT. Participants’ responses were based on a 7-point Likert scale ranging from 7 (sufficient information) to 1 (insufficient information). The mean was 5.2 and mode 6 for group A, and the mean 5 and mode 6 for group B. These results demonstrate that the general perception of the volunteers was that the website offered enough information. However, the responses to the other questions show otherwise.
Were the exchange policies clear? In group A, 67% thought that the Exchange and Returns Policies were clear and 87% in group B. It was observed that, during the usability test, some of the volunteers who answered negatively to this question did not find or did not read the policies.
Is price a decisive factor when buying online? 97% of the volunteers in group A and 67% in group B answered yes to this question.
What other factors influence your purchasing decision? Other financial factors, such as the shipping cost and the possibility of installment payments, were the most cited. Another highlight was the convenience that online shopping provides as well as offering online exclusive products.
How did you choose your size? The issue of size and fit was widely discussed by fashion specialists and users. 63% of Group A volunteers and 83% of group B used information obtained from previous purchases to choose their sizes: “I usually know that my size is L, so I selected the products without checking the size table”, “ Based on previous purchases”. 13% of both groups used the resources offered by the website (such as the Size Guide and FIT FINDER): “Based on the estimate offered by the website despite not being sure”, “By the product description”. 13% of group A stated that they had used both their previous purchase history and the resources offered by the website. Other answers ranged from “By intuition” to “Guessing”.
Syntheses
Usability test
Task 1: None of the volunteers performed the test in the minimum number of steps. This result is expected because in this task the volunteers were asked to simulate a purchase and to reach the goal, they navigated the website in search of a product that would meet the task requirements and also satisfy their taste. Most volunteers from Groups A and B, 87%, performed the task between 6 and 16 steps.
Task 2: The result was similar to task 1, none of the volunteers performed the test in the minimum number of steps. In this task, the volunteers were also asked to simulate a purchase. It is worth emphasizing how confused they were about the nomenclature of the products; task 1 consisted of simulating the purchase of a short sleeve blouse and task 2 of a long sleeve shirt. Most volunteers in Groups A and B performed the task between 6 and 16 steps. In general, volunteers were more familiar with the website because they had performed the first task.
Task 3: In Group A, 97% of the volunteers performed the task between 3 and 9 steps, although 33% performed it in the minimum possible number of steps, 3. In Group B, 90% performed between 2 and 13 steps, 2 steps, and 17% of the volunteers completed the task in 5 steps. The exchange policies could only be accessed from the footer of the page on the mobile website, while in the desktop/laptop version, it was possible to access it from 4 different links. This was the task performed most easily, although it was the one with a greater perceived difficulty: “It’s complicated”, “It’s a challenge to find out how to return products”.
Task 4: In Group A, 90% of the volunteers completed the task in up to 17 steps. In Group B, 90% completed it in up to 16 steps. While most found some sort of chart or size guide it did not necessarily mean they were sure about the information.
In tasks 1 and 2 it was noted that the large number of filters available were often misleading. Depending on the combination of active filters, there was a small number of results and the volunteers had trouble deactivating them or to figure out which ones were still active. This problem was worse in group B.
The size filter was especially confusing, there was a huge range of options to choose from, the volunteers were quite confused. Another problem with the filters were the results. When filtering for women’s shirts, among the results were other items that did not meet the inquiry.
The lack of technical knowledge about the products was also a barrier. There was a lot of confusion about the names of the products (not knowing the difference between a blouse and a shirt) and about the fabric.
Volunteers often resort to the website’s internal search, and it created more difficulty. The search tool of the Dafiti website only searches for products, when other terms are used such as Size Guide or Return Policy (much used in tasks 3 and 4) the system did not find any results. The search also does not consider spelling mistakes, typos, or similar terms.
The volunteers were very apprehensive about tasks 3 and 4, especially 3, “I have no idea where to find it (the Size Guide)”, “I do not know where to go to find the Size Guide”. As for task 4, some volunteers did not even know it was possible to return a product “Is there such a possibility?”
Attrakdiff2™ questionnaire
The overall results showed that the volunteers had a satisfactory experience in all aspects measured by AttrakDiff2 (PQ, HQI, HQS, and ATT). The lowest grades were attributed to the attraction quality, but even obtaining the lowest result there was no significant difference compared to the other measured qualities. The results indicated a similar UX on both versions of Dafiti’s website. Due to the complexity of the user experience, its evaluation tends to be equally complex [39]. In a future study, it would be interested to apply a second technique to compare the results.
Debriefing
There was no significant difference between Groups A and B, both groups were quite satisfied with the amount of information offered on the Dafiti.com.br website. However, the answers to the other questions contradict this perception that there was sufficient information available. As in the usability test conducted by [40], there was a perception that volunteers initially did not feel comfortable expressing negative feelings about their interaction, perhaps because it was the first question after the execution of the usability test, they possibly they felt they had to be softer in their response.
When asked if the information in the size table/guide was clear, the responses of the two groups were similar, 60% of Group A and 67% of Group B gave affirmative answers. However, one volunteer from Group A and two from Group B replied that the size table/guide contained clear information, but did they not trust the FIT FINDER results or the Size Guide information. This indicates that in spite of an affirmative answer, the volunteers were not sure about the information provided and the number of negative answers could have been higher if these answers were also taken into account.
On the Complete Size Guide page, there were explanations as to how the users should measure themselves, but the minority of the volunteers managed to access this page, one of them commented that this information would have been helpful when choosing the size.
The FIT FINDER calculated the size based on some information input by the users but it did not provide the product’s dimensions in cm and some volunteers felt insecure about this result and felt the need to consult a table with the product’s dimensions. Among the volunteers who used the information in the tables or guide, there was a portion who said they still had doubts regarding the size.
Less than half of Group A volunteers, 40%, and only 17% of Group B were satisfied with the fit information. The high rate of negative responses from Group B, 83%, indicates greater difficulty in visualizing this type of information on a smartphone. The most cited reasons in both groups were due to the lack of identification with the photos, the necessity to try on the product to be sure of the fit, and the lack of information.
It is worth mentioning that the volunteers who trusted the information on the fit of the product identified themselves with the photographs of the products, exactly the opposite of the group that answered negatively to this question. Fashion photography is undoubtedly an important supplier of images and one of the most powerful mediums when it comes to communicating images of men and women [41].
Another interesting finding was that the volunteers who were satisfied with the fit information chose their size based on their previous shopping experience rather than on the information provided by the website.
What determines the fit of clothes is the type of fabric and how it was cut. Knitted fabrics are more elastic and adjust more easily to the body. Woven fabrics, because they are less elastic, are usually more adjusted to the body. The information available on clothing websites, not only from Dafiti, gives little information about the fit characteristics, or these are extremely technical. Some websites, such as Asos.com, offer a short video to help the users visualize the products fit.
Group B was less sure than Group A regarding the chosen size, 60% against 47% of the volunteers gave negative answers. Once again, desktop/laptop users have shown a better understanding of the information displayed on the screen. However, almost half of the volunteers in Group A were not certain of their choice. Information related to the size and fit of a product is the most sensitive, as they are considered non-digital attributes [9] and are poorly communicated online. Volunteers did not necessarily make their size choices based on the information on the website, they relied on their previous experiences. Most of the volunteers in group B, 58%, gave an affirmative answer, they reported they used the knowledge acquired in previous experiences to choose the size.
As a way to reduce the perceived risk, consumers seek guarantees such as buying from well-known brands or from companies that offer good exchange policies [14, 21]. For the vast majority of the volunteers in both groups (67% in group A and 87% in group B) the exchange policies were clear. Some of the volunteers who gave a negative answer did not find the policies or did not read them.
It is worth emphasizing the density of information: the same text is used both on the mobile site and on the complete one. Although it did not have a negative effect on the volunteers, it would be interesting to test the comprehension of the text and to ask them to go through the exchange process. In the usability test, they were only asked to find the policies.
As has already been indicated in the exploratory interview and the online questionnaire, price is an important factor for e-shoppers. It is in fact the main reason for buying online. Payment facilities, shipping rates, sales, and discounts were also frequently mentioned.
Only 13% of the volunteers relied uniquely on the resources offered on the website to choose their size to perform tasks 1 and 2. This result reflects how difficult it can be to choose your size using only the information available on a website. The size issue is a worldwide problem, on the one hand, organizations are carrying out anthropometric studies to create more accurate sizing systems and establishing standards to communicate this information and on the other, the brands are using their tables to reach their target audiences. Often, the size tables can be manipulated to reflect vanity sizing, so that consumers have the false impression of wearing smaller sizes according to an aesthetic requirement imposed by society [26, 42].
This lack of standardization confuses the consumer, and the inability to properly inspect the product before purchasing it results in a high rate of returns [43] and unsatisfied consumers. In a highly globalized world where products are largely imported and exported, it is paramount to communicate more effectively the size and fit information.
Discussion
Based on the interviewees’ statements during the exploratory interviews, it may be concluded that trust is the main barrier faced by online consumers, especially when buying clothes. Trust is a hedonic quality related to the perceived ability to support the user in achieving goals of being [44].
Fashion and accessories, despite being the top-selling category, ranks 6th place regarding revenue [45]. This number suggests that the average ticket is low, corroborating the interviewees’ statements, indicating that low prices decrease the risk perception. Besides, studies have shown that users are willing to risk more when purchasing products with low monetary value [25].
The perception of risk is mainly due to the lack of trust in the sizing systems adopted in Brazil. According to the interviewed experts, there is not an established standard to follow and as a result, products have different sizes across brands. The fear of buying a poorly fitted product is emphasized by the lack of direct contact and by insufficient product information. Better product descriptions and presentations could contribute to better-informed buying decisions. There is a gap between the demands of the users to standardize sizes and the opinions of the specialists. According to the specialists, as long as brands disclose their sizing systems, it is fine to create their own, as it is very unlikely that standardization will become a reality.
Lack of trust can prevent the task from being completed. In addition to affecting usability, the lack of a sizing standard and insufficient product information affect the user experience. UX, an abbreviation of the term, encompasses all aspects of the end-user’s perception when interacting with a company and its products and/or services [46]. This perception includes the effectiveness, efficiency, emotional satisfaction, and quality of the companies’ relationship with the user [47]. These aspects of the interaction with apparel e-commerce are independent of the device; it extrapolates the media through which the user is accessing the website.
During the usability tests, it was possible to observe that users face difficulties when buying clothes online. This difficulty was not limited to the interaction with the system but it was perceived even before it and affects the user’s expectations influencing their experience. The UX is the user’s perceptions and responses resulting from the use and/or anticipation of the use of a product, system, or service [48].
Problems related to the sizing systems in Brazil offer a number of factors to be considered [42]. The lack of standardization can be seen among products from the same brand and it influences directly the consumer’s purchasing decision process. In general, one has the perception that this disparity happens more in women’s clothing, according to Chun [26], women report more problems than men. The author argues that this is partially due to the wide variety of styles available in women’s clothing. This scenario worsened because of the common practice of outsourcing production in the fashion business. In Brazil, brands often outsource their production to small companies that do not necessarily have a standardized product development process [42]. The results of the descriptive phase did not indicate a significant difference between the desktop/laptop and the smartphone UX. Only when asked: “Do you trust the information on the fit of the products?” and “Based on the information, available are you sure the chosen size is correct?”, there were more negative answers from the group who performed the tasks on a smartphone (Group B). Due to the screen size, a smaller amount of information is presented at once on a smartphone. The cognitive load to process this information is greater and the number of times needed to scroll the page is also higher, demanding more of the user’s short-term memory [49].
Limitations
Despite the efforts outlined in this article, this study has several limitations that may provide directions for future research. First, the volunteers could choose the device of their preference to perform the usability test. It would be interesting to exchange the groups and understand how this could interfere with the results.
The sample was composed of people with a high educational background; future studies could address this limitation by using a more diverse sample regarding this issue.
Also, the scope of this study is limited due to its focus on how women buy clothes online. Future research could carry out usability tests on a male population.
Conclusion
Convenience and financial factors seem to be the main drivers of online shopping. The main obstacles are the sizing inconsistencies and the difficulty of communicating fit and sizing information. The volunteers were less certain about the fit than about size. Some consumers still find it necessary to touch and handle the product. Good product photos are essential for a complete product presentation. The lack of identification with the models in the photos was one of the factors that negatively influenced the user experience.
There was no significant difference between the mobile and the desktop/laptop UX. However, the information architecture on the mobile website was less intuitive than on the desktop/laptop. The volunteers who performed the usability tests using smartphones had more difficulty finding the Return Policies and the Size Table/Guide. There were fewer ways to perform the same tasks on the mobile site as on the desktop/laptop website. The direct transposition of projects from complete sites to mobile websites hinders access to information and therefore interfering with the purchasing process.
Volunteers lacked technical knowledge about the products, the methods of how to measure their bodies and the composition, and types of fabric.
Returning or exchanging a product is perceived as a task that involves a high degree of difficulty, even though it was the one most easily accomplished.
When choosing their size, the volunteers did not use the information provided by the website, they used the knowledge obtained from past shopping experiences. This happened because they did not trust the information, did not understand it, or did not find it.
The size standardization matter is a controversial subject, specialists are mostly against it, and they believe that brands should be able to develop sizing systems according to their target audience.
Conflict of interest
None to report.
Footnotes
“Vanity sizing” is the manipulation of size labels, where the printed size does not correspond to the real one, aiming to sell larger pieces as if they were smaller.
