An approach to the classification of educational chatbots

Abstract

Nowadays, chatbots have become popular tools in such a way that they are used in different sectors like commercial, elderly care, tourism, and education. The COVID-19 pandemic has forced many students and teachers to suspend face-to-face classes. Therefore, schools and governments have found it necessary to continue education remotely, using the resources provided by the Internet. This fact has created a greater interest in educational chatbots, so several projects have been proposed to develop these academic tools, each following its way of implementation and addressing issues from different points of view. This paper presents a proposal for chatbot classification, following the Systematic Mapping Study and an iterative method to review and classify educational chatbots. We also discuss the resulting categories and their characteristics and limitations and possible uses by developers and researchers.

Keywords

Chatbot classification chatbot-user interaction education scholar tools

1 Introduction

A chatbot is a software that interacts with humans using natural language [1]. In 1966, one of the pioneers in this field, Weizenbaum [2], presented his famous ELIZA chatbot, whose goal was to hold a conversation with a human being. ELIZA is based on templates and pattern matching techniques but lacks reasoning capabilities. The design of this chatbot is limited to the technology and knowledge of that time. However, this former and the subsequent research works are the foundations for the design of current chatbots.

Modern chatbots are capable of understanding the context of a conversation and learning from it, improving themselves over time thanks to the use of Machine Learning techniques [3] and novel computer architectures [4].

In customer service departments, the use of chatbots is constantly increasing. Thus, enterprises have plans to launch their own versions in the short term [5]. For this reason, companies do research in this field, in order to stay ahead and reduce costs [4].

With the arrival of the COVID-19 pandemic, governments were forced to implement mechanisms for students to continue with their education. Although the schools closed their facilities, the courses continue. The concern of governments and authorities in charge of education focused on not losing the school year. Consequently, students have had to adapt their academy training conditions and thus continue their classes online. This abrupt change directly have affected the traditional educational models that mainly focus on face-to-face classes [6].

Chatbots for the educational sector are not new at all. Before the COVID-19 pandemic, there were already proposals that involved chatbots in said domain, e.g., T-Bot/Q-Bot [7], CHARLIE [8], UMT-BOT [9], FIT-EBot [10], LiSA [11], DINA [12], and Ubibot [13] to name a few. However, educational chatbots are still limited [14], and the research that has been done is not so extensive.

In the terminology by Gregor [15] and Nickerson et al. [16], the concepts of typology, framework, taxonomy, and classification are interchangeably. For our purposes, in this paper we used the term classification.

A classification of objects is a basic mechanism for organizing knowledge [17]. Some benefits are analyzing and understanding complex domains [16], providing structure and organization of knowledge in a field [18], implementing an order that allows establishing relationships between concepts [19], and understanding the differences between previous research works [20].

The study and development of chatbots is a topic of great importance and popularity today, and it seems unlikely that this trend will change in the coming years. That is why it is essential to identify the characteristics and classifications of proposals in the area, especially in the educational domain, as it is a fertile field in which artificial intelligence can have a significant impact. However, to do so, it is necessary to have solid methodologies that allow gathering a representative sample of the proposals in question, and it was thanks to the Nickerson et al. [16] method and the Systematic Mapping Study (SMS) [21] that we were able to achieve it.

The originality of our work lies in the classification per se because educational chatbots classifications are practically non-existent. The few examples found in state of the art are classifications that have been carried out with ad hoc or trivial methodologies, which leads to possible biased results and very poor reproducibility.

We hope that the dimensions we identify will be of great value to specialists in the area, as they can serve as a guide to improve implementations of educational chatbots and for researchers who want to enter the field to obtain an overview of the work carried out.

This article is structured as follows. After presenting in Section 2 the related work, Section 3 exposes the iterative development of the adopted classification method. Then, in Section 4, we detail the resulting classification of chatbots, focusing on its dimensions and characteristics. In Section 5, we discuss our results. Finally, Section 6 presents conclusions and future work.

2 Related work

Chen et al. [22] classify chatbots into two main groups: task-oriented and non-task-oriented. The former chatbots help the user to complete a task, e.g., search for a product or have a short conversation in a closed domain. The latter chatbots interact with the user following a question/answer approach for ludic purposes, usually in an open domain.

Nuruzzaman and Hussain [23] divide chatbot applications into four categories: goal-based, knowledge-based, service-based, and response generated-based. Goal-based chatbots are designed for a specific task and a short conversation, e.g., answer/question approach or problem solving for customers in a website. Knowledge-based chatbots are intended for both open and closed domains, e.g., answer to general and particular topics, respectively. Service-based chatbots provide customers with facilities, e.g., orders to food stores. Response generated-based chatbots focus on how to answer user questions, i.e., a prioritized answer is returned, and it is chosen depending on what is established in the model’s policy.

Gnewuch et al. [24] propose a bidimensional classification: primary mode of communication and context. The former dimension indicates the modality of interaction with the chatbot, i.e., text or voice. The latter dimension focuses on a specific domain or a conversation topic with its users.

Ramesh et al. [25] suggest a six groups classification: retrieval-based, generative-based, shor-text conversation, long-text conversation, open domain, and closed domain. Retrieval-based chatbots have a set of predetermined answers and to give an appropriate one they make use of heuristics. Generative-based model avoids depending on predefined answers to generate new answers. Shor-text conversation refers to a succinct answer when the chatbot receives a specific question. Long-text conversation means that the chatbot can have a lasting chat. Open domain refers to the possibility in a discussion to switch between different domains. Closed domain indicates a specific knowledge, therefore, an appropriate response is desirable.

Diederich et al. [26] present a taxonomy of eleven-dimensions for chatbot: communication mode, context, language, intelligence, implementation, hosting, pricing model, reporting, sentiment detection, enterprise integration, and platform integration. The communication mode and context are extends of Gnewuch et al. [24] classification. Language indicates whether the chatbot can support one o more languages. Intelligence refers to whether the chatbot is based on rules, such as pattern matching or self-learning skills. Implementation indicates the technology used for the development of the chatbot. Hosting is the platform where the chatbot is deployed. Pricing model indicates the price to be payed for using the platform, e.g., Microsoft Azure Bot has a cost depending on the number of interactions and Dialogflow has a limited free version. Reporting considers whether the platform has a monitor to know the details of interactions, users, and number of conversations. Sentiment detection indicates whether the platform supports the detection of user sentiments in a conversation. Finally, enterprise integration means that the platform offers an APIs (Application Programming Interface) or pre-built interfaces.

Nimavat and Champaneria [27] classify chatbots in four groups: knowledge domain, service provided, goals, and input proccesing and response generation method. Knowledge domain refers to whether the chatbot can interact in an open or closed domain. Service provided is based on a proxemics subclassification: interpersonal, intrapersonal, and interagent. Goals involves a subclassification based on a primary goal: informative, chatbot based/conversational, and task based. Input processing and response generation method indicates the procedures to handle inputs and generate answers.

Hussain et al. [28] establish the following groups for their classification: interaction mode (text-based or voice/speech-based), chatbot application (task-oriented or non-task-oriented), rule-based or AI (machine learning, deep learning or templates), and domain (specific or open).

Adamopoulou and Moussiades [29] propose seven categories of chatbots. The first four categories are the same as presented by Nimavat and Champaneria [27]. The human-aid category refers to the need of flexibility, so the operations are carried out by the chatbot with human intervention. Permissions considers whether the chatbot is open source or commercial. Communication channel depends on the interaction modality: text, voice, or image.

Quiroga Pérez et al. [30] mention a classification of educational chatbots with two categories: service-oriented and teaching-oriented. The former chatbots are focused on service support, such as, FAQs. The latter chatbots are divided into formal and informal ones.

As can be seen, there are relatively few chatbot classifications, with most works presented as surveys [22 , 28–30]. This is a limitation because those classifications did not present in detail the process to get them.

On the other hand, some authors use the method proposed by Nickerson et al. [16] for the development of their classification, e.g., Diederich et al. [26], Feine et al. [31], Janssen et al. [32, 33], and Bittner et al. [34]. However, these are not educational chatbot classifications.

It is important to mention that educational chatbot classifications are almost nonexistent, except for the work by Quiroga Pérez et al. [30], that classify educational chatbots in two categories, as mentioned before. However, they do not give details of their classification process.

3 Iterative classification process

In this section, Nickerson et al. [16] method is presented in a general way, highlighting important theoretical elements. Then, we show the development of the step-by-step method to obtain classification of educational chatbots.

3.1 General theoretical elements

For the development of our classification, one of the most important points was the choice of Nickerson et al. [16] method. We consider that this method is adequate to formalize the process because it is flexible and has a solid foundation that is established by objective and subjective ending conditions, steps, and iterations to obtain a concrete result. Nevertheless, it can be confusing when used for the first time because, according to Nickerson et al. [16], the subjective conditions are difficult to identify and apply, and evaluating them requires the insight, experience, and skills of researchers.

The method uses terms as: dimensions and characteristics. Dimensions can be viewed as variables and characteristics as instances of variables.

Fig. 1 presents the seven-step method of Nickerson et al. [16]: (1) identifying the meta-characteristic, which refers to the purpose of the classification; (2) determining ending conditions, which serve as a guide to stop the classification process; (3) selecting an approach: empirical-to-conceptual (new objects are examined to determine whether features are sufficient or new features and possibly new dimensions are needed) or conceptual-to-empirical (it begins by conceptualizing dimensions without examining real objects); depending on the selected approach steps 4-6, may change: (4e) identifying subset of objects by recognizing the characteristics from a systematic sample; (5e) identifying common characteristics and grouping objects; (6e) grouping characteristics into dimensions to create a taxonomy; or (4c) conceptualizing characteristics and dimensions of objects; (5c) examining objects for these characteristics and dimensions; (6c) creating a taxonomy; and (7) asking whether the ending conditions have been met.

Fig. 1

Nickerson et al. [16] seven-step method.

There are eight possible objective ending conditions:

All objects, or a representative sample, have been examined.

No object was merged with a similar object or splitted into multiple objects in the last iteration.

At least one object corresponds to each characteristic of every dimension.

No new characteristics or dimensions were added in the last iteration.

No characteristics or dimensions were merged or splitted in the last iteration.

Every dimension is unique.

Every characteristic is unique within its dimension.

Each cell (combination of characteristics) is unique.

As for the subjective ending conditions, there are five possibilities:

Concise: it indicates the number of dimensions that allow the classification to be established.

Robust: it means that the characteristics and dimensions identified are enough to differentiate the objects of interest.

Comprehensive: it indicates that all analyzed objects can be classified within the domain under consideration or the classification includes all dimensions of the objects of interest.

Extendible: it refers to the possibility of adding new characteristics and new dimensions.

Explanatory: it provides a useful explanation of the objects of study.

It is worth mentioning that step 2 is determined by the objective and subjective ending conditions; once they are satisfied, the iteration process stops. To get to the end of the method, it is necessary to check that the conditions are met; otherwise a new iteration must be started from step 3. The method is flexible, since not all objective and subjective ending conditions have to be fulfilled, i.e., it is possible to choose from those conditions.

A classification may not be perfect but it is useful if we want to explain the nature of the objects in an study [16].

3.2 Applying the method

Table 1 presents the number of iterations that we got and how the objective and subjective conditions were met. We start by showing the development of the first iteration.

Table 1
Detail of the iterations with respect to the objective and subjective ending conditions [16]

Objective ending conditions Iteration 1 Iteration 2 Iteration 3

All objects, or a representative sample, have been examined. ✓ ✓ ✓

At least one of them corresponds to every characteristics of every dimension. ✓ ✓

No new dimensions or characteristics were added in the last iteration. ✓

No dimensions or characteristics were merged or splitted in the last iteration. ✓

Every dimension is unique and not repeated. ✓ ✓ ✓

Subjective ending conditions

Concise ✓ ✓

Robust ✓ ✓

Comprehensive ✓ ✓

Extendible ✓ ✓ ✓

Explanatory ✓ ✓

Objective ending conditions	Iteration 1	Iteration 2	Iteration 3
All objects, or a representative sample, have been examined.	✓	✓	✓
At least one of them corresponds to every characteristics of every dimension.		✓	✓
No new dimensions or characteristics were added in the last iteration.			✓
No dimensions or characteristics were merged or splitted in the last iteration.			✓
Every dimension is unique and not repeated.	✓	✓	✓
Subjective ending conditions
Concise		✓	✓
Robust		✓	✓
Comprehensive		✓	✓
Extendible	✓	✓	✓
Explanatory		✓	✓

Step 1: we are interested in the features that educational chatbots can have with respect to functionality, that is our meta-characteristic. Therefore, each characteristic obtained in the subsequent steps must be logically consistent with our meta-characteristic.

Step 2: the objective ending conditions that we adopted were: 1, and 3 to 6. We accepted all five subjective ending conditions.

Step 3; in this step the first iteration begins, but it is necessary to select an approach. We decided to select the empirical-to-conceptual approach, because we are going to classify chatbots into categories based on their characteristics, i.e., when we obtain our sample of educational chatbot works in the scientific literature, we will identify their similarities and analyze whether there are relationships between them or not.

Step 4: we use the SMS [21] to get our sample, because it allows us to develop a broad review of primary studies in a specific domain and identify the evidence available on the topic [35].

The research question (RQ) is What kinds of chatbots are in the education domain?

The inclusion, exclusion and selection criteria are needed.

We define the following inclusion criteria are: a) the proposal was published from 2006 onwards; b) the work contains relevant terms in the title, abstract, or keywords; and c) the proposal is focused on educational chatbots.

As for the exclusion criteria we have: a) the work is duplicated; b) the proposal does not satisfy the RQ; c) the work is not from a journal, conference or workshop; d) the proposal is not written in English or Spanish language; and e) the work is a survey or a systematic review.

Our selection criteria are: a) apply inclusion and exclusion criteria in documents; b) in the remaining proposals, read the conclusion to look for properties that we need and that were not identified in the abstract; and c) obtain a refined list of articles and proceed to read them taking into account the inclusion and exclusion criteria, but now on the main body of the paper.

The search expression for the RQ is: (education OR hci) AND (chatbot OR chatter-bot OR “conversational agent”) AND (universit* OR institut* OR college OR academy OR “high school” OR teach* OR student OR class).

We use the ACM Digital Library, IEEE Xplore, Science Direct, Scopus, and Springer Link search databases to obtain our sample of proposals.

In the first search we got 1703 results. Applying the inclusion, exclusion, and selection criteria, the number was reduced to a total of 45 papers: T-Bot/Q-Bot, Bigham et al. [36], CHARLIE [8], INES [37], Bhargava and Maheshwari [38], Niranjan et al. [39], Gómez Róspide and Puente [40], UMT-BOT [9], Chatbot [41], NLAST [42], MALL [43], Song et al. [44], Bala et al. [45], Dutta [46], Ranoliya et al. [47], EASElective [48], FIT-EBot [10], DINA [12], LiSA [11], EduBot [49], English Practice [50], Clarizia et al. [51], Ubibot [13], LTKA-Bot [52], Chatbot [53], Mikic-Fonte et al. [54], CiSA [55], Reyes et al. [56], KEMTbot [57], Lee et al. [58], Goschlberger and Brandstetter [59], Oliveira et al. [60], Pereira et al. [61], Doly [62], Tribubot [63], QuizBot [64], Nguyen et al. [65], Lecturer’s Apprentice [66], Sreelakshmi et al. [67], Pereira and Barcina [68], E-orientation [69], TutorDocente [70], Infobot [71], Mekni et al. [72], and CultureBot [73].

Step 5: we identify the following characteristics: Evaluation, Feedback, Information, FAQs (Frequently Asked Questions), Procedures, Courses, LCMS (Learning Content Management Systems), Software, Subjects, Q&A (Questions and Answers), Tutorship, Schools, e-Learning, Users, and Universities.

Step 6: the grouping of characteristics in dimension is defined as follows. The School Service-Oriented dimension has the next characteristics: Information, FAQs, and Schedule. On the other side, the e-Learning-Oriented dimension has the following characteristics: Courses, LCMS (Learning Content Management Systems), and Software. Finally, the Student-Oriented dimension has the Evaluation, FAQs, Feedback, Q&A, Subjects, Schedule, and Tutorships characteristics.

Step 7: two objective and four subjective ending conditions have not been met yet. Therefore, it is necessary a new iteration, starting from step 3.

In the second iteration, we also select the empirical-to-conceptual approach for the same reason as the first iteration. Continuing the steps 4 to 6, we identify the Teacher dimension. To make the process concise and comprehensive, we group and rename the Student dimension as Student/Teacher-Orientation, and it has the following new characteristics: Reports, Support, and Topics. Furthermore, we identify a new characteristic, Procedures, for the School Service-Oriented dimension. The ending conditions have not been met, because a new dimension has been created in this iteration. Therefore, a new iteration is necessary.

In the third iteration, we also select the empirical-to-conceptual approach and no new dimension and characteristics have been found in subsequent steps. Thus, the classification method does not need a new iteration, because the objectives and subjectives ending condition were met.

4 Resulting classification of educational chatbots

After applying the Nickerson et al. method, we found the three dimensions and fifteen characteristics, which are presented in Fig. 2.

Fig. 2

Structure of the chatbots classification and their characteristics in education domain.

The School Service-Oriented dimension groups chatbots whose main function is to provide general information, FAQs, schedules, or procedures. This type of chatbots is useful for educational institutions, as they can provide a complete service to both the community and external users that need more information about fees, or educational offer. The main advantages of this type of chatbots are 24/7 availability, reduction of workload for the staff, handling many students at the same time, and accessibility from a mobile device. The characteristics of this dimension are:

Information: the chatbot provides users with information, e.g., educational offer, directory, and study plans.

FAQ: questions and answers that are common for users. This characteristic is also found in the Student/Teacher-Oriented dimension.

Procedures: a guide for students to carry out a specific procedure, e.g., how to enroll in a class or what is needed to get certified. This characteristic is also found in the Student/Teacher-Oriented dimension.

Schedule: specific information on activities, e.g., academic events and evaluations.

The e-Learning-Oriented dimension covers chatbots that were designed for MOOCs (Massive Open Online Course), LCMS (Learning Content Management Systems), and education-oriented software, e.g., Moodle, that students use remotely. This type of chatbots are a complement of massive courses that do not necessarily belong to a formal academic environment, e.g., classes of English as a foreign language. The characteristics of this dimension are:

Courses: when a chatbot is an ad hoc implementation where users can enroll remotely, e.g., massive courses on a foreign language learning.

LCMS: those chatbots that are designed to be integrated into an existing MOOC platform.

Software: chatbots that are one more tool in a specialized system and can generally be accessed through the Web.

The Student/Teacher-Oriented dimension groups chatbots that not only interact with students, but also with teachers. We found several elements as: evaluation, FAQs, feedback, Q&A, reports, schedule, subjects, suppport, topics, and tutorships. The characteristics of this dimension are:

Evaluation: assessment tools for students, e.g., exams, homework, quizzes, practices, and essays.

Feedback: the chatbot provides feedback to students according to their progress in class.

Q&A: it has specific questions and concrete answers to the student.

Reports: details provided by the chatbot to the teacher about the progress of their students.

Subjects: interacting with the student about the classes they have registered.

Support: it offers some kind of support to the student, e.g., how to connect an electronic device to the laboratory network.

Topics: answering questions on specific topics, e.g., complexity of a sorting algorithm.

Tutorships: it offers students some form of educational or personal orientation.

Table 2 represents the relative and absolute frequencies of the characteristics that we identified in our classification. From the 45 papers analyzed, 24.44% belongs to School Service-Oriented, 22.22% to e-Learning-Oriented, and 53.34% to Student/Teacher-Oriented (see Fig. 3). As each paper can be identified with more than one characteristic, the number of characteristic incidences was 89. The Papers column represents the percentage of characteristics based on the number of papers in a given dimension (relative) and the total number of papers (absolute). Similarly, the Incidence column was calculated using the number of incidences in a given dimension (relative) and the total number of incidences (absolute).

Table 2

Relative and absolute frequencies of the characteristics among 45 educative chatbots papers and 89 characteristic incidences

Dimension	Characteristic	Papers (n=45)		Incidence (m=89)
		Relative	Absolute	Relative	Absolute
School Service-Oriented (n=11, m=18)	Information (m=9)	81.81%	20%	50%	10.11%
	Schedule (m=1)	9.09%	2.22%	5.55%	1.12%
	FAQ (m=6)	54.54%	13.33%	33.33%	6.74%
	Procedures (m=2)	18.18%	4.44%	11.11%	2.24%
e-Learning-Oriented (n=10, m=9)	Courses (m=3)	30%	6.66%	33.33%	3.37%
	LCMS (m=2)	20%	4.44%	22.22%	2.24%
	Software (m=4)	40%	8.88%	44.44%	4.49%
Student/Teacher-Oriented (n=24, m=62)	Evaluation (m=10)	41.66%	22.22%	16.12%	11.23%
	Feedback (m=9)	37.5%	20%	14.51%	10.11%
	Reports (m=1)	4.16%	2.22%	1.61%	1.12%
	Subjects (m=9)	37.5%	20%	14.51%	10.11%
	Topics (m=11)	45.83%	24.44%	17.74%	12.35%
	FAQ (m=2)	8.33%	4.44%	3.22%	2.24%
	Q&A (m=15)	62.5%	33.33%	24.19%	16.85%
	Schedule (m=2)	8.33%	4.44%	3.22%	2.24%
	Support (m=2)	8.33%	4.44%	3.22%	2.24%
	Tutorships (m=1)	4.16%	2.22%	1.61%	1.12%

Fig. 3

Education-focused chatbot classification.

5 Discussion

As can be seen in Section 2, most of the works followed an ad hoc methodology, which prevents its use in other domains (e.g., educational chatbots) and makes it more challenging to replicate the results they obtained. Instead, the Nickerson et al. [16] method establishes general elements that facilitate its use for different domains. Thus, our proposal has the advantage of reproducibility, as the research question, search expression, inclusion, exclusion, and selection criteria are well established and laid in sound foundations. Furthermore, choosing between empirical/inductive and conceptual/inductive approaches gives us flexibility and versatility to develop our classification.

From our proposal, three groups emerge that classify chatbots according to their characteristics. We not only show how these groups were reached but also the particularities of each one. This is an advance over other works with much less detailed groups, such as the one proposed by Quiroga Pérez et al. [30]. Unlike those proposals that follow arbitrary or ad hoc methods, our classification allowed us to represent many more chatbots in a characteristic group without the need to create small clusters that can be trivial. That is because we used a systematic process [16].

Regarding the three classifications that resulted from our iterations (see Fig. 2), we can mention that they are related according to the granularity of the information that the chatbot can send and receive. The classification that contains the chatbots that are oriented to more general tasks and therefore handle basic information is found in School Service-Oriented since the target users usually include a general public looking for rudimentary information. On the contrary, the most specialized chatbots are classified as Student/Teacher-Oriented. They fill the need for more personalized interactions and carry out much more precise tasks, since they are aimed at users as individuals. The chatbots within e-Learning-Oriented are generally somewhere in the middle. Although they also serve a broad population, their use context is more limited.

In the classification of the works shown in Fig. 3, we can observe that there is a clear trend towards the development and study of chatbots that focus on students and teachers. This may be due to the way these chatbots interact with people. As we already mentioned, the need to provide more personalized attention to specific users in specific contexts opens up broader possibilities for research, since problems arise with the capabilities inherent to artificial intelligence and those related to software engineering, usability, and user experience.

According to Nimavat and Champaneria [27] and Adamopoulou and Moussiades [29], it is possible to have multiple categories for a chatbot, e.g., both KEMTbot [57] and TutorDocente [70] are within School Service-Oriented and Student/Teacher-Oriented. However, these cases are rare, because chatbots in general are limited in giving predefined answers from a database or answer questions on closed domains [23].

We can analyze the implications of our clasification from two points of view. The first benefits developers, since we identify the basic interaction mechanisms they must implement according to their development context. For example, if they develop a chatbot for a high school class, which both students and teachers will use, then our Student/Teacher-Oriented group is relevant because it tells them that they have to create mechanisms that satisfy the characteristics of that classification.

On the other hand, our proposal also impacts research groups since we create a base for more research on the classification of education-oriented chatbots. From our search expression, inclusion, exclusion, and selection criteria, as well as our approach and ending conditions, researchers may find more features groups in years to come.

A limitation of the Nickerson et al. [16] method is that it is qualitative, so it is not possible to carry out a formal analysis in a quantitative way. However, from the analysis that we carried out in Table 2, we can say that the most frequent characteristics of our sample of works were those of Information, Evaluation, Feedback, Subjects, Topics, and Q&A. This is fascinating because it represents a mixture of more automation-oriented functions with those that are much more advanced and require the chatbot to participate in the teaching/learning process actively.

On the other hand, it seems that there is a tendency for research in these rudimentary functions to disappear since the least frequent features were Schedule, Procedures, Reports, FAQ, and Support. This may be since new technologies have solved all the problems in these functionalities. Other rare features were Courses, LCMS, Software, and Tutorships. We attribute this to the sheer cost and complexity of these systems.

We know that this quantitative analysis does not replace a formal method, representing our proposal’s limitation. To remedy this, we plan in the future to explore quantitative tools such as those offered by Likert scales [74], the framework of Szopinski et al. [75], the Fuzzy Comprehensive Evaluation [76], and the Analytic Hierarchy Process [77].

6 Conclusion and future work

We develop an educational chatbot classification using the Nickerson et al. [16] method. Through a series of iterations, we identified three dimensions: School Service-Oriented, e-Learning-Oriented, and Student/Teacher-Oriented. Furthermore, the SMS allowed us to cover a significant sample of papers for our proposal, giving us an advantage over other classifications that used ad hoc or trivial methods. In this way, we identify a tendency, as the most prevalent dimension was Student/Teacher-Oriented.

The passive role of chatbots as virtual assistants is being left behind. Conversational systems have taken a more active role in current education thanks to technological advances. Today, some applications help students through immediately graded exercises and receive adequate feedback in case of doubts or errors. On the other hand, teachers also take advantage of these tools to discover their students’ progress, which is essential since chatbots are not intended to replace educators in the classroom but rather are a tool for them to provide more specialized education for their students. Another facet that chatbots can offer is an early identification tool since, based on the system reports, pedagogues and psychologists could identify emerging problems.

As for future work, we plan to create a new classification method that reduces uncertainties, because as we already mentioned, the Nickerson et al. [16] method may have a problematic entry barrier for inexperienced researchers. We found a knowledge gap in this aspect, since a much more precise methodology is needed to deal with collecting, identifying, and organizing works under qualitative and quantitative perspectives.

Footnotes

Acknowledgement

We thank CONACyT (Consejo Nacional de Ciencia y Tecnología) for funding José Fidel Urquiza Yllescas’s doctoral fellowship. Scholarship number: 331560.

The work described in this paper was funded by “Fondo SEP-CINVESTAV de Apoyo a la Investigación (Call 2018).” Number of project 120 titled “Desarrollo de un chatbot inteligente para asistir el proceso de enseñanza/aprendizaje en temas educativos y tecnológicos.”

José Fidel Urquiza-Yllescas thanks IEMS-CDMX for giving him the opportunity to continue his doctoral studies.

References

Shawar

and Atwell

, Chatbots: are they really useful?, Journal for Language Technology and Computational Linguistics 22(1) (2007), 29–49.

Weizenbaum

, ELIZA—a Computer Program for the Study of Natural Language Communication between Man and Machine, Commun ACM 9(1) (1966), 36–45.https://doi.org/10.1145/365153.365168

Baby

C.J.

, Khan

F.A.

and Swathi

J.N.

, Home automation using IoT and a chatbot using natural language processing, in: 2017 Innovations in Power and Advanced Computing Technologies (i-PACT), Vellore, India, 2017, pp. 1–6. https://doi.org/10.1109/IPACT.2017.8245185

Hristidis

, Chatbot Technologies and Challenges, in: 2018 First International Conference on Artificial Intelligence for Industries (AI4I), 2018, pp. 126–126.

Arcand

, Can Chatbots Fully Replace Humans? Not Yet, CRM Magazine 21(6) (2017), 4.

Silva de Souza

G.H.

, Bento Marques

, Siqueira Jardim

, Cesar Lima

, Lopes Junior

and Silveira Ramos

, Brazilian Students’ Expectations Regarding Distance Learning and Remote Classes During the COVID-19 Pandemic., Educational Sciences: Theory & Practice 20(4) (2020), 66–80.

Mikic-Fonte

F.A.

, Burguillo

J.C.

, Rodriguez

D.A.

, Rodriguez

and Llamas

. T-Bot and Q-Bot: A couple of AIMLbased bots for tutoring courses and evaluating students, in: 2008 38th Annual Frontiers in Education Conference, 2008, pp. S3A–7-S3A-12.

Mikic-Fonte

F.A.

, Burguillo

J.C.

, Llamas

, Rodriguez

D.A.

and Rodriguez

. CHARLIE: An AIML-based chatterbot which works as an interface among INES and humans, in: 2009 EAEEIE Annual Conference, 2009, pp. 1–6. https://doi.org/10.1109/EAEEIE.2009.5335493.

Valle-Rosado

, García-García

and López-Martínez

, Desarrollo e implementación de un bot conversacional como apoyo a los estudiantes en su proceso de titulación, in: International Conference on Robotics and Computing, 2013.

10.

Hien

H.T.

, Cuong

P.-N.

, Nam

L.N.H.

, Nhung

H.L.T.K.

and Thang

L.D.

. Intelligent Assistants in Higher-Education Environments: The FIT-EBot, a Chatbot for Administrative and Learning Support, in: Proceedings of the Ninth International Symposium on Information and Communication Technology, SoICT 2018, Association for Computing Machinery, New York, NY, USA, 2018, pp. 69–76. ISBN 9781450365390. https://doi.org/10.1145/3287921.3287937.

11.

Dibitonto

, Leszczynska

, Tazzi

and Medaglia

C.M.

. Chatbot in a Campus Environment: Design of LiSA, a Virtual Assistant to Help Students in Their University Life, in: Human-Computer Interaction. Interaction Technologies, M. Kurosu, ed., Springer International Publishing, Cham, 2018, pp. 103–116. ISBN 978-3-319-91250-9.

12.

Agus Santoso

, Anisa Sri Winarsih

, Mulyanto

, Wilujeng saraswati

, Enggar Sukmana

. Rustad

, Syaifur Rohman

, Nugraha

and Firdausillah

. Dinus Intelligent Assistance (DINA) Chatbot for University Admission Services, in: 2018 International Seminar on Application for Technology of Information and Communication, IEEE, Semarang, Indonesia, 2018, pp. 417–423. https://doi.org/10.1109/ISEMANTIC.2018.8549797.

13.

Paschoal

L.N.

, de Oliveira

M.M.

and Chicon

P.M.M.

. A Chatterbot Sensitive to Student’s Context to Help on Software Engineering Education, in: 2018 XLIV Latin American Computer Conference (CLEI), 2018, pp. 839–848. https://doi.org/10.1109/CLEI.2018.00105

14.

Yang

and Evans

. Opportunities and Challenges in Using AI Chatbots in Higher Education, in: Proceedings of the 2019 3rd International Conference on Education and E-Learning, ICEEL 2019, Association for Computing Machinery, New York, NY, USA, 2019, pp. 79–83. ISBN 9781450372251. https://doi.org/10.1145/3371647.3371659

15.

Gregor

, The nature of theory in information systems, MIS quarterly 30(3) (2006), 611–642.

16.

Nickerson

R.C.

, Varshney

and Muntermann

, A method for taxonomy development and its application in information systems, European Journal of Information Systems 22(3) (2013), 336–359.

17.

Wand

, Monarchi

D.E.

, Parsons

and Woo

C.C.

, Theoretical foundations for conceptual modelling in information systems development, Decision Support Systems 15(4) (1995), 285–304. https://doi.org/10.1016/0167-9236(94)00043-6

18.

Glass

R.L.

and Vessey

, Contemporary applicationdomain taxonomies, IEEE Software 12(4) (1995), 63–76. https://doi.org/10.1109/52.391837

19.

McKnight

D.H.

and Chervany

N.L.

, What Trust Means in E-Commerce Customer Relationships: An Interdisciplinary Conceptual Typology, International Journal of Electronic Commerce 6(2) (2001), 35–59. https://doi.org/10.1080/10864415.2001.11044235

20.

Sabherwal

and King

W.R.

, An Empirical Taxonomy of the Decision-Making Processes concerning Strategic Applications of Information Systems, Journal of Management Information Systems 11(4) (1995), 177–214.

21.

Petersen

, Feldt

, Mujtaba

and Mattsson

, Systematic Mapping Studies in Software Engineering, in: Proceedings of the 12th International Conference on Evaluation and Assessment in Software Engineering, EASE’08, BCS Learning and Development Ltd., Swindon, GBR, 2008, pp. 68–77.

22.

Chen

, Liu

, Yin

and Tang

, A Survey on Dialogue Systems: Recent Advances and New Frontiers, SIGKDD Explor Newsl 19(2) (2017), 25–35. 10.1145/3166054.3166058.

23.

Nuruzzaman

and Hussain

O.K.

, A Survey on Chatbot Implementation in Customer Service Industry through Deep Neural Networks, in: 2018 IEEE 15th International Conference on e-Business Engineering (ICEBE), 2018, pp. 54–61. 10.1109/ICEBE.2018.00019.

24.

Gnewuch

, Morana

and Maedche

, Towards Designing Cooperative and Social Conversational Agents for Customer Service, in: Proceedings of the 38th International Conference on Information Systems (ICIS), Seoul, ROK, December 10-13, 2017. Research-in-Progress Papers., AIS eLibrary (AISeL), 2017.

25.

Ramesh

, Ravishankaran

, Joshi

and Chandrasekaran

, A Survey of Design Techniques for Conversational Agents, in: Information, Communication and Computing Technology, S. Kaushik, D. Gupta, L. Kharb and D. Chahal, eds, Springer Singapore, Singapore, 2017, pp. 336–350. ISBN 978-981-10-6544-6.

26.

Diederich

, Brendel

and Kolbe

, Towards a Taxonomy of Platforms for Conversational Agent Design, in: Wirtschaftsinformatik, 2019.

27.

Nimavat

and Champaneria

, Chatbots: An overview. Types, Architecture, Tools and Future Possibilities, Interntaional Journal for Scientific Research & Development (2017), pp. 1019–1024.

28.

Hussain

, Ameri Sianaki

and Ababneh

, A Survey on Conversational Agents/Chatbots Classification and Design Techniques, in:Web, Artificial Intelligence and Network Applications, L. Barolli, M. Takizawa, F. Xhafa and T. Enokido, eds, Springer International Publishing, Cham, 2019, pp. 946–956. ISBN 978-3-030-15035-8.

29.

Adamopoulou

and Moussiades

, Chatbots: History, technology, and applications, Machine Learning with Applications 2 (2020), 100006. https://doi.org/10.1016/j.mlwa.2020.100006

30.

Quiroga Pérez

, Daradoumis

and Puig

J.M.M.

, Rediscovering the use of chatbots in education: A systematic literature review, Computer Applications in Engineering Education 28(6) (2020), 1549–1565.

31.

Feine

, Morana

and Maedche

, Leveraging Machine-Executable Descriptive Knowledge in Design Science Research –The Case of Designing Socially-Adaptive Chat-bots, in: Extending the Boundaries of Design Science Theory and Practice, B. Tulu, S. Djamasbi and G. Leroy, eds, Springer International Publishing, Cham, 2019, pp. 76–91. ISBN 978-3-030-19504-5.

32.

Janssen

, Rodríguez Cardona

and Breitner

M.H.

, More than FAQ! Chatbot Taxonomy for Business-to-Business Customer Services, in: Chatbot Research and Design, A. Følstad, T. Araujo, S. Papadopoulos, E.L.-C. Law, E. Luger, M. Goodwin and P.B. Brandtzaeg, eds, Springer International Publishing, Cham, 2021, pp. 175–189. ISBN 978-3-030-68288-0.

33.

Janssen

, Passlick

, Cardona

and Breitner

, VirtualAssistance in Any Context - A Taxonomy of Design Elements for Domain-Specific Chatbots, Business & Information Systems Engineering 62 (2020), 211–225. 10.1007/s12599-020-00644-1.

34.

Bittner

, Oeste-Reiß

and Leimeister

J.M.

, Where is the Bot in our Team? Toward a Taxonomy of Design Option Combinations for Conversational Agents in Collaborative Work, in: HICSS, 2019. https://doi.org/10.24251/HICSS.2019.035

35.

Kitchenham

B.A.

, Guidelines for performing Systematic Literature Reviews in software engineering, Technical Report, Vol. EBSE 2007-001, 2007.

36.

Bigham

J.P.

, Aller

M.B.

, Brudvik

J.T.

, Leung

J.O.

, Yazzolino

L.A.

and Ladner

R.E.

, Inspiring Blind High School Students to Pursue Computer Science with Instant Messaging Chatbots, SIGCSE Bull 40(1) (2008), 449–453. https://doi.org/10.1145/1352322.1352287

37.

Mikic-Fonte

F.A.

, Rial

J.C.B.

, Llamas-Nistal

and Hermida

D.F.

, Using semantics in INES, an Intelligent Educational System, in: 2009 39th IEEE Frontiers in Education Conference, (2009), pp. 1–6. ISSN 2377-634X. https://doi.org/10.1109/FIE.2009.5350539

38.

Bhargava

and Maheshwari

, An Intelligent Speech Recognition System for Education System, 2009.

39.

Niranjan

, Saipreethy

M.S.

and Kumar

T.G.

, An intelligent question answering conversational agent using Naïve Bayesian classifier, in: 2012 IEEE International Conference on Technology Enhanced Education (ICTEE), 2012, pp. 1–5.

40.

Gómez Róspide

and Puente Águeda

, Agente Virtual Inteligente Aplicado a un Entorno Educativo, Revista Pensamiento Matemático 2 (2012).

41.

Benotti

, Martínez

M.C.

and Schapachnik

, Engaging High School Students Using Chatbots, in: Proceedings of the 2014 Conference on Innovation and Technology in Computer Science Education, ITiCSE’14, Association for Computing Machinery, New York, NY, USA, 2014, pp. 63–68. ISBN 9781450328333. https://doi.org/10.1145/2591708.2591728

42.

Mikic-Fonte

F.A.

, Nistal

M.L.

, Rial

J.C.B.

and Rodríguez

M.C.

, NLAST: A natural language assistant for students, in: 2016 IEEE Global Engineering Education Conference (EDUCON), 2016, pp. 709–713. ISSN 2165-9567. https://doi.org/10.1109/EDUCON.2016.7474628

43.

Troussas

, Krouska

and Virvou

, Integrating an Adjusted Conversational Agent into a Mobile-Assisted Language Learning Application, in: 2017 IEEE 29th International Conference on Tools with Artificial Intelligence (ICTAI), (2017), pp. 1153–1157.

44.

Song

, Oh

E.Y.

and Rice

, Interacting with a conversational agent system for educational purposes in online courses, in: 2017 10th International Conference on Human System Interactions (HSI), (2017), pp. 78–82.

45.

Bala

, Kumar

, Hulawale

and Pandita

, Chat-Bot For College Management System Using A.I, International Research Journal of Engineering and Technology (IRJET) (2017), 2395–0056.

46.

Dutta

, Developing an Intelligent Chat-bot Tool to Assist High School Students for Learning General Knowledge Subjects, Technical Report, Georgia Institute of Technology, 2017.

47.

Ranoliya

B.R.

, Raghuwanshi

and Singh

, Chatbot for university related FAQs, in: 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), Udupi, India, 2017, pp. 1525–1530. https://doi.org/10.1109/ICACCI.2017.8126057

48.

Chun Ho

, Lee

H.L.

, Lo

W.K.

and Lui

K.F.A.

, Developing a Chatbot for College Student Programme Advisement, in: 2018 International Symposium on Educational Technology (ISET), 2018, pp. 52–56.

49.

Verleger

and Pembridge

, A Pilot Study Integrating an AI-driven Chatbot in an Introductory Programming Course, in: 2018 IEEE Frontiers in Education Conference (FIE), (2018), pp. 1–4. ISSN 1539-4565. https://doi.org/10.1109/FIE.2018.8659282

50.

Pham

X.L.

, Pham

, Nguyen

Q.M.

, Nguyen

T.H.

and Cao

T.T.H.

, Chatbot as an Intelligent Personal Assistant for Mobile Language Learning, in: Proceedings of the 2018 2nd International Conference on Education and E-Learning, ICEEL 2018, Association for Computing Machinery, New York, NY, USA, 2018, pp. 16–21. ISBN 9781450365772. https://doi.org/10.1145/3291078.3291115

51.

Clarizia

, Colace

, Lombardi

and Pascale

, Santaniello

, Chatbot: An education support system for student, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 11161 LNCS (2018), 291–302. https://doi.org/10.1007/978-3-030-01689-0_23

52.

Mulyana

, Hakimi

and Hendrawan , Bringing Automation to the Classroom:AChatOps-Based Approach, in: 2018 4th International Conference on Wireless and Telematics (ICWT), 2018, pp. 1–6.

53.

Benotti

, Martnez

M.C.

and Schapachnik

, A Tool for Introducing Computer Science with Automatic Formative Assessment, IEEE Transactions on Learning Technologies 11(2) (2018), 179–192.

54.

Mikic-Fonte

F.A.

, Nistal

M.L.

and Rodríguez

M.C.

, Using a Chatterbot as a FAQ Assistant in a Course about Computers Architecture, in: 2018 IEEE Frontiers in Education Conference (FIE), (2018), pp. 1–4. ISSN 1539-4565. https://doi.org/10.1109/FIE.2018.8659174

55.

Heo

and Lee

, CiSA: An Inclusive Chatbot Service for International Students and Academics, in: HCI International 2019 –Late Breaking Papers, C. Stephanidis, ed., Springer International Publishing, Cham, (2019), pp. 153–167. ISBN 978-3-030-30033-3.

56.

Reyes

, Garza

, Garrido

, De la Cueva

and Ramirez

, Methodology for the Implementation of Virtual Assistants for Education Using Google Dialogflow, in: Advances in Soft Computing, L. Martínez-Villaseñor, I. Batyrshin and A. Marín-Hernández, eds, Springer International Publishing, Cham, (2019), pp. 440–451. ISBN 978-3-030-33749-0.

57.

Ondáš

and Hládek

, How chatbots can be involved in the education process, in: 2019 17th International Conference on Emerging eLearning Technologies and Applications (ICETA), (2019), pp. 575–580.

58.

Lee

, Jo

, Kim

and Kang

, Can Chatbots Help Reduce the Workload of Administrative Officers? - Implementing and Deploying FAQChatbot Service in a University, in: HCI International 2019 - Posters, C. Stephanidis, ed., Springer International Publishing, Cham, (2019), pp. 348–354. ISBN 978-3-030-23522-2.

59.

Göschlberger

, Brandstetter

, Conversational AI for Corporate E-Learning, in: Proceedings of the 21st International Conference on Information Integration and Web-Based Applications and Services, iiWAS2019, ssociation for Computing Machinery, New York, NY, USA, 2019, pp. 674–678. ISBN 9781450371797. https://doi.org/10.1145/3366030.3366115

60.

Oliveira , Espíndola

D.B.

, Barwaldt

, Ribeiro

L.M.

and Pias

, IBM Watson Application as FAQ Assistant about Moodle, in: 2019 IEEE Frontiers in Education Conference (FIE), (2019), pp. 1–8.

61.

Pereira

, Fernández-Raga

, Osuna-Acedo

, Roura-Redondo

, Almazán-López

and Buldón-Olalla

, Promoting Learners’ Voice Productions Using Chatbots as a Tool for Improving the Learning Process in a MOOC., Technology, Knowledge and Learning 24(4) (2019), 545–565.

62.

Kowsher

, Tithi

F.S.

, Ashraful Alam

, Huda

M.N.

, Md Moheuddin

and Rosul

M.G.

, Doly: Bengali Chatbot for Bengali Education, in: 2019 1st International Conference on Advances in Science, Engineering and Robotics Technology (ICASERT), (2019), pp. 1–6.

63.

Rafael

M.S.

, María

T.B.L.

, Antonio

F.U.

and Hanns

D.L.F.M.

, Support to the learning of the Chilean tax system using artificial intelligence through a chatbot, in: 2019 38th International Conference of the Chilean Computer Science Society (SCCC), (2019), pp. 1–8.

64.

Ruan

, Jiang

, Xu

, Tham

B.J.-K.

, Qiu

, Zhu

, Murnane

E.L.

, Brunskill

and Landay

J.A.

, QuizBot: A Dialogue-Based Adaptive Learning System for Factual Knowledge, in: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, CHI’19, Association for Computing Machinery, New York, NY, USA, 2019, pp. 1–13. ISBN 9781450359702. https://doi.org/10.1145/3290605.3300587

65.

Nguyen

H.D.

, Pham

V.T.

, Tran

D.A.

and Le

T.T.

, Intelligent tutoring chatbot for solving mathematical problems in Highschool, in: 2019 11th International Conference on Knowledge and Systems Engineering (KSE), 2019, pp. 1–6. ISSN 2164-2508. https://doi.org/10.1109/KSE.2019.8919396

66.

Ismail

and Ade-Ibijola

, Lecturer’s Apprentice: A Chatbot for Assisting Novice Programmers, in: 2019 International Multidisciplinary Information Technology and Engineering Conference (IMITEC), (2019), pp. 1–8.

67.

Sreelakshmi

A.S.

, Abhinaya

S.B.

, Nair

and Jaya Nirmala,

, A Question Answering and Quiz Generation Chatbot for Education, in: 2019 Grace Hopper Celebration India (GHCI), 2019, pp. 1–6.

68.

Pereira

and Barcina

M.A.

, A Chatbot Assistant for Writing Good Quality Technical Reports, in: Proceedings of the Seventh International Conference on Technological Ecosystems for Enhancing Multiculturality, TEEM’19, Association for Computing Machinery, New York, NY, USA, 2019, pp. 59–64. ISBN 9781450371919. https://doi.org/10.1145/3362789.3362798

69.

Zahour

, Benlahmar

E.H.

, Eddaoui

, Ouchra

and Hourrane

, Asystem for educational and vocational guidance in Morocco: Chatbot E-Orientation, Procedia Computer Science 175 (2020), 554–559, The 17th International Conference on Mobile Systems and Pervasive Computing (MobiSPC), The 15th International Conference on Future Networks and Communications (FNC), The 10th International Conference on Sustainable Energy Information Technology. https://doi.org/10.1016/j.procs.2020.07.079

70.

Cordero

, Toledo

, Guamán

and Barba-Guamán

, Use of chatbots for user service in higher education institutions, in: 2020 15th Iberian Conference on Information Systems and Technologies (CISTI), 2020, pp. 1–6. https://doi.org/10.23919/CISTI49556.2020.9141108

71.

Lee

L.-K.

, Fung

Y.-C.

, Pun

Y.-W.

, Wong

K.-K.

, Yu

M.T.-Y.

and Wu

N.-I.

, Using a Multiplatform Chatbot as an Online Tutor in a University Course, in: 2020 International Symposium on Educational Technology (ISET), 2020, pp. 53–56. 10.1109/ISET49818.2020.00021.

72.

Mekni

, Baani

and Sulieman

, A Smart Virtual Assistant for Students, in: Proceedings of the 3rd International Conference on Applications of Intelligent Systems, APPIS 2020, Association for Computing Machinery, New York, NY, USA, 2020. ISBN 9781450376303. https://doi.org/10.1145/3378184.3378199

73.

Nias

and Ruffin

, CultureBot: A Culturally Relevant Humanoid Robotic Dialogue Agent, in: Proceedings of the 2020 ACM Southeast Conference, ACM SE’20, Association for Computing Machinery, New York, NY, USA, 2020, pp. 280–283. ISBN 9781450371056. https://doi.org/10.1145/3374135.3385306

74.

Gil

M.A.

, González-Rodríguez

, Fuzzy vs. Likert Scale in Statistics, 2012, pp. 407–420. ISBN 978-3-642-24665-4. https://doi.org/10.1007/978-3-642-24666-1_27

75.

Szopinski

, Schoormann

and Kundisch

, Because your taxonomy is worth it: Towards a framework for taxonomy evaluation, in: ECIS, 2019.

76.

Zhang

and Feng

, Application of fuzzy comprehensive evaluation to evaluate the effect of water flooding development, Journal of Petroleum Exploration and Production Technology 8(4) (2018), 1455–1463.

77.

Saaty

R.W.

, The analytic hierarchy process—what it is and how it is used, Mathematical Modelling 9(3) (1987), 161–176. https://doi.org/10.1016/0270-0255(87)90473-8