Abstract
Autonomy is crucial in cooperation. The complexity of HRI scenarios requires autonomous robots able to exploit their superhuman computations (based on DNN, Machine Learning techniques and Big Data) in a trustworthy way. Trustworthiness is not only a matter of accuracy, privacy or security, but it is becoming more and more a matter of adaptation to humans agency. As claimed by Falcone and Castelfranchi, autonomy means the possibility of dislaying or providing an unexpected behavior (including refusal) that departs from a requested (agreed upon or not) behavior. In this sense, the autonomy to decide how to adopt a task delegated by the user, with respect to her/his own real needs and goals, distinguishes intelligent and trustworthy robots from highly performing robots. This kind of smart help can be provided only by cognitive robots able to represent and ascribe mental states (beliefs, goals, intentions, desires etc.) to their interlocutors. The mental states attribution can be the result of complex reasoning mechanisms or can be fast and automatic, based on scripts, roles, categories or stereotypes typically exploited by humans every time they interact in everyday life. In all these cases, robots that build and use cognitive models of humans (that have a Theory of Mind of their interlocutors), have to operate also a meta-evaluation of their own predictive skills to build those models. Robots have to be endowed with the capability to self-trust their skills to interpret the interlocutors and the context, for producing smart and effective decisions towards humans.
After exploring the main concepts that make collaboration between humans and robots trustworthy and effective, we present the first of a series of experiments draw for testing different aspects of a designed cognitive architecture for trustworthy HRI. This architecture, based on consolidated theoretical principles (theory of social adjustable autonomy, theory of mind, theory of trust) has the main goal to build cognitive robots that provide smart, trustworthy collaboration, every time a human requires their help. In particular, the experiment has been designed in order to demonstrate how the robot’s capability to learn its own level of self-trust on its predictive abilities in perceiving the user and building a model of her/him, allows it to establish a trustworthy collaboration and to maintain a high level of user’s satisfaction, with respect to the robot’s performance, also when these abilities progressively degrade.
Introduction
In today’s world, artificial intelligence systems are playing a crucial role in our daily lives. The decisions made by machines are leaving a profound impact on our society and are involving almost every aspect of our life. Different kinds of artificial systems, whose behaviours are based on statistical tools, AI algorithms, machine learning models are used in applications such as healthcare, government, business, judicial and political spheres. Decisions made by AI systems lead to beat some of the best human player [26], to make super accurate medical diagnostics [13], to help companies in customers support [16] and so on. These decisions are more oriented to superhuman computations and performances, than brain-inspired or psychological paradigms. With the enormous impact that AI systems have on society, it is crucial to ensure that all these systems we are relying on are trustworthy. Trustworthy AI is largely considered one of the topics much more demanding in the artificial intelligence field, not only in research, but also in institutions [8, 27], due to the huge impact that AI systems are having in society.
As mentioned above, AI moved from human psychology inspired models (i.e. decision trees in expert systems) to deep neural networks, machine learning, Big Data and so on. If this type of approach proved to be very powerful in computational and performance terms, it increased the gap between super intelligent agents and humans, in terms of trustworthy cooperation between humans and artificial systems. We do not consider just the cases in which results provided by artificial systems have been extremely dangerous for humans [20, 25] (trustworthiness as accuracy, robustness, privacy, security etc.); we focus on those dimensions of trustworthiness (e.g. adaptation to human autonomy) that are involved when humans and artificial systems, in particular robots [11], have to interact [9, 10] and cooperate [7] with each other, and humans have to establish a deep relationship of trust [18, 22] every time they have the necessity to include robots as part of their plans or goals (task delegation and adoption [5]). Trust is not just the result of the frequency with which an agent produces the desired behavior or result; trust is a much more complex attitude, including a causal attribution, an estimation, an ascription of several internal factors that play a causal role in the activation and control of the behavior; trust is the counterpart of delegation [6].
In this work we propose a preliminary, simulated experiment, designed with the goal to test a cognitive architecture [4] for trustworthy human robot collaboration. The architecture allows a BDI [24] robot to integrate different cognitive skills, with the goal to provide a smart help every time a human user delegates to it a task to achieve in her/his place.
Cooperation and task delegation
Cooperation is based on different and complementary kinds of attitudes and reasons from the partners involved. Let’s consider the following collaborative scenario (Fig. 1): a human X (the trustor) and a robot Y (the trustee) collaborate so that X has to trust Y, in a specific context, for executing a task τ (referred to an action (α), to its resulting world state (g), or both) and realizing the results that include or correspond to the X’s Goal X (g) = g X [6].

Collaborative scenario based on delegation and adoption attitudes.
In this context, X relies on Y for realizing some part of the task she/he has in mind (task delegation); on its side, Y decides to help X, to replace her/him and perform a sequence of actions that are included in the X’s plan, in order to achieve some of her/his goals or sub-goals (task adoption). The capability to implement a smart task adoption distinguishes a collaborator from a simple tool, and presupposes intelligence and autonomy [12]. Being truly cooperative implies more than the simple concept of execution of a prescribed action. For example, in order to adopt some goal of X in an intelligent form, Y has to understand the X’s mental states (i.e. goals, beliefs, expectations about Y’s behavior) and it has to adjust the delegated action to the represented mental states, to the context and to its own current abilities and characteristics. In their much complex sense, cooperation and help require more autonomy and initiative. A real collaborative trustee should provide to the trustor different kind of help, according to [5] (Fig. 2):

Levels of Adoption.
Sub help: Y satisfies a sub-part of the delegated world-state (so satisfying just a sub-goal of X), Literal help: Y adopts exactly what has been delegated by X, Over help: Y goes beyond what has been delegated by X without changing X’s plan (but including it within a hierarchically superior plan), Critical-Over help: Y realizes an over help and in addition also modifies the original plan/action (included in the new meta-plan), Critical help: Y satisfies the relevant results of the requested plan/action (the goal), but modifies that plan/action, Critical-Sub help: Y realizes a sub help and in addition modifies the (sub) plan/action, Hyper-critical help: Y adopts goals or interests of X that X itself did not take into account (at least, in that specific interaction with Y): by doing so, Y neither performs the specific delegated action/plan nor satisfies the results that were delegated. In practice, Y satisfies other goals/interests of X by realizing a new plan/action.
Y has to exploit its autonomy, competence and cognitive skills to find the better or a possible solution for X’s goal. This not necessarily should require a negotiation, discussion, agreement; it might be an initiative of Y by expecting that X will understand why. This is precisely what intelligent robots must have and these are the kind of partners the humans need.
How would this advanced form of cooperation would be possible? What are some of the capabilities that a robot has to show for enhancing trust in its human interlocutor? A smart and trust-based collaboration between humans and intelligent robots requires, among many others things, complex cognitive capabilities these artificial systems must be endowed with: mental attribution, adjustable autonomy, user profiling and user behavior adaptation, behavior transparency. Besides the capabilities to evaluate the interlocutor and/or the contextual physical environment, a robot (as a trustee) should be able also to operate a meta-evaluation: how much itself would be able to interpret and produce the evaluations regarding the trustor? How much is reliable its capability to perceive or infer the trustor’s features? On the basis of its own capabilities to perceive or to act in the world, the hypothesis or prediction it has made, the chosen course of action, are the best or the most effective, with respect to the needs, the features and the mental states of the interlocutor? Smart help has to be based on different capabilities to interpret the environment and the interacting user, but first of all, it has to be based on the robot’s capability to realistically self-assess the level of trustworthiness on its ability to interpret the collaborative and potentially uncertain context, including the interacting user [15, 17]. The outcome of the meta-evaluation expressed above represents the robot’s self-trust for adopting a delegated task. In practice, the robot uses this evaluation of its own specific abilities as a filter for their use with respect to the interlocutors with whom it is interacting. The robot learns the trustworthiness of its skills and, on the basis of the context and the task to carry out, establishes which skills to use and how trustworthy (from its point of view) will be the solution it will propose to its interlocutor. So robot’s self-trust can be viewed as a precondition for exploiting the robot’s interpretative skills accordingly to its own interlocutor, in order to foster a true and deep relationship of collaboration and trust with her/him.
A form of intelligent help that requires autonomy adaptation in the task adoption implies risks. One of the possible consequences of this form of help can be the emergence of collaborative conflicts between the human (the trustor) and the robot (the trustee) that adopts the task, due to the robot’s willingness to collaborate and to help the user better and more deeply than required. Sometimes, the difference between the results of the adopted task provided by the robot and the user’s expectations, could lead the interlocutor to a complete lack of trust towards the robot. We are not just considering the robot’s failure in the precise delegated task: failures become more evident every time the robot goes beyond the delegated task and the results are too much distant (or even in conflict) from the user’s expectations. Among humans these conflicts can be mitigated by the experience: humans learn to measure their competence in achieving specific results, or making the right prediction about the correctness of a chosen behaviour, on the basis of the context and the interlocutor; furthermore, on this basis, they learn to self-trust their own abilities/skills (with respect to both the interlocutors and the tasks). Similarly, robots can learn to trust their capabilities to evaluate the interlocutors (and consequently to build and use the cognitive models they attribute to them) through a repetitive interactions with humans. For example, a robot can exploit the feedback provided by its interlocutor any time she/he delegates to it a task and receive an evaluation (i.e. user’s satisfaction) on the results of the robot’s adoption process.
A cognitive architecture for trustworthy HRI
The cognitive architecture, shown in Fig. 3, underpins the principles of the Procedural Reasoning System (PRS) [14], one of the widely used architectures that explicitly embodies beliefs, desires, intentions. In this Section we focus on the main modules of the framework (i.e. reasoning module describing the task adoption process), with particular attention to the module that allows the robot to evaluate its level of self-trust.

The cognitive architecture overview.
The mental states of the robot are stored in the Robot’s Beliefs Base (BB). The database contains beliefs referring to: the current state of the environment (excluding the interacting user), its own physical internal state, i.e. the robot’s battery, the temperature of its joints and so on, the beliefs, goals, plans the robot attributes to the interlocutor on the basis of its ability to have a theory of mind of the user herself/himself, generic beliefs representing the knowledge of the robot.
In our work the concept of belief is extended with respect to the traditional definition, typically intended as information encoding the state of the environment; here we consider belief any attribution that the robot is able to ascribe to itself or to other agents (i.e. a plan is a belief that the robot attributes to itself or to the user involved in the interaction). The formalization of any belief included in the robot’s BB is based on the principles of first-order logic (FOL).
Reasoning
The robot’s reasoning process starts once the user delegates to it a task τ. The first operation done by the robot is implemented by Human Profiling module, which allows the robot to create a user’s profile. When the robot begins the interaction, it collects several perceptual information that can be processed into different kind of human features. Depending on the domain, the robot can map facial expressions, conversational information, symbolic gestures, to human features, physical or social. The features can be exploited for categorizing the current interlocutor (see Section 5 for user profiling and classification). In addition to classify the user, based on the operations described above, the robot evaluates the task and the constraints associated with it. The same task can have different execution strategies, on the basis of either the preconditions due to the current state of the environment, or the constraints corresponding to the profile attributed to the delegating agent by the robot. The Task Constraints Analysis module provides the robot the capability to analyze which user’s features, mapped into the BB, are necessary to select the right strategy for adopting the task. At this stage of the reasoning process, the robot has collected in its own BB: beliefs encoding the task’s profile, beliefs encoding the user’s profile, beliefs encoding the environment state.
The next modules exploit the cognitive ingredients mentioned above, in order to deliberate upon the state of affairs it wants to achieve and decides how to using the available plans in BB for achieving the deliberated intention. It is important to underline that the intention it commits to achieve corresponds/contains the delegated task. First of all, the robot sets the level of help with which the task has to be adopted. We decide to design a robot that, every time receives a delegated task by its interlocutor, tries to provide a much more complex plan/goal than the delegated one (the delegated task is still part of the complex goal/plan). The robot can go beyond the task delegated and achieve goals that the user never expects, but that can be part of a complex plan she/he has in mind and of which the delegation is only a sub-task (over-help).
A crucial role is played by a plan library stored in the robot’s BB, containing several plans for achieving goals. Plans are represented as complex hierarchical goals that decompose into other sub-goals g i or actions α i . This results in a graph representation in which edges denote plan decomposition and root nodes in the graph correspond to goals or elementary/complex actions. This representation allows the robot to represent in its BB the goals attributed to the user. Every plan attributed to the user is classified on the basis of how much is adapt to specific human features. Due to the hierarchical representation of plans in the robot’s BB, we can say that every plan allows to achieves goals or sub-goals that are more suitable for specific users than for others. We define active goals as those goals that the user, identifiable through her/his particular features, did not delegate, but that might represent state of affairs that she/he might to achieve in the future, or after achieved the task delegated. The Active Goals Plans Selection module allows the robot to sort the plans with active goals on the basis of the larger number of human features matching to the features of the perceived user’s profile. By means of the Task Recognition module the robot explores plans with active goals and recognizes which plans contain the delegated task. Then, according to the defined level of help, the robot decides to select those relevant plans that achieve the delegated task, or to select those one that achieve much complex tasks, of which the delegated task represents a sub-task. The outcome of the module is the task the robot intends to achieve. Both the active goals plans selection and task recognition modules allow the robot to exploit the different levels of task adoption described in Section 2 in order to find the most suitable task it can adopt, after the delegation required by its interlocutor. For example, if the robot cannot find a suitable plan including much more complex active goals, it can perform a literal task adoption; vice versa, if a much more complex plan is found, it will try to perform over-help; finally, if it decides to perform a literal task adoption, but some plan’s precondition are not satisfied, it will perform just a sub task (sub-help), belonging to the delegated one. In this way, the robot exploits its expertise (provided by a rich plan library) and adjusts its own autonomy to carry out the task, in order to find the best level of help as the compromise between the needs of the user and the environmental conditions.
When the goal recognition phase has been implemented, the selected task might need a final specification. The Task Specification module allows the agent to fill any attribute associated to the task but not explicitly specified by the user. In particular, the robot searches in its own BB, information necessary for completely specify the task, on the basis of the user’s profile already defined.
Building self-trust module
The intelligent help supported by the cognitive architecture we described in this work, allows a robot to improve its role with respect to the human collaborator: from a passive executor system to an intelligent collaborator, able to adapt its autonomy and decisions with respect to the expectations or prescriptions provided by its interlocutor.
The Building self-trust module in Fig. 3 fosters a robot to exploit the feedback provided by the interacting user, in order to learn a method to self-evaluate the trustworthiness of its skills in building her/his profile. The strategy that the robot uses for learning to self-evaluate the trustworthiness of its profiling skills is based on a training phase that requires interaction between the robot and m different users. Every user delegates to the robot the same task; for its part, the robot adopts the task at a literal level of help. At every interaction the robot computes multiple robot’s skill trustworthiness values, each for every feature that forms the user’s profile. These values depend on the feedback provided by the users during the training phase. At the end of the training phase, the robot is able to associate a final value of trustworthiness to the corresponding capability to recognize the features collected in the user’s profile and maps these information, under the form of beliefs, into its own Beliefs Base. On the basis of the learned values, the robot modulates the entire task selection process represented in Fig. 3, by leveraging on those skills that maximize the user’s task performance satisfaction. In the next session we will provide much more details about the implementation of the building self-trust process.
The proposed experiment
The experiment designed for testing the proposed cognitive architecture, has been implemented by exploiting the well known multi-agent oriented programming (MAOP) framework JaCaMo [1], which integrates three different multi-agent programming levels: agent-oriented (AOP), environment-oriented (EOP) and organization-oriented programming (OOP). Basically, the experiment simulates the process of task delegation and task adoption between a robot and multiple users, grouped in classes of users, in a specific application domain.
The experimental settings
We figured the following interactive scenario: the robot is a touristic assistant that helps people to organize different touristic activities offered by a city (i.e. eat in a restaurant, visit a museum, visit a monument, drink something in a bar, enjoy the city doing multiple daily activities). The experiment is based on the interaction between two agents: the user and the robot. Both of them are implemented as Jason [2] agents. The user has her/his own mental states represented in form of beliefs, goals and plans and interacts with the robot by delegating to it a task. On its side, the robot is able to represent and attribute mental states to the user and to itself and, on the basis of its capabilities to profile the user and build a model of her/him, to adopt the delegated task at different levels of help.
The experiment has been designed with the goal to show the importance for a robot to self estimate the level of trustworthiness associated to its expertise in building a profile of the interacting user. This capability lets the robot choose the best and suitable task to adopt with respect to the user’s features, also when its skills progressively degrade and can be considered not trustworthy. Indeed, the robot is able to sort these skills on the basis of the corresponding level of trustworthiness, and leverage on the most trustworthy among them for deciding how to adopt the task delegated. As mentioned above, two agents populate the simulation: the agent robot
Map of the relations between features, sub-components and intervals
Map of the relations between features, sub-components and intervals
Users are organized into classes of populations: each class collects together users with the same profile (in terms of sub-components). Each user of a class distinguishes from the others due to five real values r
H
i
for i = 1, . . , 5 randomly picked up from the interval associated to the sub-components. The decision making system of
It is important to specify that the robot’s beliefs are organized according to the features that are classified within
The experiment is a simulation of several trials – interactions between
Building robot’s self-trust
The robot
The user’s satisfaction function
We have introduced a user’s satisfaction function
The experiment’s phases
The experiment is structured as follows: the robot implements a first trial with a population of class the robot implements a second trial with the same population of class the robot starts a training phase with a new population of class the robot starts a second training phase with a new population of class the trials described at points 1 and 2 are repeated, but this time the robot exploits what it has learned respectively in the context described at the point 3 and 4, in order to achieve the task adoption process.
Results
In this Section we present the results of the experiment designed in order to address the research purpose previously defined: demonstrate how building robot’s self-trust is a precondition for providing smart and trustworthy collaboration, every time a user requires the robot’s help. The results show a behavior of the robot such that it is always able to find a complex plan that includes the delegated task and other active goals suitable with the user’s profile; therefore it is able to complete both the delegated task (literal help) and tasks not explicitly expected by the user, but which could potentially be suitable with her/his profile (over-help).
The plots shown in Fig. 4 compares the results obtained after the execution of each experiment’s phase described in Section 5.5.

Figure 4a and 4b show the trend of the curves representing the user’s satisfaction, obtained after each phase described in Section 5.5 each plot represents the trend of the component P τ (light red line) and the trend of S U (dark red line) as combination of P τ and S U plus . Figure 4c shows a statistical description of the impact of the self-trust building process in the level of user’s satisfaction on the robot’s smart collaboration.
Let’s start by describing the Fig. 4a This plots refer to the case when the robot’s capability to recognize the feature category is inaccurate, while are accurate the capability to recognize the remaining features collected in
Figure 4b displays the trends of the user’s satisfaction function
Finally, Fig. 4c shows the box plots comparing the distributional characteristics of
Cooperation is one of the main social activities exploited by humans for gaining resources, in terms of goals achieved, shared knowledge and so on. The increasing intelligent technology surrounding us is becoming crucial for our own social development, and, as a consequence, the need of trusting these supporting and sophisticated tools is becoming every day more stringent. But, if on the one hand these systems are becoming more intelligent and sophisticated, on the other hand they show a strong lack in the ability to collaborate effectively with humans. Despite the complexity of the problem they can solve, they continue to have just a passive supporting role in the collaboration with humans. For being not only executive tools, these intelligent systems (i.e. robots, chat-bots, autonomous cars and so on) should expose the capability to behave in a critical way with respect to the needs/goals of their interacting users. Indeed, the collaboration becomes deep and effective when a system is able to provide not declared, unexpected results but compatible with the context, the needs of the user and the capabilities of the system itself. The level of autonomy of robots or other artificial agents, it should be such that such systems can exercise a certain level of discretion in achieving the task delegated but humans. But, in order to foster trust in humans, they should behave having the capability to create a complex theory of mind of the interlocutors and a strong capability to self assess their own capability to carry out a task, also at a different level of help than required.
In this work we have presented the first of a series of experiments drawn for testing different aspects of a designed cognitive architecture. This architecture, based on consolidated theoretical principles (theory of adoption and delegation, theory of mind, theory of social adjustable autonomy, theory of trust) has the main goal to build robots that provide smart, trustworthy collaboration, every time a human requires their help. With this experiment we wanted to test the robustness of the designed architecture to rely on the robot’s ability to learn the limits in interpreting the needs of its interlocutor, by measuring the trustworthiness of its predictive abilities. In fact, the architecture gives to a robot the capability to profile the user and to leverage on its profiling skills in an adaptive manner, by exploiting those skills that maximize the user’s task performance evaluation; it allows the robot to reason about the mental states of the user (beliefs, goals, plans and intentions) and makes it capable to modulate its autonomy for achieving the delegated task. One of the main problems in intelligent collaboration between humans is the possibility of misunderstandings that can lead to conflicts between cooperators. We call these collaborative conflicts, as they are based on the desire to collaborate beyond what is required but in doing this errors and discrepancies can occur.
Just to minimize these conflicts and increase the robot’s trustworthiness, an important requirement to introduce is the capability of the robot itself to self trust his capabilities to build a complex model of the user. The data analyzed have shown how the process to learn the trustworthiness of its own profiling skills can lead the robot to have an effective collaboration, based not only on the actions/tasks prescribed by the user, but especially on the non declared needs and goals of the user himself/herself. Our main future work will be to move the experiment in a real environment, with a real robotic platform and real users. We are already designing real experiments, in order to test the proposed cognitive architecture; for doing that, we exploit the humanoid robot Nao (Fig. 5), widely used in HRI applications. Furthermore, we will continue to provide simple but effective experiments that allow us to investigate different aspects of the concept of intelligent and trustworthy collaboration between robots and humans, that consider robots as cognitive agents able to interact with humans as humans do when they interact with each others.

Real interaction.
