Abstract
Understanding the collaboration between physical systems and human is an essential task in man-machine systems. This area of research has been significantly explored in the recent years with the focus on the machine side. Much less attention has been directed to the other side in the man-machine systems, which is the human. The aim of this review is to discuses the major directions and challenges in man-machine systems from control theory perspective with a focus on human modeling and human attributes in the man-machine system. Four directions has been identified; these are: 1) Understanding the attributes and the limitation of the human operator; 2) Categorizing the human-in-the-loop applications and derive generic bounders for each category; 3) Building a realistic application-based model for the human behavior; and 4) Integrating the human models into a formal control synthesis methodology. We surveyed the contribution in each of the four challenges and pointed out the limitation in the proposed ideas. Particular attention is given to the model identification of the human behavior.
Introduction
Nowadays, humans are interacting with machines in every aspect of their lives, either actively controlling the machine or passively getting served by them. Human-in-the-loop is the term that is often used in the control theory literature to describe the involvement of the human with physical systems as [1–13]. The human-in-the-loop particularly refers to a situation where a system or a machine is controlled, fully or partially by a human. The control is refereed to be active if it was through some control manipulators. Human-in-the-loop can also mean that, the human is monitored or even controlled by a machine, which we refer as passive. In the active human-in-the-loop setup, the human observe the output of the system through, for instance, a display in which he or she can view all the required information to update their control actions or their decisions. This is typical feedback interconnection control of a dynamic system (the plant) and a controller (the human) see Fig. 1. Therefore, the human-in-the-loop can be modeled as an input-output system, similarly as any dynamical system. This has lead to the development of several dynamical models that mimics the human behavior in human-in-the-loop applications.

Human in the loop control system.
The term human-in-the-loop appears also with a different perspective in the field of human-computer interaction (HCI). HCI focuses on the interfaces between users and computer technology.Understanding of human behavior and observing the way in which humans interact with different computer technology platforms and design technologies that allow humans to interact with computers in different ways [14–16].
One important direction of human-in-the-loop research is the study of skill transfer between the skills of human operators and autonomous systems [17–21], where autonomous systems such as robots have the opportunity to learn from the operator. Also, shared control between robots and humans is another human-in-the-loop application, where a rich research questions such as co-adaptation between the human and the robot are raised. Furthermore, the behavioral and social change of humans while working and cooperating with different levels of intelligent systems is another important area where the concept of man-machine systems is presented [22–31].
The term behavior is defined in [15] as a set of temporal changes in neuromuscular and physiological activity that can last from a few milliseconds (a blink) to minutes (talking) or hours (sitting). Examples of this behavioral signals include attitude such as fear, inattention, joy or stress, Also, it can be manipulation such as the actions used to move an objects in the surrounding environment. Understanding the human behavior is done by translating their behavioral signals in particular context into a description of the given behavior.
One way to understand an operator behavior in a cyber-physical system is to analyze his input and output from a control theory and data mining perspective. Finding unknown repeated behaviors or patterns can lead to better understanding of operator’s action in particular situations. Also, mining the repeated behaviors of skilled operators can be used as a benchmark for teaching either other human operators or autonomous systems.
The man-machine systems are strongly connected to automation and therefore, control theory was the primary contributor to this field [32, 33]. Control theory scientists started in the early 60th or even before to build models for human behavior using dynamical systems and control tools. Manual control was the known name at that time, where the human works as an operator that operate the system. Since then, the development of the human models continued to use the automatic control tools. Manual control still exists in highly automatic and intelligent systems such as aircraft and ground vehicles, and a skilled human operator is able to make the process performs at the desired efficiency level.
The term human-in-the-loop, in control theory perspective, simply means a plant or a system controlled fully or partially by a human through some control manipulators such as joystick or control panel. The human can observe the output of the system through a display in which he or she can view all the required information to update their actions. This setting is a typical feedback interconnection of a dynamic system (the plant or the system under control) and a controller (the human operator).
Most of the existing models for the human operator, as we will see later in the paper, treats the human as a physical controller either linear or nonlinear. However, any physical controller is composed of several mechanical components and softwares. These components are able to repeat the same actions under the similar conditions for the system under control for any number of times. In contrast, the human, as an operator or a controller, cannot repeat or reproduce the same exact actions under the same stimulus. This is due to several reasons such as fatigue and stress. In addition, the physical controller can work continuously for long time without any issue, however, a human operator cannot function for more than one regular working shift [34].
Cyber-physical systems, Internet of things, human-robot interaction are modern names that accommodate human-in-the-loop applications. In [35], three main challenges in feedback control with humans in the loop in cyber-physical systems are defined. They are: (i) the need for a comprehensive understanding of the complete spectrum of the types of human-in-the-loop controls, (ii) the need for extensions to system identification or other techniques to derive models of human behaviors, and (iii) determining how to incorporate human behavior models into the formal methodology of feedback control. In the recent survey [36], an overview of the human-in-the-loop in cyber-physical systems is presented with a taxonomic exercise focused on the general roles of the human component together with a requirement analysis.
The aim of this paper is to discuses the major directions and challenges in man-machine systems from automatic control theory perspective. Four directions have been identified: Understanding the attributes and the limitation of the human operator; Categorizing the human-in-the-loop applications and derive generic bounders for each category; Building a realistic application-based model for the human behavior, and Integrating the human models into a formal control synthesis methodology
In the following sections, a detailed description of each of the above direction with highlights on the limitation in the proposed ideas is discussed. Particular attention is given to the model identification of the human behavior [35, 36].
The first step towards real man-machine system understanding is to understand the human physical limitations and behavioral attributes while dealing with physical dynamical systems. This involves many aspects such as the way in which humans process information, the limitation of humans sensory system and humans decision-making process. As it becomes clear that humans behavior and response in control loop inside cyber-physical systems are fundamentally different from the physical controllers. This is because of the difference between the biological nature of humans and the artificial and cyber-physical nature of machines. This difference makes several significant challenges understanding man-machine systems.
In this section, we will discuss our review by highlighting some of the physical limitation and behavioral attributes that distinguish humans from dynamical cyber-physical systems. The term behavior is defined as a set of temporal changes in neuromuscular and physiological activity that can last from a few milliseconds (a blink) to minutes (talking) or hours (sitting) [15]. In other words, the set of actions or reactions of a person in response to external or internal stimulus situation. Behavior also can be observed, recorded and measured. This allows better understand diffidence between humans and physical dynamical systems. Equally understanding this difference will help us to map the engineering theories and modeling tools in order to model human subjects in human in the loop system.
There are certain basic properties of humans that are routinely observed when studying human-machine interactions. The following list highlights some of the factors that define the physical limitation and behavioral attributes of the human operator in human-machine interactions.
Humans, in a most simple form, do not behave linearly; they exhibit time delays in their reaction to stimuli [37]. For instance, the response delay in vision is at least 40 msec [38].
We, humans have limited sensing and actuating capabilities. There are threshold limits that we can not operate beyond them. For instance, our vision sensory is limited by what it is known as flicker fusion threshold. Equally, our audible frequency is bounded by 20 to 20, 000 Hz. The Same limitation applies to our actuation capabilities. These limitations make humans very limited compared with the dynamicalsystems.
As described in [39], short term memory is defined as the ability for holding a small amount of information in our minds in an active and readily available state for a very short time. This duration of time is in the order of few seconds. On the other hand, working memory is a cognitive system with a limited ability that is responsible for the transient holding, processing, and manipulation of information. Working memory is an important process for reasoning and the guidance of the way that we makes decisions [39].
Prediction can be defined as our belief that some event will happen in particular circumstances. However, anticipation is when we are being ready for what might possibly take place and prepare a plan for how to deal with it once occurred. The main diffidence here is that prediction does not require action, while anticipation normally does and affect our current decision. Therefore, anticipatory behavior is a process that depend on a compilation of our belief about the future, the experience from the past, and what we are going through in our present. We, humans, adopt the anticipatory behavior in any decisions-making. This makes us compliantly different for the way that dynamical systems work.
Humans are capable of learning new knowledge in many different ways, which is a unique characteristic of human beings. This knowledge allows human to increase their capacity and experience. The concept of learning in humans is different than the learning in machines, which is in the end of the day statistical learning.
Humans are very adaptable systems; they are able to cope with the environment changes around them. The learning capability and the anticipatory behavior of humans gives them very high level of adaptation. Of course, there are dynamical systems that can be described as an adaptive systems; for instance, adaptive control systems. However, the notion of adaptation in human is more deeper and involved. For instance, human is the ability to adapt to not only different controlled dynamics but to altered operating conditions as well.
Human relay on several inputs and integrate them to make a decision. For instance, human rely on vision, vestibular and kinesthetic, and auditory in order to drive a car.
As indicated in [40], it is not only anticipation that control the human activity. Several other factors have as strong effects such as guessing, expectation, prediction, forecast, and planning. These factors some times comes against what we anticipate. In other words, anticipation and reaction can be considered only in their unity [40].
Scientific research of brain activity in varies events have shown that only chaotic behaviour are exists when we come to human mind. As concluded in [41], We have found that brain function cannot be explained in terms of features of neurons taken individually or as part of a local network, nor is it adequately characterized as a passive reaction to stimuli. The nature of the internal feedback in the brain system along with the incremental uncertainties, which amplified with time suggests that the brain is a chaotic system, where a long-term predictions of a person behaviour isimpossible [41].
The causality is a fundamental property in physical systems. A system is causal if its output depends only on present and past inputs. In other words, the output of the causal system do not depend on the future inputs. This property makes these physical systems realizable and can be implemented in real life. In contrast, a system which present output or response depends on future inputs is called as a non-causal system. Mathematically speaking, if we defined h (t) be the response of a linear time invariant system at t = 0 . Let u (t) and y (t) denote the input and output of the system. Suppose that u (t) =0 for t < 0. Then we have the following relation
In the case of linear time invariant system is causal then there should be no response before the input is applied, i.e. h (t) =0 for t < 0. Therefore, the causal system can be written as follows This shows that for a causal system, h (t) gives zero weight to inputs that occur in the future. Systems that do not have this property are non-causal.
Taxonomy for human-in-the-loop applications
Generic taxonomies and classifications are required to allow for comparison of different human-in-the-loop applications and approaches. It is obvious that human behavior will depend on the type of the application they are involved in. Understanding the different types of applications and categorize them will help in identifying patterns of human behavior that emerge in different applications. A very limited effort has been given to this direction. Only the recent survey [36] attempt to present a taxonomy of the applications that involves human. The taxonomy is based on the human roles in a givenapplication.
Taxonomy based applications should involve several key features that distinguish different application and the human role in this application. Here we present some of these key features that we think are essential in such taxonomy.
Intelligence level of the system under control
The behavioral aspect of the operator will depend on the intelligence level of the system under control and the ability of the system to execute decisions autonomously. Grouping the applications with the same level of intelligence will help to identify similar behavioral patterns of the human operators. Defining the characteristics and the boundaries of their levels is not an easy task, however, it is an essential key feature that defines the human model behavior in controlling task.
For instance, one could consider level zero correspondence to complete the manual operation, where the system has no apprehension of its current state and all sensing, control and actuation is done externally, for example, position status indicator, a locally flashing fault indicator light. Whereas the highest level of intelligence correspondence to complete autonomous system that can execute complex decisions and has the ability to predict and adapt to all situations. Between there two levels, there should be several other levels that stare from basic automation process to advanced control strategies.
System’s controllability
Engineering systems have different controllability degree based, for instance, on the nonholonomic nature of the system. For example, nonholonomic mechanical systems such as wheeled mobile robots, cars, autonomous underwater vehicles, unmanned aerial vehicles, under-actuated robots, cannot move in an arbitrary direction in its configuration space. The degree of controllability often defined as the minimum input energy to change the system’s states [42].
Cost of failure to operate the system
Failure is something we try to avoid at all costs, since the cost of failure is expensive. Our decision-making process and our behavior, while operating a particular system, depends strongly on the cost of failure of that system. In other words, systems with high failure cost will force us to behave more carefully than the systems with less failure cost.
The system resilience and robustness
A resilient system is a system that adapt to challenges and uncertainty by changing its method of operations while continuing to function. However, a robust system is the system that continue functioning in the presence of bounded uncertainty without any changes to the original system. The degree of robustness and resilience of the controlled system affect the human behavior while controlling the system. Therefore, it is logical to classify the systems using its degree of robustness and resilience.
Skills needed to operate the system
Human behavior, as an operator or a controller in a particular scenario, will depend on the time of contact between the human and the controlled system.
Modeling and identification of human behaviors
Model identification for human behaviors is, so far the most visited research area in the human-in-the-loop. Control theory was the main founder for most of the human models. Since the 1960s, there has been strong progress in finding a proper model that mimic the human behavior in manual control. Most of this models were driven from an application such as airspace to model the human as a pilot.
The first recognized control theory orientated models for the human operator was early in the 60s. A human operator was modeled as servomechanisms with sensors and actuators acting as the sole-in-the-loop control element, Fig. 1. The independents of the operator model and the plant or the machine was assumed as an initial hypothesis, i.e., the human operator behavior model could be combined with any control process or any machine he is controlling. The work by McRuer and his colleagues in [32, 43] was very successful, in particular, the crossover model. In this section, we will review briefly some of this model and their developments. In Table 1, a summary of the of models considered in this paper is presented.
A summary of the models discussed above
A summary of the models discussed above
There have been several models for human as an operator or a controller that operate a dynamical system. Perhaps, the quasi-linear model is the first to be known. An early review on the development of that model is given in [43] by the model founders. The quasi-linear model developed from the fact that most of the nonlinear systems have similar responses to specific inputs comparing to the responses of equivalent linear systems to the same inputs. For a given input-nonlinear-system combination, the response of the nonlinear system can be divided into two parts; one component which corresponds to the response of an equivalent linear element driven by that input and an additional quantity, called the remnant, which represents the difference between the response of the actual and the equivalent linear element [43].
This model is also known as crossover model since the performance of the human based on this model depends on the crossover frequency ω
c
. At this frequency, the open loop transfer function satisfies
The advances in modern control theory by Kalman and others led to more sophisticated models for the human operator. Kleinman and others introduced an optimal control-based model (OCM) for the human operator in [44, 45]. The main hypothesis behind the optimal control-based model is that a well-trained, well-motivated human operator behaves in an optimal manner, subject to his inherent limitations and to the requirements of the control task [44]. In this model, the human operator is working as a minimizer of the error between deviations of the actual process trajectory and the desired trajectory. Based on the assumption that human behaves optimally in some sense, [44] proposed a quantitative model for the response characteristics of the human operator. The model consists three main elements; time-delay, a representation of neuromotor dynamics, and controller remnant as limitations; see Fig. 2.
The internal model in control theory refer to the process that simulates the response of a given system in order to estimate the output of the system disturbance and was first intruded in [46]. In the recent work in [2, 47–50], reversal experimental setup was conducted to show that human operator can be modeled according to the internal model hypothesis using system identification. They argue that human operator model can be divided into two components, one of which is the inverse dynamics of the controlled system in a feedforward and the other component is to be identified from the system identification; see Fig. 3.
In many control systems, it is required that the outputs should track the reference signals in the presence of exogenous disturbances, see Fig. 4. If the future information of the reference signal or the disturbances is available, then we can greatly improve the performance of transient responses. In the preview control method, the most important thing is that the known future demand output or disturbance signal affects the operation of the control system. As a typical method, the augmented error system plays a vital role in the design of the controller in discrete-time systems. Since the relationship between the system and the future signal was established by the difference operator, discrete-time systems have become very popular in many research fields.
Gawthrop and others developed a model for the human operator based on intermittent control theory [51–59]. The intermittent control model for a human operator are building on the optimal control model developed by [44]. The model assumes the following hypothesis general theory of human control systems must include continuous as well as intermittent processes which incorporate discrete switching [53].
Human behavior model based on Hidden Markov model was proposed in [60]. In this model, the human is considered as a device with many numbers of internal mental states. Each mental state has its own particular control model that describe a particular behavior and interstate transition probabilities. An example of such models would be a set of standard linear systems (e.g., Kalman filters with a simple control law), with different dynamics and measurements. These linear models are sequenced with a Markov network of probabilistic transitions. The short-term behavior is described by the linear model dynamics, and long-term behavior is described by the transition between these linear models.

Optimal control model for the human.

Optimal control model for the human.

Preview control.
Bio-inspired models for human behavior focus on the collective decision-making of group of humans share some interest. These models are mathematical models inspired by the biological evolution of animals birds, insects, or humans, that describe how a group of individuals can make a collective decision. It combines approaches in the social and natural sciences. Bio-inspired models are based on the fact that group decisions in animals pose many similar questions to those in humans [61]. In [61], they suggested a categorization of group decisions into two classes; aggregate/consensus decisions, and interactive/combined decisions. Aggregate/consensus decisions when a group makes a single collective decision, e.g. between multiple options, that is ‘binding’ in some way for all members. Interactive/combined decisions in which there need not be a single collectively binding decision, but in which individuals decide interdependently with oneanother.
This model considers decision-making scenario in which two alternative choice task is given to each individual and has to decide between the two alternatives [62]. The evidence aggregation is modeled by the drift-diffusion process defined by
This model considers a set of n decision-makers performing a two alternative choice task and let their interaction topology be modeled by a connected undirected graph G with Laplacian matrix L ∈ n×n. The model is mathematically described by the following coupled drift diffusion model:
In [63], they consider a set of N agents. Let The distributed dynamics as a candidate for the realization of bio-inspired collective decision-making behavior is given as follows;
Where, the term uS (x
j
) in (5) is the opinion of agent j as perceived by a generic agent i. The parameter u controls this smaller value and models the social effort: higher social effort leads to a broader range of opinions being assessed correctly. With this interpretation, the uninformed opinion dynamics (5) is the continuous time version of the process in which each agent at each time updates her opinion to a convex combination of her opinion with the perceived opinions of her neighbors.
Cognitive models consider the low-level details of human behavior and typically focuses on prediction and generality [64]. For instance, how eye movement on a screen, the characteristics of human memory, the level of reasoning required to act on a particular problem or make a choice and the time required to press a button. Psychological research is the leading buddy of building cognitive models, which constraints of the human cognitive system [64]. Several attempts to combine computational and psychological theories to expand the human behavior models [65–67]. These computational models can be represented as a computer program that combines deductive and experimental methods in one computational approach. In this way, experiments can be designed with the model, and the dynamic properties of the model can be examined by studying its actual behavior and allow us to observe and measure their behavior [68]. These models are different from statistical and mathematical models, which describe a particular phenomenon or behavior but do not have the capability to generate similar behavior.
Integrating human model into the control synthesis methods
Incorporating the human behavior as an element of control synthesis methodology is the most important research challenge from our point of view. Yet, there is limited work in the lecture in this direction. Possibly because the human model is not fully utilized yet.
As indicated in [35], integrating human model to be part of the system itself will have several advantages as it will allow analyzing the property of the entire system using feedback control strategies. Also, this will allow running optimization methods across the whole system including people, building, or even city to optimize for diffident parameters to maximize generalized utilities, e.g., health improvement or saving energy. Some of these immediate advantages of integrating the human model into the control synthesis methods are presented in following subsections.
Optimal allocation of humans within the loop
The notion of optimality is an essential notion in control theory. Optimal allocation of sensors and actuators.
A generalized Fitts List presented in [69]
A generalized Fitts List presented in [69]
Human allocation and their functionality within the control loop is a necessary step in man-machine system design. In the system design stage, a decision whether a particular function will be allocated to a human, technology or mix of human and technology is to be made. This process is called function allocation process [69].
The difference between human and machine capabilities has a long history in man-machine systems and human factors. Perhaps, Fitts list [70] is the earliest to be considered that made a clear distinction between human and machine capabilities. A Generalized Fitts list presented in [69] shown in Table 2.
Optimal allocation of these functions will lead to the system to be more effective and reliable. The notion of optimality is an essential notion in control system design. However, optimal control theory focuses on the design of optimal system components only. The challenge here is to be able to formulate and quantify concepts such as error rates, costs, hazards, human values, technological feasibility, ethical issues and the desire of people to consider the optimal function allocation of humans within theloop.
Closing the loop between humans and machines will require deep understanding to both the machines and the humans to be able to build a reliable interface. These interfaces are backed up with a control system that allows the operator to interact with the controlled system. The design of such controllers is optimized for a wide range of operators with a different set of skills and capabilities to allow for wider range of users. A more reliable control system is, the control system that learns and adapt to the particular operator, i.e., a control system that has the notion of personalization.
The personalized control system is similar in concept to adaptive control is the control method that adapt to a controlled system with parameters which vary with time, however, in the personalized control system, the controller adapted to the operator skills and behavior. Equally, personalized control system "learn" the operator behavior with time and build a model for that particular operator.
Building personalized control system depend on learning the operator model and understanding the set of skills that he/she applies to control or interact with the given system. Machine learning with the computational power available these days provide powerful tools to generate and analyze rich data online in quick and efficient manner. Utilizing such tools in our control system design for human in the loop application will facilitate the concept of the personalized control system. For instance, if the operator is allowed to operate or control the system for some time, the controller uses some machine learning techniques to build a model for that operator and adapts its parameters to the identified model.
For instance, one way to understand an operator behavior in a cyber-physical system is to analyze his input and output from a control theory and data mining perspective to mine for the repeated habits. Finding unknown repeated behaviors or patterns can lead to better understanding of operator’s action in particular situations. More so, mining the repeated behaviors of skilled operators can be used as a benchmark for teaching either other human operators or autonomous systems.
In [71], we introduced a method for pattern (repeated behavior or habits) detection in trajectory data generated in the human in the loop applications. The proposed method is suitable for pattern detection in position trajectories of moving objects in 3-dimensional space or 2-dimensional plane. The method is a geometrical shape-based pattern detection, which operates in two phases. The first phase consists of trajectory segmentation, creating segment fingerprint, clustering and symbolic representations. Then, the second phase of the proposed method is pattern extraction. The second phase is inherited from the motif finding algorithms in time series data and DNA sequences.
Concluding remarks
The purpose of this research is to find out the collaboration between physical systems and humans in a man-machine systems. The research suggested that, modeling human behaviours, is still at an early stage of investigation. The available models are either very general or very specific for a particular scenario. As it is obvious that the human model involves highly nonlinear complex dynamics. One way of achieving reasonable model is to build a library of generic mental models in different situations. Here mental models are internal representations of the external environment that humans use to interact with the world. The parameters of these generic mental models can be configured for every individual.
The review raise several possible research questions, which can be summarized as follow: How to integrate long decision-making models into human models? How to build generic classification and taxonomy for human in the loop applications? How to equip autonomous systems with cognitive and personalization capabilities in order to increase the cooperation with humans in human in the loop systems? How to optimally place human subjects into automated systems? How to create, quantify, and test the cooperative behavior between humans and autonomous systems?
In our opinion, in order to answer some of these questions, human modeling problem should be formulated as of building a tunable generic model (or set of models) that describe the human behavior in controlled environment from both micro-muscle and decision-making perspective taking into consideration uncertainty of the controlled system.
