Abstract
There is a belief that complexity and chaos are essential for adaptability. But life deals with complexity every moment, without the chaos that engineers fear so, by invoking goal-directed behaviour. Goals can be programmed. That is why living organisms give us hope to achieve adaptability in robots. In this paper a method for the description of a goal-directed, or programmed, behaviour, interacting with uncertainty of environment, is described. We suggest reducing the structural (goals, intentions) and stochastic components (probability to realise the goal) of individual behaviour to random variables with nominal values to apply probabilistic approach. This allowed us to use a Normalized Entropy Index to detect the system state by estimating the contribution of each agent to the group behaviour. The number of possible group states is 27. We argue that adaptation has a limited number of possible paths between these 27 states. Paths and states can be programmed so that after adjustment to any particular case of task and conditions, adaptability will never involve chaos. We suggest the application of the model to operation of robots or other devices in remote and/or dangerous places.
Introduction: problem statements and area of application
Exploring a remote environment, such as space or other planets, or the places on Earth difficult to access, needs robots that can be trusted to do their jobs properly, quickly, creatively, autonomously and reliably. Central control in some cases becomes impossible: large distance and lack of local information, time of signal travelling for space conditions. This means that control should be distributed in a robot swarm. A number of small simple cheap robots is more reliable and damage-tolerant than a single expensive and complex robot. We need robots that could work under harsh conditions due to their adaptability both as individuals and as a group. The challenge is to produce a complex distributed system that can work under changeable conditions. Such an adaptable and even creative autonomous robot swarm will provide:
Cheap construction and complex behaviour: part of the group could ‘sleep’ while others work. Abundant and replaceable interchangeable agents. Reliability, because the group can perform even after losing most of its parts. Intrinsic working modes that change with environment and functions without changing the group construction Self-repair and homeostasis of the group. Longevity: the group can protect itself as a group more effectively than a solitary robot.
A characteristic of a distributed system is that its parts work more or less independently compared with an integrated system where parts are functionally and morphologically fixed, reducing the possibility for adaptation. In both cases the environment induces uncertainty in system behaviour, but in the case of a distributed system there is always a chance of the unexpected event within a system itself.
There is a belief that complexity and chaos are essential for adaptability. This is true, but only for unanimated things. Life deals with complexity every moment with absence of chaos: all is more or less well organised. Still there is a place for unexpected, but this is nothing to do with chaos that engineers dislike so much. Life deals with uncertainty with goal-directed behaviour (Bogatyrev N.R., Bogatyreva O.A;2003). Goals can be programmed. That is why living Nature gives us a hope to achieve adaptability in robots: we just need to develop a method for a description of programmed (goal-directed) behaviour when it is affected by uncertainty. For a robotic swarm, designed for planetary exploration we definitely would like:
Self-organisation but under our control Adaptability, but not beyond predictability.
To achieve this we need a biomimetic approach Vincent, Mann, 2002; Vincent, 2003) This approach gives us a hope for predictable adaptability because, compared with physical (non-living) systems, changes in life are never random (the system would not take off in a bizarre new direction) and a living system is open to environmental impacts only at some sensitive stages of its development (i.e. they are semi-open systems).
Methodological background and empirical data
Our model is within the framework of the most pressing problems of adaptive intelligence and artificial life (Kauffman, 1993; Dawkins, 1982; Axelrod, 1984, Boden, 1996). Nature's prototype for such systems is a colony of social insects (ants or bees): it is self-organised, self-dependent, self-adapted and self-regulating. To understand the idea of “self-”, we need to know how “self-” works. There are two main concepts (Johnson, 2001; Bonabeau, Thiraulaz, 2000):
Top-down (pace-maker); hierarchical control – deterministic approach. Bottom-up (emergence) – stochastic approach.
We have identified 5 fundamental principles comparing bottom-up and top-down intelligence concepts (Table 1). In bottom-up organisation, many small simple elements form, with a few simple rules, complex systems that are rich in behaviour, adaptation, reliability, possibilities, etc. In top-down organisation (largely neglected by researchers) relatively complex elements create a relatively simple system that could suggest better performance (sociological, economical, ecological management, etc.).
The comparison of two general methodological approaches to complex systems study
The comparison of two general methodological approaches to complex systems study
In both systems we need to know the conditions under which emergent effects appear in order to maintain them (for “beneficial” effects) or prevent them (for “harmful” effects). There are presumably also neutral emergent effects. Both approaches look for mechanisms that allow quick responses to environmental changes. But in the bottom-up approach, ignorance is not useful for adaptation and engineers cannot afford randomness as well as natural selection. The main problem with the top-down approach is a predefined order that prevents quick adaptation. To benefit from the advantages of both approaches we need to combine them, but nearly all the characteristics are in conflict.
There is an individuality in Nature (the agents are never similar), rules are not given, they evolve and persons are definitely cleverer than a system they are in (global wisdom cannot be achieved with local ignorance). So, the statements for our concept are the following:
Hierarchy of interactions instead of hierarchy of complexity. Each agent possesses individuality. Rules emerge. Changes in a system are never random. Agents think globally and act locally.
To achieve the goal of merging the bottom-up and top-down approaches we need to estimate the contribution of each agent to the team behaviour and, by the “group picture” of these contributions, judge the group state and predict its development.
Biological systems are massively parallel and distributed, they use disposable components, they are robust to perturbations in their environment, they learn innovative solutions to problems, and their global structure and behaviour are not predictable from simple inspection. The favourite prototype for this kind of system is ants (Deneubourg et al. 1990). But we will look at them from a different point of view – ants are not similar, stupid and one-rule “robots”, their behaviour is based on a permanent decision-making process, cooperation and on an information flow organised in a network by innate rules for interaction Decision-making as well as cooperation has rules, but these rules depend again on cooperative decision-making. Our belief is that there are limited numbers of cooperation styles in nature (like basic archetypes) and ants can switch between them to provide an adaptation.
We studied ant individual and social behaviour for 20 years (1979–1999) in Kazakhstan and Western Siberia. All data presented in this paper are based on 1000 hours of ants' personal time budgets and trajectories of individually marked (named) workers obtained in field observations in their natural environment. We observed 13 colonies of Formica uralensis Ruzsk., 9 colonies of Camponotus japonicus aterrimus Em, 3 colonies of Camponotus saxatilis Ruzsk., 21 colonies of Formica cunicularia Ruzsk., 22 colonies of Cataglyphis aenescens Nyl., 12 colonies of Formica picea Nyl., 17 colonies of Formica pratensis Retz‥ (Bogatyreva, 1981, 1987, 2002). The model we developed is a result of our attempt to understand each individual contribution into colony management.
Before we start creating a model, we should understand the very “physics” of swarm behaviour and interpret this within the basic concepts of mathematical statistics. A program – robot behaviour “goal” (structural component, estimated by numerical/ ordinal value) meets an uncertain reality (estimated by probability, for example – the probability to get into the meteorite “rain”) and we need a model for the description of such a system, the behaviour of which is determined by both: structural and stochastic components (Bogatyreva, Shillerov, 2003). There is a common method for description of complex system behaviour – entropy. But we cannot directly use it because 1) the entropy method is suitable only for stochastic process description, but we have a structural component as well and 2) the entropy index depends on the number of system elements, which can vary (robots can be broken, lost, deactivated).
To avoid the first obstacle and enable operation with events as random variables, we reduced the random variable with numerical value (structural component) to random variable with nominal value (see 3.1). This means that the variable which distribution we are interested in reflects each agent contribution to the whole system state. To avoid the second obstacle we introduced a normalized version of entropy: h=H/Hmax (see 3.2). We need only to choose which parameters in agent behaviour to measure to judge a system state (see part 4).
Reduction of numerical data to nominal values
A system Ω consists of N agents (elements) ωi (
The basic concepts of mathematical statistics are: event, event measure and event probability, category of events and different kinds of feature value distributions within categories. In order to investigate interactions between agents in the system Ω we need to establish a correspondence between the real processes and analogous concepts of mathematical statistics. Any interaction is asymmetrical. Dominance is the behavioural expression of this asymmetry and is a process/property of a system that reflects in single agent behaviour. The feature that we investigate in agents' behavioural hierarchy is its rank. According to mathematical statistics the event A is a set of agents with similar feature value. The measure of the event μ(A) is the number of agents whose ranks are the same. The number of all agents in the system Ω is we call the basic number μ(Ω)=N. So, the probability of event A is the ratio of the event measure and the basic number:
In probability theory a complete system of events A1, A2,…‥An means a set of events that one and only one of them occur at each trial, e.g. only one of each rank value can appear in the experiment. If we are given the events
The systems with the categorical values of features are well described with informational entropy index (Shennon, 1948; Khinchin, 1958).
Entropy index can be interpreted a measure of distribution scatter, uncertainty, divrsity and information quantity. We are interested in measuring of uncertainty of system Ω behaviour. But in the reality all agents' interactions have “intensity”, strength. In other words, any category of interaction has numerical value of the feature. So, the distribution of the feature of agent interaction in the system can be characterised by the random variable with numerical values. To enable us to use entropy index we need to reduce random variable with numerical value to random variable with nominal value (Felinger, 1983). The random variable X with numerical value is the ordinary random quantity. It can be characterised by the set different values

Dice game with structural and stochastic components acting independently
Probability characteristics of agent's interaction are independent from structural (functional) relationships between them – probability of each die face is the same, but the game result is different.
Let's play dice N times. If we have differently coloured die faces (Fig.1), the statistical results of the game will depend on the frequency of each colour appearing.
The expectation E(X) of a winning number in a single die throw can be calculated. The mathematical expectation of a random variable X, that can have possible meanings (events
The contribution of each face to the result of the game is different. This happens due to the structure imposed on the quantities xk of the random variable X – numbers. Winning depends on this structural component because the probability of each event (die face appearance) is the same.
If each agent is different and n=N, ωk can be seen as event Ak,
The contribution
We can consider
There is no restriction for the numbers being similar for different die faces. For example, x1=1; x2=1; x3=2; x4=2; x5=3; x6=4. Here we have the events/face colour
The contribution of each event to the game result E(X) can be calculated using the equation (2):
Now we have got a probability distribution for the random variable reduced to the nominal values Ak which is different from the first case.
Any physical variable can be considered as the multiplying of some numerical value on its measurement unit. For example, time – 5sec means that the calculated time is five times one second. So, in the equation (2), part of structural relationships, which are determined by numerical values xk are transferred from “intensity” into “frequency”. The numerical magnitude we will call “intensity”, with a unit of measurement standard intensity; the frequency of pattern/unit of measurement repetition.
For example, in some manufacturing process it is necessary to perform n different kinds of operation: A1, A2, …. An. It takes xk units of time (or other resource) to perform the Ak operation,
The quantity xk Pk is the outcome of multiplication of intensiveness xk and “frequency” Pk. We can interpret this kind of multiplication as “action” using the analogy with momentum = mv (m is mass, v is velocity) or work = FS (F is force, S is distance). Every action xk Pk is performed by some group of agents, included in the event Ak. The value of
In the general case the reduction of the random variable X with numerical value
Where mathematical expectation is
We can consider qk as a probability distribution of random variable reduced to the nominal meanings Ak. Now we have a finite scheme A: complete system of mutually exclusive events A1, A2, ‥ An together with their probabilities
To measure the amount of uncertainty associated with a given finite scheme we can use entropy index as it is proposed in the information theory (Khinchin, 1957)
Or by virtue of (3)
Here the logarithms are taken to an arbitrary but fixed base, and we always consider pk log pk = 0 if pk=0 (Khinchin, 1957). In specific case when all probabilities are the same P(xk)=P, the above equation becomes
In the Probability Theory function
The entropy index (4) is an estimate of the mean expectation of disorder in a system Ω. Disorder is uncertainty. Entropy H is maximum when all probabilities are equal to each other, and under those conditions qk = 1/n, where n is the number of events (Khinchin, 1957). Hence, Hmax = log n characterises maximal uncertainty in the system Ω. The surplus information characterises predictability – order – in the system is given by:
where Hmax is the potential diversity of the system and H is the current diversity in the system. Index of current diversity varies 0 ≤ H ≤ log n and obviously depends on n – events number. This creates difficulties when we need to investigate a system in its development accompanied with changing of agents' number or to compare two different systems. To avoid this obstacle it is useful to normalise H to yield
For any probability qk distribution, normalized entropy index h varies in the range [0;1] and measures the relative amount of uncertainty in a system. According to (7) normalized surplus information i = 1−h varies in the range [0;1] and measures relative amount of order in a system.
Let's name them accordingly:
The number of all system states is U=V+C=1. We are mainly interested in the ratio of numbers of order (predictable changes) and disorder (unpredictable changes, uncertainty, variety). The number of unpredictable changes (uncertainty) referred to a single predictable change is:
The number of predictable changes referred to a single unpredictable change (uncertainty) is:
The equations (10) can be interpreted as the “velocity” of change of uncertainty relative to predictability. Equations (11) can be interpreted as the “velocity” of change of predictability in relation to uncertainty.
Lines cross the functions (10) and (11) with the linear functions of growing order (9) and chaos (8) correspondingly (when VC = C and CV =V) divide the normalized entropy axis into three ranges: [0; 0,382), [0,382; 0,618], (0,618;1] (Figure 2). Hence,

Normalised Entropy Index «h» as a relative measure of order and uncertainty in a system/robot swarm
By virtue of equation (8) and (9):
Hence,
In equation (15) C is a geometric mean of V and U. In equation (16) V is a geometric mean of C and U.
When h =0,618 or h = 0,382 accordingly:
We have shown how to use the normalized entropy index to estimation the balance between chaos and order for only one parameter. Now we need to develop the method for multi-parameter cases. The model concerns interactions in conditions of uncertainty: unpredictable environment, neighbour behaviour, behaviour of the system you are in - where a chance (uncertainty) and a program meet and their balance gives the “emergent effect”. In each case goal-directed behaviour faces uncertainty from the other “partner” in an interaction pair (see the Table 2).
Structural and stochastic components in system/swarm behaviour. k1, k2, k3 – coefficients which can be obtained by experiment
Structural and stochastic components in system/swarm behaviour. k1, k2, k3 – coefficients which can be obtained by experiment
Any system consists of sub-systems and belongs to a super-system. The system is a team and significant parameter is that which determines team cohesion – a control style, information flow. The sub-systems are agents that perform functions giving a system life; their main task is trading energy trade with a system. The best strategy for an individual agent is to minimise energy costs (which leads to specialisation), whereas the best strategy for the colony is for its agents be poly-functional and replaceable. The super-system is an environment supplying resources for a system.
Accordingly, all possible types of interactions in a distributed system are accounted for: agent-agent, agent-team, team-environment interaction. So, the main parameters are those which characterise control, function and resource (Fig. 3, Table 2), we need only to adjust a model to each particular system we are studying/making (coefficients in Table 2).

“Thinking Cube” Model
The Normalized Entropy Index (NEI) is the common scale to measure the interaction of a system with its environment, the interactions between the sub-systems (parts of the system) and the interaction of a single part with the whole system. The Normalised Entropy Index divides each of these axes into 3 ranges: zone of chaos, zone of order and corridor of quasi-equilibrium. We call the 3D version of our model “Thinking Cube”. The main assumption is that there is a limited number of possible effects (27 cells in the cube – Fig.3) and even fewer possible ways for a system to react, so we can program the adaptation process and avoid undesirable situations when our autonomous system makes an unsuitable decision.
Further development of our model allows:
Creation of a general model, which will describe integrated system as a particular case of a distributed one. This will allow us to vary team parameters in order to regulate the degree of swarm integration/distribution. Being distributed, such robots can be sent to a planet or into space where they will adaptively form integrated constructions. Development of control methods for distributed adaptable robot swarm management. Investigation of system behaviour in all 27 possible states (“Thinking Cube Model”).
Application to cooperation in an ant colony
Studying the individual behaviour of ants we found that some of them are leaders, but others prefer to follow. Leaders and subordinates have quantifiable differences in their behaviour: leaders do not look for contacts; they pay little attention to subordinates and contact them only when they need to; subordinates actively look for contacts with leaders and with each other. If each ant is given a rank according to the percentage of contacts it rejects, it is possible to draw diagrams of their interactions (Fig.4, 5; Table 3). According to our observations some Command Units are linear (h<0.38) and some are branched to the extent of over-crowding (h>0.62). Teams shown as open circles we never observed. The most common in nature are teams based on triple unit fall within the limits, where h index shows quasi-equilibrium (filled circles). We can also draw all the possible structures up to the 5th level of hierarchy based on complete, linear or over-crowded units. Management based on a Triple Command Unit allows group cohesion even if the number of ants is extreme (very large or small) and changeable (fig 6).

Formation of a command unit

Information transfer between leader and subordinates

Triple Comand Unit as the most efficient way to accommodate the largest number of ants into the information network without losing group cohesion
Rank of the dominance as structural component of ant behaviour
So, the most effective way to keep ant colony cohesive is the Triple Command Unit, which yields values of h between 0.38 and 0.62. This allows rather more than 60 ants to form a cohesive team with automatic increase or decrease in the number of ants performing a task. The maximum number of hierarchical levels is 5 (the highest number to which ants can count) beyond which the group loses its integration (Bogatyreva & Shillerov, 1998).
The group cohesion fis extimated bythe Normalised Entripy Index h. It increases significantly if more than 2 subordinate ants are associated with a leader ant, at whatever level within the hierarchy.
So, the Index of Normalized Entropy (INE) is a working, empirically tested parameter of group cohesion that will be the basis for our model of adaptable distributed swarm behaviour of robots. From triple unit pattern it is possible to create a hierarchically organised system of interactions of agents or robots within a responsive swarm while a performing a particular task (fig 7).

Estimation of adaptability, instability and non-adaptability, using the Normalized Entropy Index (NEI)
For example, we have a set of robot departures from the base to get resource R1 – “meat” for bio-generator of energy, R2 – light for solar panels. After 50 departures robots brought “meat”, the result of 30 departures was solar energy, but 20 departures were without any result (probabilities p1=50/100=0.5; p2=0,3; p3=0,2).
We can set up a rank of resource need as meat (R1) − 100, solar energy (R2) as 40 and empty journeys as 1(minimum) (these ranks should be adjusted to particular case during the experiment).
We need to estimate the contribution of each energy source to the “final result of the game” – energy effectiveness of the environment: to reduce the random variable with numerical value to random variable with nominal value.
According to the equation (2):
E(X) = 100 · 0,5 + 40 · 0,3 +1 · 0,2 = 62,2 and
The resource conditions are thus in the corridor of quasi-equilibrium and there is nothing to worry about. The robot swarm is in sustainable condition and will survive.
The next example is about task distribution in a swarm (Table 4).
How a group regulates agent function distribution
How a group regulates agent function distribution
Calculations give us the following:
In this example
The team of robots (distributed system Ω with agents ωi) is to be designed to investigate the effectiveness of energy extraction on vaguely known areas of alien planet. Each robot is capable to extract the energy performing the full set of following functions: F1 – the function of light energy usage, F2 – the wind energy utilisation, F3 – energy extraction from chemical reactions. Due to the fact that the surface explored is largely unknown let's initially distribute these functions among robots evenly. To start with each robot turned on to perform just one function: one third of robots performs the F1 function, one third – F2 function and one third of them – F3 function. They could turn to another energy source if it is necessary. The decision should be made to which from 2 other sources to switch. Solution based on sensors (local knowledge) does not include long-term adaptability - robot should know what mode is the most efficient at the moment. This can be achieved by simple communication. There is no a single example of a swarm in nature that can survive without communication and hierarchy. To simplify complexity theory, make it more animated, we included these features in our model.
Initially all robots have equal energy potential essential to start energy exploration on the remote spot and rank assigned according to it. Later on the rank of each robot is proportional to the square of effective energy:
r ≈ E2 effective , where Eeffective = Eg−Es – is the difference between generated (g) and spent (s) energy. As soon as an individual robot is incapable to perform the energy extraction in the efficient way (Eeffective<0) and energy storage falls beyond the threshold it enquires other robots of the ways to do it better (simply by sending a SOS signal). Other robots once they acquired amount of energy above some limit performing, for example functions F1 and F2 described earlier share their experience (they might share energy as well) with robots that are in need of help: “Do it like me!”. If a robot fails to acquire the needed amount of energy using its method (for example F1), it switches to the method that the helping robot with larger rank advertises, for example, F2. According to the structure of this “learning” development the diversity of ranks (F1, F2 and F3 methods) is decreasing as well as the entropy of the system.
The environmental conditions can change, so the winning methods of energy extraction could lose their efficiency: the rank of the robots that were winning on conditions before the change took place is falling. So, the rank distribution would also change. Clearly it will take place during the course of normalized entropy in the following interval: 0,382 ≤ h ≤ 0,618.
If the environment is stable for a long period of time, the diversity of the robots' functions ranks is getting rather high – all robots will perform the same function. The normalised entropy would be lowering drastically h ≤ 0,382; team becomes too adapted to the environment, but dangerously none-adaptable. The team central processing unit should take the decision to prevent all robots switching to one energy source.
If environment is very changeable the normalised entropy of the system would reach its climax h ≥ 0,618 – swarm disintegrates and robots will turn to a solitary work.
If the environment is moderately changeable, the distribution of ranks altered amidst robots would also take place. The normalised entropy index should stay in the corridor of quasi-equilibrium to provide group adaptability.
Conclusions
Global knowledge of agents, which take local action, is the only way to real adaptability. Decision-making and choice are the basic factors of life, whether we like it or not. Accepting this surprisingly did not cause difficulty; it even made the modelling easier. Decision-making is determined by goals and goals can be programmed as well as their possible changes or variations. We do not need unpredictable complexity for engineering. To build a simple system from elements with complex behaviour happens to be the way to controlled adaptability.
To achieve self-organisation and adaptability in robotics but not move beyond predictability we have developed a model, which allows:
Combination of top-down and bottom-up approaches; estimate the contribution of each agent to the behaviour of the system. The set of individual contributions gives 3 types of group pictures: strict order, quasi-equilibrium and “chaos”. Detection of the system state by investigating a sample of its agents (the “picture” of a group). The Normalized Entropy Index is a group index based on the behaviour each agent. Repression of the undesirable property of complexity - the “butterfly effect”. We argue that adaptation has a limited number of paths between 27 possible states. Paths and states can be programmed. After adjusting to a particular case of task and conditions, adaptability will never involve chaos.
A topical sphere of application of the model is the operation of robots in remote and/or dangerous places. It would be possible to design a modular system of basic robots, with sets of skills and sizes able to create temporary structures (bridges, shelters, vehicles) by self-assembling according to environmental conditions.
Within other fields of science and engineering this model is also useful to solve current burning questions:
In management, planning and product design: to escape from unpredictability, to trace a system changes. In distributed cognition: to determine the actions required to bring a system to a desirable state. For instant decision-making and optimal and robust control of dynamic systems: to organise information about system state in general parameters. In sustainable engineering: risk and uncertainty in engineering design, to show system vulnerability.
Footnotes
Acknowledgements
The authors are grateful to Prof. J.F.V. Vincent for inspiring discussions and interesting suggestions, to Dr. John. Williams for reading the manuscript, asking questions and patiently listening to all our presentations and for Dr. Nikolay Bogatyrev, who shared our field adventures studying ant.
