Abstract
Multi-objective optimization can be used to address possible conflicting relationships between multiple objectives. However, some objectives have a fuzzy temporal relationship between them, making it difficult to give a common method to portray the fuzzy temporal relationship. To fill this gap, we propose the concept of complex objectives, which can be described by fuzzy temporal logic that includes both temporal and logical operators. Furthermore, we investigated the optimal control problems of complex objectives and developed a fuzzy system called possibilistic decision systems (PDSs) to establish a framework for optimal control. In PDSs, states of fuzzy systems are determined by a family of variables, and transitions induced by actions between fuzzy states of systems are also fuzzy uncertain and determined by a possibility degree. Importantly, we proved that memoryless strategies are sufficient for optimal control of complex objectives. Finally, the theory presented in this paper is illustrated by a mobile robot simulation.
Keywords
Introduction
Multi-objective optimization (MOO) [1] is a problem in which multiple decision objectives are considered in the optimization process. Unlike traditional single-objective optimization problems, multi-objective optimization problems have multiple conflicting objectives, and the optimization process is no longer simply a matter of finding the optimal solution to a single objective function, but of finding the optimal balance between multiple objectives. Multi-objective optimization has important applications in many practical applications, such as engineering design [2], financial investment [3–5] and control systems of robot [6–8].
Research on multi-objective optimization problems has been ongoing for decades and researchers have proposed many different algorithms and methods, including evolutionary algorithms based on genetic algorithms, particle swarm optimisation, simulated annealing and ant colony algorithms, as well as traditional optimization methods based on linear programming, non-linear programming, support vector machines, neural networks and so on. Each of these methods has its own advantages and disadvantages and is suitable for different types of multi-objective optimization problems.
Most optimization problems in practice are multi-objective optimization problems. In general, sub-objectives may influence and constrain each other. The improvement of one sub-objective may cause the performance of another or several sub-objectives to decrease. In other words, it is not possible to achieve optimal values for several sub-objectives at the same time, but only to coordinate and compromise between them, so that each sub-objective is optimized as far as possible. In this case, the multi-objective optimization problem can generally be written as the following mathematical model:
Multi-objective optimization is a challenging field that must address several issues. One of the primary issues is how to handle conflicts and trade-offs between system objectives in uncertain environments, particularly under fuzzy uncertainty [9]. In some cases, there may be apparent complex conflicts between different objectives, such as logical connectives ∧ and ¬. In other cases, there may be intricate relationships and constraints between objectives, such as temporal relationships expressed by operators like “next time,” “until,” and “always.” For example, consider the objective of ensuring that sub-objective a is always reached before sub-objective b. Such a temporal relationship is difficult to express through a functional relationship. Therefore, multi-objective optimization requires the design of algorithms and methods that can adapt to different types of logical conflicts and temporal constraints in fuzzy environments.
To fill this gap, we proposed the concept of complex objective (CO), which is described by fuzzy temporal logic such that the objective function can express complex temporal properties between sub-objectives in fuzzy systems. In fact, a complex objective is also a quantitative property of a system. Fuzzy systems [10] are the generalisation of deterministic systems, in which the input, output, and state variables are defined on a fuzzy set and therefore include fuzzy uncertain information. In order to model systems with fuzzy uncertainty, several operational fuzzy models have been proposed, such as fuzzy discrete event system (FDES) [11–14], possibility Kripke structure (PKS) [15, 16] and its generalized version generalized PKS (GPKS) [17], etc., models. We adopt the notion of GPKS in this paper, extended by nondeterministic choices for decision-making. The nondeterminism is critical for control systems to make decision. In this paper, we call this GPKS with nondeterministic choices possibilistic decision system (PDS, in short). The use of PDS representation is based on the assumption that the decision depends on various imprecise sensor measurements that are obtained at discrete sampling instants. The fuzzy states in PDSs are determined by a family of evaluations of sensors, expressed as a set of atomic propositions of states.
Therefore, we focus on the following two issues in this paper: 1) How to characterize a complex objective with temporal properties and logical relationships? 2) How to implement complex objective optimization? Fuzzy temporal logic has features that make it an adequate tool to address a large amount of uncertainty that is inherent in natural environments. In [18], Saffiotti review some of the possible uses of fuzzy logic in the field of robots navigation. Furthermore, temporal logic formulas can describe various complex properties with temporal properties in the field of formal verification, such as model checking [19, 20]. Several fuzzy temporal logics have been well studied by Li in [17, 21–24]. Generalized possibility computational tree logic (GPoCTL) is a fuzzy temporal logic that can characterize the basic temporal properties. We developed the GPoCTL by adding actions, called possibilistic strategy computation tree logic (PoSCTL in short), to characterize a complex objective in this paper.
Main contribution
In this paper, we investigate complex objective optimizatio n (COO) in fuzzy environments. Complex objective optimization in fuzzy environments is a field of research that deals with multi-objective optimization problems where the objective functions are fuzzy in nature. In COO, the objectives are described by PoSCTL formula, which capture the inherent uncertainty and temporal properties of many real-world optimization problems. In fact, COO is an extension of traditional multi-objective optimization, which deals with fuzzy objective functions. In traditional multi-objective optimization, the goal is to find a set of non-dominated solutions that provide the best compromise between conflicting objectives. In COO, the objective functions are fuzzy and with temporal properties, and the goal is to find the optimal strategy that provide the maximal possibility to objective. What’s more, we will show that memoryless strategies are sufficient for solving optimal control problems of complex objectives that are characterized by PoSCTL.
In brief, the main contributions of this article are: A proposed fuzzy temporal logic is used to characterize a complex objective in a fuzzy environments such that the objective can express complex temporal properties between sub-objectives. We prove that memoryless strategies are sufficient for solving optimal control problems of complex objectives. A decision system called PDS is proposed, which can be used for modeling, formal verification, and decision-making of fuzzy systems with nondeterministic choices. We have built a framework for the application of PDS to mobile robots. This framework solves the problem of optimal control of a complex objective in a fuzzy environment.
Complex objective optimization in fuzzy environments is a field of optimization that deals with problems involving multiple, conflicting objectives in the presence of uncertainty or imprecision. The presentation of complex objectives is useful for the portrayal of temporal relationships between multiple objectives. This technique involve a search for a set of optimal strategy solutions that are optimal with respect to the multiple objectives using mathematical optimization techniques, and can be applied to a wide range of practical problems in various domains.
Related work
In fact, a complex objective is a variant of a multi-objective, except that both temporal and logical relationships are considered between the sub-objectives. Here are some studies on multi-objective fuzzy optimal control. In [6] and [7], an approach for the multi-objective control of sampled-data systems that can be modeled as FDES was proposed to simulate a mobile robot. In [8], an embedded fuzzy controller for a mobile robot is developed to make the mobile robot follow the target trajectory satisfactorily. However, there is only one functional relationship (optimal objective function) or only a sequence of implementation between sub-objectives in these researches. Such relations are not sufficient to characterize complex objective relations such as temporal relations: next, future, until, always and infinitely often, etc., and logical relations: or, and, not and implied, etc. Although there has been much research for robots path planning in Markov decision processes (MDPs) [25–27], they are both probability-based studies for robot path planning rather than depending on fuzzy theory.
Structure of this paper
The remainder of this paper is arranged as follows. Section 2 gives the notation about possibility theory and formal framework to possibilistic decision systems. In section 3, we study complex objectives optimal control in PDSs. In section 4, we verify the theoretical development through an example of robots moving. This paper ends with conclusions.
Notation and formal framework to possibilistic decision systems
In this section, basic knowledge about the possibility theory introduced in [17] and the formal framework to possibilistic decision system are given.
First, all parameters and variables as well as symbols of this paper are listed by a nomenclature to help the reader follow the paper conveniently, see Table 1.
Notations in this paper
Notations in this paper
Possibility theory is an uncertainty theory devoted to the handling of incomplete information and provides an alternative to probability theory, which use a pair of dual set-functions, i.e., possibility and necessity measures, instead of only one measure in probability theory [28, 29].
In this paper, for simplicity, we assume that the universe of discourse U is a nonempty set, and assume that all subsets are measurable. A possibility measure is a function Π from the powerset 2 U to [0, 1] such that:
1) Π (∅) =0, 2) Π (U) =1, 3) Π (⋃ E i ) = ⋁ Π (E i ) for any subset family {E i } of the universe set U, where we use ⋁i∈Ia i to denote the supremum or the least upper bound of the family of real numbers {a i } i∈I; dually, we use ⋀i∈Ia i to denote the infimum or the largest lower bound of the family of real numbers {a i } i∈I. If Π only satisfies the conditions 1) and 3), then we call Π a generalized possibility measure [17].
Possibilistic decision systems
S is a countable set of fuzzy states; Act is a set of actions; I : S → [0, 1] is the initial distribution such that I (s) >0 for some state s; AP is a set of atomic propositions; L : S × AP → [0, 1] is a possibilistic labeling function, which can be viewed as function mapping a state s to the fuzzy set of atomic propositions, i.e., L (s, a) denotes the possibility or truth value of atomic proposition a that is supposed to hold in s.
2) Let Act (s) = {α ∈ Act | ⋁ t∈S
3) In the following, S is determined by a family of imprecise sensors measurements. Act is a set of control actions. Atomic propositions AP represent a family of imprecise sensors. The labeling function L can be viewed as a fuzzifier to map a sensor data to the [0, 1] fuzzy set.
4) The most related model is labeled Markov decision processes (MDPs) [19]. The main difference between an MDP and a PDS is that the former is based on probability measures and the latter based on possibility (fuzzy) measures. Furthermore, the labeling function in PDSs is also fuzzy rather than crisp in MDPs. Any GPKS is a PDS in which for any state, action set is just a singleton set. Vice versa, any PDS with this property is a GPKS. The action names are irrelevant and are omitted in GPKSs. Thus, GPKSs are thus a proper subset of PDSs
Strategy
A strategy (c.f.[19]) for a PDS
Computational tree logic is well-suited for characterizing properties with branching time, and we utilize this logic to characterize a system objective. Thus, a complex objective is essentially a quantitative property of a system. We introduce the possibilistic strategy computation tree logic (PoSCTL), which has the same syntax as GPoCTL [17], but differs in semantics as it is related to the system’s strategy.
For path formulas, their semantics are defined recursively in a
For “eventually” (or called “future”) formula that expresses reachable property, its semantics is defined as
For “until” formula that expresses constraint reachable property, its semantics is defined as
In fact, the definition of ◊Φ is defined as true ⊔ Φ, where true = 1 ∈ r. The definition of
In this section, for example, for a path formula φ = Φ1 ⊔ Φ2, Φ1 and Φ2 are called sub-objectives that connected by temporal operators, and φ is thus called a complex objective (or objective, in short). Furthermore, each subformula in a state formula Φ connected by a logical operator is called a subgoal, and multiple objectives are combined by logic and temporal operators to form a complex objective φ. Assuming that the possibilistic decision system can take measurements of imprecise sensors per sampling instant in order to determine current fuzzy state sets.
The main idea is to find the optimal strategy for such objective even more complex one in the following.
Figure 1 shows the process of complex objectives optimal control in PDSs. First, an objective is a combination of multiple sub-objectives via logical operators, temporal operators. Next, this objective is formalized as a PoSCTL formula and input into the PDS. Then, a family of imprecise sensors sampling data, and the data is mapped to the [0, 1] interval by the fuzzifier, which is used to determine the current state of the system and the atomic proposition of the states. Finally, the PDS calculates and outputs the possibility of the objective and the corresponding optimal memoryless strategy to control the mobile robots or vehicles.

Process of complex objectives optimal control.
Formally, consider a complex objective φ, and sub-objectives Ψ1, Ψ2, i.e., then the complex objectives φ = ○ Ψ1, ◊Ψ1 and Ψ1 ⊔ Ψ2 optimal control problem amounts to determining maximal possibility
In fact, the Po operator can be viewed as an objective function, and we just need to compute the maximum possible value of the objective function. The set of strategies corresponding to the maximum possibility is the optimal solution set and is the key to our optimal control. However, strategies are infinite, and we need to find subclasses of strategies for optimal control, such as memoryless strategies.
The strategy corresponding to the maximum possibility of an objective φ is the optimal strategy, and by definition of maximal possibility, the optimal strategy is the global optimal strategy. There are three ways to construct an objective φ, i.e., φ = ○ Ψ (next), φ = ◊ Φ (future), φ = Ψ1 ⊔ Ψ2 (until).
The proof is placed in
The theorem allows for optimal memoryless control of next temporal operator. The optimal strategy is the corresponding action when Equation (16) takes the maximum value.
The proof is placed in
The proof is placed in
According to Propositions 3.2 and 3.2 , this theorem can be easily proved and left to the reader.
Since the number of memoryless strategy is finite (at most |Act||S|), then we can use strategy iterations [30] algorithms to find out an optimal memoryless strategy, which often applied in MDPs. Therefore the correctness of Algorithm 1 is obvious and is omitted here. Main steps are as follows:
Step 1: Start with an arbitrary memoryless strategy
Step 2: Evaluate the possibilities by using current memoryless strategy;
Step 3: Improve the strategy for all states by Equation (17);
Step 4: Repeat steps 2 and 3 until strategy convergence.
Since ◊Ψ = true ⊔ Ψ, as a supplement of Theorem 3.2, let
The proof is similar the GPoCTL model checking over GPKSs [17] and is omitted here.
Fuzzy systems have a wide range of applications, including the field of vehicles. References such as [31, 32] discuss how fuzzy systems can contribute to more efficient driving by considering various factors such as load, driving style, road conditions, and vehicle conditions. For instance, [33] explores a hybrid-electric autonomous vehicle under uncertain and ambiguous road environments and driver behavior. In [34], a new optimized fuzzy control system is proposed to address the limitations of existing vehicle motion simulators.
Fuzzy uncertainty is often encountered in linguistic interaction with automated driving systems. For instance, when drivers issue commands such as “increase the speed a little” or “go slightly to the left,” the terms “a little” and “slightly” are vague and must be dealt with accordingly by the automated driving system. Additionally, uncertainty arises from the imprecision of sensors placed on vehicles, which introduce measurement uncertainty. These uncertainties are classified as fuzzy environments and are detailed in [6]. The accuracy and range of sensors, their location and orientation, and the method of evaluating sensor measurements all contribute to measurement uncertainty. For example, the detection of an obstacle by proximity sensors and the distance estimation depend on the accuracy and range of sensors, as well as their number, location, and orientation. Reliable measurements can be achieved through an adequate number of sensors rather than just one.

a PDS model of mobile robots.
We propose a framework for optimal control of complex objectives of mobile robots in a fuzzy environment, based on PDSs. This framework simulates car driving by taking into account multiple objectives and the temporal and logical relationships between them. The growing demand for driving has led to an increase in the complexity of objective control problems, necessitating more sophisticated techniques to solve them. We illustrate this framework with an example of mobile robots studied in [8, 36]. In this example, a robot travels in a 10 × 10 grid to simulate unmanned car driving in a city, with each state representing a set of imprecise sensors placed at intersections to detect traffic congestion and air quality (indicated by the red and blue contours, respectively). Obstacles are represented by thick black lines, and each edge represents a two-way road (see partial enlargement in Fig. 2). The robot attempts to move from its current position s1 to a desired destination s100 and nearby states. Due to the imprecision of the sensors, the processor installed on the robot converts the received sensor data into fuzzy values, modeled as the truth value of an atomic proposition on states, to make decisions.
We will focus on the following four complex objectives in our simulation. It should be noted that the objectives chosen are relatively simple to illustrate the proposed approach.
Then these amount to determining maximal possibilities Pomax (s1 ⊨ φ i ) for 1 ≤ i ≤ 4 and the corresponding optimal strategies.

Optimal strategies for complex objectives 1-3. (a) shows the optimal strategy for objective 1 that considers only the reachability of the destination without considering the obstacles. (b) shows the optimal paths for objective 2 that considers the obstacles. (c) shows the contour of Jam with a threshold of 0.61 and an optimal strategy for objective 3 that considers both the obstacles and the traffic jam.
We wrote a program in MATLAB (version 2021b) 1 to implement complex objectives optimal control algorithms.
There are 100 states spread over a 10 × 10 grid, i.e.,
According to a certain random distribution, the states are labeled Obstacle, Jam, Fresh and Destination and assigned numbers to represent the possibility degree of the atomic propositions in the states. In other words, the numbers represent the sensors data of the current intersection. Then, let
Each state has eight actions, i.e., North, Northeast, East, Southeast, South, Southwest, West, Northwest, and each action has only one successor state in this model. (The outermost state may have two or three actions). The robots can make eight actions that can change its direction, i.e.,
A robot is at the state s1 for a certain moment, i.e., I (s1) =1 and I (s) =0 for s ≠ s1, and let state s100 and its nearby states be target states, i.e.,
Since it is beyond the scope of this paper to obtain the fuzzy transition matrixes and the transition matrixes can be obtained by referring to [37, 38], then we could choose the appropriate transition function between states based above experience. If the robot is moving towards the directions of Destination, we set a higher possibilities and vice versa. Then we can obtain the eight matrixes, i.e.,
Then we get the PDS model
We performed 30 simulations of above four objectives and a more intuitive simulation of four complex objectives was visualized by using the contour command in MATLAB. The 30 simulations took 142s.
By Fig. 3, it can be seen that the effect of multiple objectives on the optimal strategy.
Result 1: If only arriving at the destination is considered (objective 1). It can obtain the optimal strategy is
Result 2: When the obstacles are considered, it can obtain the optimal strategy is

Optimal strategies for objective 4. (a) shows the optimal strategy in the obstacle map. (b) shows the optimal strategy in the Jam contour map with threshold of 0.61. (c) shows the optimal strategy in the Fresh contour map with threshold of 0.65.

Optimal paths of avoiding traffic jams under three traffic jam contour: 0.82, 0.61, 0.38.
Result 3: Considering both obstacles and traffic jams in the complex objective 3, it can be seen that the strategy is
Result 4: For complex objective 4, it can be seen that the PDS can plan an optimal path by considering obstacles, traffic jam, and air quality simultaneously, and this optimal path is the global optimal solution due to Theorem 3.2. For convenience, we have visualised the three considered factors separately. Figure 4(a), (b) and (c) show the optimal strategy of complex objective 4 in the Obstacle map, Jam map and Fresh map, respectively.
Firstly, we will consider the effect of different thresholds on the optimal path of the objective, as illustrated in Fig. 5. Different thresholds for traffic congestion generate different optimal paths. Higher thresholds for Jam lead to more congested roads, with an average jam value of less than 0.82. Conversely, lower thresholds lead to smoother roads, with an average jam value of less than 0.38 for the entire path. Generally, higher thresholds result in a shorter distance, while lower thresholds result in a longer distance, as more congested roads need to be avoided. Overall, we consider these results sufficient for confirming the basic theoretical development.
Secondly, in practical engineering applications, the state accuracy λ is a critical parameter for control performance. The number of states increases exponentially with accuracy, as seen in the example of a control system for launching rockets. In this case, the pose state of the rocket is determined by three direction sensors (three atomic propositions) x, y, and z with an accuracy of 10-3, i.e., AP = x, y, z, λ = 3. A pose state s of the rocket may be determined by the three variable readings of direction sensors, such as [x = 0.425, y = 0.377, z = 0.856]. A modeler can generate approximately one billion states, which makes optimization in such a large-scale model highly resource-intensive and time-consuming. Therefore, state reduction technology based on intelligent algorithms is urgently needed.
In fact, the optimal strategy corresponding to the maximum possibility of the complex objective is a Pareto optimal solution introduced in [39]. For this simulation, we can get the maximal possibilities of above four complex objectives:
The existing work is based on our previous fuzzy model, Generalized Possibility Kripke Structures (GPKS), as described in [17]. In comparison, the proposed PDP model is a more general model that includes the previous models as special cases. Mathematically, any GPKS can be seen as a PDS where the action set for each state is a singleton set. Conversely, any PDS with this property is also a GPKS. The action names in GPKSs are irrelevant and are omitted. Therefore, GPKSs are a proper subset of PDSs. Furthermore, the proposed PDP model offers several new innovations and decision-making advantages that were not available in our previous models.
1) The PDS model as an open system increases the nondeterminism of the choices and allows the model to interact with the environment.
2) Due to the nondeterministic choices, different choices can lead to different models, and the degree to which these models satisfy a given property may vary. In this work, we investigate the optimal control of complex objectives, which enables us to determine not only the maximum possibility that the system will reach these objectives when considering all possible strategies, but also the corresponding optimal strategy for decision-making.
In the robot simulation, the mobile robot has nondeterministic choices at each intersection, making it suitable for modeling using PDS. On the other hand, the GPKS model does not perform well in this situation, as it significantly reduces the robot’s ability to make decisions. This is because GPKS models have no action set, or can be seen as a PDS with a singleton action set. This highlights the advantages of using PDS for modeling in such scenarios.
Conclusions
In this study, we investigate the challenges of multi-objective optimization in fuzzy environments, which is an important field of research. We use possibilistic strategy computation tree logic (PoSCTL) to formalize a complex objective that can be described by temporal properties and logical relations. Then, we propose a fuzzy decision system called possibilistic decision systems (PDS) to model systems with nondeterminism in fuzzy uncertain environments and perform an optimization process for the complex objective. Furthermore, we prove mathematically that memoryless strategies are sufficient to solve the optimal control problem for complex objectives, which is a crucial result for decision-making using PDSs. We propose a strategy iteration algorithm that can obtain the optimal memoryless decision of a complex objective, and it is a polynomial algorithm for the size of the model. Finally, we conduct a simulation study of mobile robots to demonstrate the applicability of complex objectives in a fuzzy environment. The simulation results show that the memoryless strategy implemented by the mobile robot follows complex objectives with the maximum possibility in an environment with different fuzzy uncertain factors. The strategy allows each sub-objective to be satisfied with a certain possibility degree and ensures that the complex objective is optimally achieved. Our study provides a general framework for complex objective optimization in fuzzy environments and highlights the effectiveness of our proposed methods.
There is still a lot of work to be done here. The optimization of complex objectives characterised by linear temporal logic has yet to be explored. Furthermore, although the time complexity of algorithms for strategy iteration is polynomial, they can still be time-consuming for systems with large-scale states. Further state reduction methods need to be developed.
Footnotes
Acknowledgment
This work was partially supported by National Natural Science Foundation of China (Grant Nos: 11671244, 12071271) and the Fundamental Research Funds For the Central Universities (Grant No: 2020CSLY016).
A. The proof of Theorem 1
Conversely, there exists an action α ∈ Act (s) and a state s1 ∈ S such that
We can construct a strategy
B. The proof of Proposition 1
Conversely, there exists an action α ∈ Act and a strategy
This completes the proof.
C. The proof of Proposition 2
Under the condition of ensuring reachability of R via U in the induced GPKS
•If s ∈ B, then Pomax (s ⊨ C ⊔ B) =1;
•If s ⊭ ∃ C ⊔ B, then Pomax (s ⊨ C ⊔ B) =0;
•If s ∈ C ∧ s ∉ B and s ⊨ ∃ C ⊔ B, then
Since the vector (Pomax (s ⊨ C ⊔ B)) s∈S also solves the above equation, then this completes the proof of Equation (18).
Second, we show that the proposition holds for general fuzzy states C and B. Let
By the fuzzy set decomposition theorem [10], then
Since C
λ
i
, B
λ
i
are crisp states, by Equation (18), there exists a memoryless strategy
Since k is finite, then there always exists
We choose
Obviously,
Conversely,
This completes the proof.
MATLAB program ran on a PC with an Intel i5-10400F 2.90-GHz CPU and equipped with 16 GB-RAM and 64-bit Windows 10 systems.
