Abstract
The optimisation of scientific workflows in cloud environments presents considerable challenges, primarily due to the inherent trade-offs between makespan and energy consumption. To address this, we propose Multi-Objective Cooperative Multi-Fitness (MOCMF), a novel mechanism that significantly enhances multi-objective evolutionary algorithms through a unique cooperative evaluation and recoding strategy. Diverging from existing multi-decoder approaches, MOCMF’s core innovation lies in its collaborative framework: heuristic decoders work in tandem to support a baseline decoding function, providing expert solutions that guide the Lamarckian recoding of chromosomes. Furthermore, MOCMF extends this cooperative evaluation to a multi-objective setting, where each heuristic decoder focuses on optimising a specific objective, leading to the generation of multiple distinct solutions per chromosome. Experimental results on data-intensive workflow benchmarks show that MOCMF improves the average Hypervolume by 32% and Inverted Generational Distance by 42% compared to a standard NSGA-II implementation, and by 7% and 6% respectively compared to its mono-objective cooperative variants. The proposed mechanism is also generalisable and potentially applicable to other multi-objective problems beyond workflow scheduling.
Keywords
Introduction
During the last decade, Cloud computing has become a definitive model for executing compute-intensive distributed applications, attributed to its intrinsic flexibility, scalability, and pay-as-you-go approach. Among its service models, Infrastructure as a Service (IaaS) provides users with the means to dynamically allocate virtual machines (VMs) tailored to their computational needs. This enables researchers to access vast computational resources efficiently and cost-effectively, making it particularly attractive for the execution of scientific workflows (sWF). These workflows, often represented as Directed Acyclic Graphs (DAGs), encompass a wide range of complex interdependent tasks, with data exchange requirements varying from kilobytes to terabytes. It is acknowledged that the scheduling of these workflows within cloud environments poses considerable challenges due to the intricate nature of balancing multiple Quality of Service (QoS) objectives. This problem is recognised as being NP-complete.1,2 Several surveys have analysed the scheduling of scientific workflows in cloud environments, highlighting its complexity, computational constraints, and relevance for both academia and industry. Comprehensive reviews by Wu et al., 3 Rodriguez and Buyya, 4 and Yassir et al. 5 provide taxonomies and discuss the evolution of algorithms and evaluation models used to tackle this problem in IaaS contexts.
In the context of scientific workflow scheduling, the makespan is a primary quality of service (QoS) metric. The makespan denotes the time required to complete all tasks in a workflow. It is imperative for researchers to minimise the makespan in order to accelerate the iterative process of experimentation. To accurately capture this objective, the most commonly adopted model for estimating execution time is the Network—Computing (NC) model, which considers computation and data transmission across the network, but typically ignores the impact of local disk I/O operations. While this simplification may suffice for compute-bound scenarios, it introduces significant inaccuracies when applied to data-intensive workflows, where disk access can become a major bottleneck. To address this limitation, the Disk—Network—Computing (DNC) model was introduced in Barredo and Puente. 6 This model extends the NC formulation by incorporating disk read/write times, leading to more accurate and realistic makespan estimations in cloud environments. In this study, we adopt the DNC model to ensure consistency with real-world execution behaviours observed in data-centric scientific applications.
Conversely, the exponential growth in energy consumption within cloud infrastructures, exacerbated by the increasing prevalence of artificial intelligence and big data applications, has made energy efficiency another critical objective. By 2025, data centres are expected to account for 4.5% of global energy consumption and contribute to 3.5% of global carbon emissions.7,8 Consequently, optimising workflows entails addressing the inherent trade-offs between minimising makespan and reducing energy consumption to achieve sustainable and efficient scheduling solutions. Various models have been proposed in the literature to estimate energy consumption in workflow scheduling. Simplified approaches compute energy solely as a function of processing time, 9 while more refined models distinguish between active and passive consumption based on task execution and resource idleness.10–12 In this work, we adopt the latter class of models, as they offer a good compromise between expressiveness and scalability. Although even more sophisticated strategies exist—such as DVFS-based models that account for voltage and frequency scaling13,14—these often require low-level hardware access and are not supported by most public cloud infrastructures. Furthermore, foundational studies have shown that CPU usage is the primary contributor to energy consumption in data centres, 15 validating the CPU-centric abstraction we employ in our evaluation framework.
Approximate optimisation techniques, particularly metaheuristics, have gained prominence in addressing the workflow scheduling problem. Heuristic-based methods, such as the Heterogeneous Earliest Finish Time (HEFT) algorithm 16 and its energy-aware variant Green-HEFT, 11 have demonstrated success in optimising single objectives like makespan or energy. Additionally, multi-objective approaches, such as Gravitational Search Algorithms, 17 Ant Colony Optimisation, 18 and hybrid strategies combining heuristics with metaheuristics,19,20 offer mechanisms to balance conflicting objectives. Furthermore, hyper-heuristic approaches, such as the one proposed by Kabirzadeh et al., 21 combine multiple algorithms (i.e. genetic algorithm (GA), particle swarm optimisation (PSO), ant colony optimisation (ACO), and simulated annealing) using a test-and-select strategy to optimise energy consumption, runtime, network usage, and total cost.
Beyond traditional heuristics like HEFT or metaheuristics such as Particle Swarm Optimization (PSO), a wide range of nature-inspired algorithms have been proposed for complex scheduling and optimisation problems. Among these, Harmony Search, 22 Simulated Annealing, 23 and Water Drop Algorithms24,25 have shown remarkable performance in various engineering domains due to their exploration—exploitation trade-offs and adaptive mechanisms. Other recent proposals explore hybridisation techniques such as memetic search for energy efficiency, 26 ensemble-based rule selection in multi-objective scheduling, 27 or adaptive swarm intelligence models like enhanced grey wolf optimisation. 28 These approaches, while powerful, typically evaluate candidate solutions through a single fitness pathway.
Scientific workflow scheduling has been widely addressed using multi-objective optimisation techniques. Several approaches leverage Pareto-based selection and problem-specific heuristics to balance conflicting objectives such as makespan, cost, and energy consumption. Representative works include list-based scheduling methods,29,30 metaheuristic-based techniques for energy-aware planning, 31 and reinforcement learning-driven strategies for dynamic workflows.32,33 Specifically, energy-aware scheduling has become a prominent research focus, as discussed in recent surveys by Verma et al. 34 and Khattar et al., 35 which review heuristic and dynamic power management strategies designed to reduce consumption in cloud-based workflows. These methods provide strong baselines but often rely on a single evaluation strategy or heuristic per solution. In this context, our proposal explores a cooperative approach where multiple expert decoders collaborate to guide the search in multi-objective settings.
A number of evolutionary algorithms have been described in the literature as utilising multi-fitness strategies to enhance solution quality in comparison with that achievable by conventional single-objective methods. These strategies can be categorised into two distinct approaches: the employment of distinct fitness functions at different stages of the algorithm, as observed in Wu et al., 36 or the assessment of diverse aspects of the same solution, as seen in multi-fitness learning approaches, as discussed in Yates et al. 37 However, despite the success of such methods, to the best of our knowledge, no approach has yet integrated multiple decoding functions to optimise the same fitness objective simultaneously.
In order to address these gaps, Cooperative Multi-Fitness (CMF) evolutionary algorithms have emerged as a promising approach. CMF introduces a novel mechanism in which multiple decoding functions collaborate to guide the search process. The fundamental premise of the CMF approach is to leverage the capabilities of existing heuristics to support and enhance the quality of the solution schedules generated by a basic decoding function. This approach ensures the primary function directing the search process remains unbiased with respect to the explored search space, while the supporting heuristic functions contribute by providing alternative solutions that may be of higher quality for the specific optimisation objective. The cooperative effect is reflected in the fact that the support functions recode the solution so that, in subsequent evaluations, the main basic function directly obtains the same improved solution. The effectiveness of CMF in improving scheduling quality for single objectives, such as makespan and total energy, has been demonstrated in previous works (Barredo and Puente 38 and Barredo and Puente 39 ). These algorithms utilise this polymorphic decoding systems and heuristic support functions to explore diverse regions of the solution space, thereby achieving superior results in comparison to standard mono-objective GA when minimising makespan or total energy respectively.
However, these previous efforts focus exclusively on mono-objective scenarios. The integration of multiple CMF schemes in a multi-objective optimisation context—where each heuristic may lead to distinct, potentially conflicting, high-quality solutions—remains unexplored. This raises an important research question: can the CMF paradigm be extended to support multiple objectives simultaneously, without undermining the structural clarity and diversity required by Pareto-based evolutionary algorithms?
This paper proposes a novel decoding mechanism for individuals in the context of multi-objective evolutionary algorithms, applied to scientific workflow scheduling in cloud environments. The approach builds on the foundations established in previous studies.38,39 The proposed approach integrates the CMF paradigm into the NSGA-II 40 algorithm to simultaneously optimise both makespan and total energy. Each chromosome is decoded using multiple heuristic strategies—one per objective—resulting in multiple candidate solutions. These are evaluated independently, and their corresponding scheduling structures are used to recode the genotype using a Lamarckian mechanism. Thus, the chromosome becomes capable of regenerating all heuristic-improved solutions without relying on the heuristics in subsequent iterations.
This approach breaks the traditional one-to-one mapping between genotype and phenotype by enabling one-to-many decoding. The offspring pool is effectively multiplied, improving diversity and enriching the approximation of the Pareto front. Moreover, since solutions guided by each heuristic tend to dominate in their respective objective, the cooperative evaluation naturally supports the generation of well-distributed Pareto-optimal solutions.
Following the concepts described in Barredo and Puente,
39
this research put emphasis on exploring the following concepts:
Extending the Cooperative Multi-Fitness approach from mono-objective to multi-objective optimisation in a coherent evolutionary framework. Introducing cooperation between two specialised mono-objective CMF subsystems—one optimising energy and the other makespan—within a unified NSGA-II-based algorithm. Analysing how each specialised decoder behaves when evaluated on objectives other than its native target. Defining a new genotype—phenotype decoding model for population-based evolutionary algorithms that accommodates multiple phenotypes per individual.
The remainder of this paper is organised as follows: Section 2 establish the context of the paper as a follow up to previous research. Section 3 describes the novel concept of Multi-Objective Cooperative Multi-Fitness. Section 4 defines the scientific workflow scheduling problem and presents the extended Disk-Network-Computation (DNC) computation and energy model. Section 5 describes the structure of the NSGA-II-based algorithm and the integration of the CMF paradigm. Section 6 describes the inner works of this novel method. Section 7 reports the experimental evaluation and analyses the trade-offs between makespan and energy. Finally, Section 8 summarises the conclusions and future directions.
Background: Cooperative multi-fitness strategy
The Cooperative Multi-Fitness (CMF) strategy, originally proposed in Barredo and Puente,6,38 is designed to enhance the effectiveness of standard genetic algorithms (GAs) in solving complex workflow scheduling problems. Unlike hybrid approaches that define multiple co-evolving populations or competitive decoding schemes, CMF relies on a single population evaluated through multiple support heuristics working cooperatively.
CMF builds upon a conventional GA with a direct decoding flow: chromosome
This direct decoding ensures that the search space remains unbiased, allowing optimal solutions to remain reachable. However, when applied to large-scale workflows involving thousands of tasks and heterogeneous VMs, the basic GA lacks sufficient search efficiency to generate high-quality schedules—whether the objective is makespan or energy consumption.
To address this, CMF introduces a set of support heuristics specialized in optimizing a particular objective (e.g., makespan or energy). These heuristics can produce more efficient schedules, albeit often converging to local optima due to their inherent bias. The novel aspect of CMF is its use of Lamarckism: when a support heuristic discovers a superior schedule, it rewrites the chromosome to reflect this plan (task order and VM assignment). This enables the standard decoder to regenerate the improved solution, potentially improving the GA’s effectiveness without altering its fundamental structure.
Multi-objective cooperative multi-fitness strategy (MOCMF)
This work extends the CMF framework into the multi-objective domain through the introduction of the Multi-Objective Cooperative Multi-Fitness strategy. MOCMF integrates cooperative support functions into a multi-objective evolutionary algorithm, such as NSGA-II, to simultaneously optimise conflicting objectives, including makespan and energy consumption.
The key innovation of MOCMF lies in assigning a tailored package of support heuristics to each objective, allowing the evaluation of a chromosome to proceed independently for each optimisation goal. When a heuristic group discovers an improved schedule, it generates a recoded chromosome that encodes the relevant scheduling information. This enables the standard decoder (
This cooperative multi-fitness mechanism is conceptually independent from the underlying multi-objective evolutionary algorithm. It operates exclusively at the genotype-to-phenotype mapping level, incorporating objective-specific heuristics and Lamarckian recoding. As such, MOCMF can be regarded as an orthogonal component that integrates with existing MOEAs to enrich the evaluation process without altering their evolutionary dynamics.
Moreover, by leveraging the JMetal framework, 41 this approach remains compatible with multiple MOEAs such as NSGA-II, 40 SPEA2, 42 and IBEA, 43 ensuring scalability and comparability.
While this strategy increases computational cost, the total number of individuals generated per generation is adjusted accordingly. This adjustment is intended to preserve the convergence behaviour of the algorithm and support a fair comparison with configurations that do not use MOCMF—such as
Workflow scheduling problem definition
Although the following formulation focuses on scientific workflow scheduling in cloud environments, this problem is used as a representative and practically relevant instance of a broader class of permutation-based, multi-objective scheduling problems under precedence and resource constraints. The proposed Cooperative Multi-Fitness mechanism is intended to be general and applicable beyond this domain, provided that suitable objective-specific heuristics are available.
Scientific workflows are typically represented as directed acyclic graphs (DAGs). The DAG, denoted by
The graph includes two dummy nodes:
The resource model is based on a cloud service provider offering an Infrastructure-as-a-Service (IaaS) platform with a mix of virtual machines (VMs). Let
Given a workflow
The objective is twofold: 1. To minimise the makespan (
Mathematically, these objectives can be expressed as:
The makespan of a workflow schedule is calculated using the DNC model, as described by Barredo and Puente. 6 Unlike the traditional Network-Computation (NC) model,11,44–46 which ignores disk access times and only considers network communications and CPU execution, the DNC model provides a more complete and accurate evaluation by including both disk and network transfer phases. This enhancement is particularly relevant in cloud environments, where I/O operations may significantly impact total execution time, especially in data- or storage-intensive workflows.
In the DNC model, the estimated start time of a task depends not only on the availability of computational resources but also on the location of its predecessor tasks. If predecessor data is available locally, only local disk reading times are incurred. Otherwise, remote disk access combined with network communication is required. Additionally, the DNC model includes local disk write times after task execution to account for output dataset storage.
To better illustrate the impact of the DNC model, we consider a simple example workflow shown in Figure 1, consisting of three computational tasks and a dummy initial task to define a single entry point. Each computational task requires 1 time unit (TU) and produces 1 data unit (DU) of output.

Example workflow with three computational tasks and one fictitious entry task to ensure a single entry and exit point.
Figure 2 compares the resulting schedule under two execution models: the standard Network-Computation (NC) model (a), and the extended Disk-Network-Computation (DNC) model (b). The scenario involves two hosts: Host A, with a disk transfer speed of 1 DU/TU, and Host B, with 0.5 DU/TU. Network transmission speed is assumed to be 1 DU/TU.

Execution timelines under the NC (a) and DNC (b) evaluation models for the workflow in Figure 1. The DNC model incorporates local/remote disk read and write operations before and after computation.
In the NC model, only network transfer times are considered, and the successor task starts as soon as data from the last completed predecessor is received. This leads to a makespan of 3 TU. In contrast, the DNC model accounts for all disk read and write operations, which must be completed before a successor task can begin. The slower disk on Host B increases data transfer time, and tasks may be forced to wait for local or remote disk operations. As a result, the makespan increases to 8 TU, demonstrating the critical role of disk throughput in realistic cloud environments.
The computational time spent on a specific task
Tasks may require reading input data from parent tasks, which can be located either on the same host or a different host. The data transfer time depends on whether the parent and child tasks are executed on the same or different hosts.
If the parent and child tasks are on the same host, the data transfer time is calculated as:
If the parent and child tasks are on different hosts, the data transfer time involves three components: the disk speed of the parent host (
The estimated finish time ( The finish times of all parent tasks. The data transfer times from all parent tasks to the current task. The computational time of the task. The data write times of the output files.
The formula for
Where:
The total time spent reading data from parent tasks can be defined as the summation of all data transfers from task predecessors, where
The total time spent writing information out can be defined as the summation of all data needed by successors divided by the speed of the current host. Therefore,
The EST for a task ((i,k)) can be mathematically defined as:
Provided,
The total energy consumption in a cloud environment can be broken down into two primary components: active energy and passive energy. Active energy refers to the energy required to execute tasks, which involves the CPU and disk operations. These activities generate specific energy consumption during task execution. On the other hand, passive energy is the energy needed to keep servers operational, even when they are waiting between tasks. Notably, all servers are powered on and off simultaneously, meaning that the makespan (the total time from the start of the first task to the completion of the last task) significantly impacts passive energy consumption. As the execution duration increases, so does the cumulative time the servers remain running.
Passive energy can be mathematically defined as the summation of the product between the passive energy consumption rate of each server and the makespan (or estimated finish time of the last task). This relationship is expressed in Equation 10:
For active energy, the energy consumed by a specific task executed on a specific host can be defined as the product of the active energy consumption rate of the virtual machine
The total active energy for the entire workflow is then obtained by summing the active energy consumed by all tasks across all hosts, as represented in Equation 12:
Finally, the total energy consumption for the workflow is the sum of both active and passive energy components:
Evolutionary algorithms span a wide range of strategies, from deterministic single-solution methods 47 to population-based stochastic approaches such as the genetic algorithm NSGA-II. 40 In this work, we adopt the latter framework, integrating our cooperative multi-fitness decoding mechanism to enhance performance in multi-objective optimisation. This section introduces the standard NSGA-II framework, adapted to the workflow scheduling problem. The algorithm serves as a baseline evolutionary structure, within which CMF is integrated as a genotype—phenotype mapping and cooperative evaluation strategy. A detailed illustrative example is provided later in Section 6, following the complete description of the multi-objective formulation.
The presented optimization metaheuristic is a multiobjective genetic algorithm builds upon previous work detailed in Barredo and Puente. 6 In earlier approaches, the concept of using cooperative multiple fitness functions was introduced to address makespan optimisation. 38 Subsequently, this approach was extended to tackle energy consumption. 39 In this work, we aim to address the simultaneous optimisation of both objectives, thereby moving into a multi-objective context. As such, it becomes natural to extend the single-objective genetic algorithm to the standard NSGA-II, as it merely involves replacing our selection and replacement operators. The challenge now lies in how to address the CMF approach within this multi-objective context. The genetic algorithm operates within a multi-objective context and employs the NSGA-II framework.
The algorithm is defined by several key parameters:
Chromosome Encoding. Each chromosome is represented as a permutation of pairs
As an illustrative example, consider a workflow with four tasks (
Crossover Operator. The crossover is based on the works of Zhu et al., 46 where using the CrossoverOrder approach guarantees the preservation of the topological order. The operator begins by randomly selecting a crossover point that partitions each parent into two subsequences. The initial segment of each offspring is formed by copying the first subsequence from one parent. The remaining positions are then completed by inserting the missing tasks from the other parent, preserving their original relative order. Because the relative precedence between any pair of tasks is guaranteed to be maintained in at least one parent, the resulting offspring respect all dependency constraints and preserve a valid topological order.
Mutation Operator. The mutation operator first selects a task
Initial population. The initial population is generated randomly, with each chromosome constructed to follow topological order while assigning VMs to tasks at random. The size of the population is determined by
New Decoding Schema. The primary fitness function, denoted as
The proposed innovation lies in the use of different lists of supporting fitness functions, one for each different objective, denoted by
Multi-objective cooperative multi-fitness decoding and evaluation function
Initially introduced for single-objective optimisation in Barredo and Puente,
38
each chromosome is evaluated using multiple functions, i.e. the standard decoding function
In addition to the solution (
However, in a multi-objective scenario, this approach is insufficient because a solution that minimises one objective may not necessarily minimise another. To address this, it is necessary to duplicate every solution and apply two distinct sets of support fitness functions,
To clarify how CMF mechanism operates within a multi-objective evolutionary algorithm, we present the following simplified example.
Consider a chromosome that encodes the execution order and resource assignment for three tasks:
This chromosome is evaluated using the Heuristics for makespan minimisation ( Heuristics for total energy minimisation (
Each heuristic applies its own decoding strategy and produces a distinct schedule, evaluated with respect to both objectives. The results of these evaluations are summarised in Table 1. While the values shown are hypothetical, they are designed to reflect meaningful ranking relationships among the solutions.
Illustrative example of the cooperative multi-fitness mechanism in a multi-objective context: optimizing makespan (mkp) and total energy (TE).
Illustrative example of the cooperative multi-fitness mechanism in a multi-objective context: optimizing makespan (mkp) and total energy (TE).
In this example, all heuristics evaluate the same chromosome

Resulting chromosome after Lamarckian recoding using the schedule obtained through CMF decoding (optimising makespan; analogous for energy).
In the multi-objective context, both
In the next subsection, the different lists of supporting functions for each objective, as well as standard fitness function, are introduced.
The following functions serve different purposes, since there are several objectives to meet. Each function is designed to address a specific objective. However, it is notable that functions specialised in energy, while addressing the primary objective, incorporate the makespan into their considerations.
This is the baseline fitness function, which evaluates the chromosome exactly as it is provided. It does not alter the task order or the task-to-VM mapping. Since this function does not interfere with the chromosome, it serves as a neutral evaluator and allows changes introduced by crossover and mutation operations to be directly reflected in the evaluation. This function targets the minimisation of total energy consumption while preserving the task order specified in the chromosome. It modifies only the task-to-VM mapping. For each task, it attempts to assign it to a VM that does not increase the current makespan. Among the VMs that satisfy this condition, the one with the lowest energy consumption for the task is selected. If no such VM exists—such as in the case of the first task or when dependencies constrain scheduling—the function selects the VM with the minimum energy cost. In cases where multiple VMs offer the same energy efficiency, the tie is broken by choosing the VM that results in the earliest task completion time. Also focused on reducing energy consumption, this function incorporates task runtime as a guiding criterion. It preserves the task order defined by the chromosome but adjusts the task-to-VM mapping. For each task, the function evaluates its estimated total runtime—which includes both CPU execution and communication delays—and compares it to the average runtime of all tasks. If the task’s runtime is below the average, it is assigned to the VM with the lowest energy consumption. Conversely, if the runtime exceeds the average, the task is mapped to the VM that provides the shortest execution time for that task. This function is designed to reduce the makespan by applying the first stage of the HEFT (Heterogeneous Earliest Finish Time) algorithm. Unlike previous functions, it ignores the task order defined in the chromosome. Instead, it computes a new order based on upward ranks, prioritizing tasks expected to finish earlier according to averaged CPU and communication times. However, the task-to-VM mapping from the original chromosome is preserved. Focusing also on makespan reduction, this function applies the second phase of the HEFT algorithm. It keeps the task order as specified in the chromosome but adjusts the VM assignments. Each task is scheduled on the VM that allows it to complete at the earliest possible time, accounting for both VM availability and communication overheads.
The combination of these functions constitutes the basis of the two mono-objective CMF approaches. Specifically,
Previous studies using the CMF approach have employed a mono-objective genetic algorithm to optimise workflow scheduling in a set of virtual machines. In particular, in Barredo and Puente,
38
the CMF approach proved to be more effective in optimising makespan than both the direct chromosome decoding function (
Furthermore, in Barredo and Puente,
39
a correlation analysis between makespan and total energy revealed that optimising total energy also led to an improvement in makespan. This is due to the fact that the heuristics
The aim of this study is to assess the effectiveness of applying the Cooperative Multi-Fitness strategy in a Multi-Objective context (MOCMF). To this end, the NSGA-II algorithm has been adopted to simultaneously optimise makespan and total energy. The experimental evaluation will focus on the following key aspects:
Benchmark instances
Workflows can vary in size and shape, thus having an accurate notion of how the different fitness functions work, is critical. Therefore we need to have different workflows. WFCommon
49
repository is a public repository containing multiple executions of seven different workflows
50
, each family have multiple instances. For each of the families we have selected four, with different tasks sizes, having a range from extra-small(50-100) to large number of tasks (500-1000), with two more sizes in between, small (100-200) and medium (200-500). The instance sizes can be seen in the Table 2.
Every instance grouped by problem.
Every instance grouped by problem.
The families not only vary in shape and sizes, also there are families with a huge amount of communications with will impact greatly in the disk and network usage. This can help better understand how these two can impact the final makespan of total energy.
To analyse the trade-off between execution time and energy consumption, we define a heterogeneous computing platform composed of two types of hosts, whose characteristics are summarised in Table 3. Each host type differs in computational throughput and power consumption.
Host characteristics on the benchmarking platform. Units: Throughput (GFLOPS), active power, and passive power (watts). Based on the SPECpower benchmark.
51
Host characteristics on the benchmarking platform. Units: Throughput (GFLOPS), active power, and passive power (watts). Based on the SPECpower benchmark. 51
Two power modes are considered (both expressed in Watts): passive power refers to the energy consumed when a host is powered on but idle, while active power refers to energy consumption during task execution. We assume that each task utilises 100% of the CPU during its execution.
Throughput is expressed in GFLOPS, representing the number of floating-point operations the CPU can execute per second. The AMD EPYC 4584PX offers lower throughput but is energy-efficient, while the Intel Xeon Platinum 8471N provides higher performance at the cost of significantly greater energy consumption. As a result, neither host type is universally optimal: the AMD processor yields longer execution times with lower energy usage, while the Intel processor achieves shorter runtimes at a higher energy cost.
Both host types share the same disk and network configuration: a solid-state drive (SSD) with 540 MB/s I/O speed and a 1 Gbps network interface.
Each benchmark instance is tested under four different host configurations, using 2, 4, 8, and 16 hosts with an even distribution between the two host types.
All experiments are repeated 10 times. Each algorithm is allowed a total budget of 100,000 solution evaluations. For standard configurations, this corresponds to a population of 100 individuals and 100 offspring generated per generation, over 1000 generations. Mutation and crossover probabilities are set to 0.1 and 1.0, respectively, following the configurations validated in previous mono-objective studies.38,39
In the case of MOCMF, each offspring produces one decoded solution per objective (i.e., two evaluations per offspring in a bi-objective setting). To ensure a fair comparison in terms of computational effort, the number of offspring generated per generation is reduced to 50, resulting in the same total number of evaluations per generation. The population size remains unchanged.
The experiments are implemented using the open-source Java framework JMetal.
41
Experiments are executed on a Linux server equipped with a 12th Gen Intel
We evaluate the efficiency of fitness functions designed for single-objective optimisation in a multi-objective setting. Specifically, we analyse the impact of fitness functions tailored to optimise makespan and total energy independently, comparing their performance within NSGA-II across all instances. The primary performance metric used is the Hypervolume (HV).
To assess statistical significance, we conduct a Friedman test followed by a Holm post hoc test to compare the ranking of different approaches. This procedure follows the recommendations for non-parametric statistical analysis of algorithm performance across multiple problem instances. 52 The results are summarised in Table 4.
Average ranking of evaluated fitness strategies across six performance metrics: Hypervolume (HV), inverted generational distance (IGD), inverted generational distance plus (IGD+), generational distance (GD), epsilon indicator (EP), and spread. Lower values indicate better performance.
Average ranking of evaluated fitness strategies across six performance metrics: Hypervolume (HV), inverted generational distance (IGD), inverted generational distance plus (IGD+), generational distance (GD), epsilon indicator (EP), and spread. Lower values indicate better performance.
Although the revised table omits statistical test columns for clarity, we note that the rankings presented are fully supported by statistical significance tests. In particular, the top-performing strategy (
When incorporating additional performance metrics—namely Inverted Generational Distance (IGD), Inverted Generational Distance Plus (IGD+), Generational Distance (GD), Epsilon Indicator (EP), and Spread—a consistent pattern emerges:
The hybrid strategies
These results reinforce the effectiveness and robustness of the cooperative multi-fitness architecture, particularly when evaluated across a comprehensive set of Pareto front quality indicators that capture both convergence and diversity dimensions.
Median hypervolume obtained for makespan-based fitness functions across different problem families. In bold the best result for each problem.
While six performance metrics are included in the ranking to provide a comprehensive multi-objective evaluation, we focus the detailed analysis on the Hypervolume (HV) indicator. HV is widely recognised for simultaneously capturing convergence and diversity properties of Pareto front approximations, and is therefore particularly suitable for evaluating the overall quality of multi-objective solutions. 53 Other metrics are used to support the robustness of the findings, but their individual analysis is omitted here for the sake of clarity and conciseness.
The first experiment compares four fitness functions designed to optimise makespan:
From Table 4, we observe that:
The Holm test confirms that The worst-performing function is
These results indicate that integrating multiple heuristics into a cooperative fitness function enhances performance when optimising makespan in a multi-objective context.
The
Nevertheless,
These results reinforce the earlier statistical findings, demonstrating that cooperative multi-fitness functions enhance makespan optimisation in multi-objective scheduling scenarios.
Evaluation of energy-oriented fitness functions
A similar experiment is conducted for fitness functions designed to optimise total energy:
Key observations from Table 4:
The Holm test rejects the null hypothesis for all individual heuristics, confirming that
These results confirm that, as with makespan, integrating multiple heuristics into a cooperative approach enhances performance in total energy optimisation as well, even in a multi-objective context.
Median hypervolume obtained for energy-based fitness functions across different problem families. In bold the best result for each problem.
Median hypervolume obtained for energy-based fitness functions across different problem families. In bold the best result for each problem.
The
Unlike makespan optimisation, where the performance of individual heuristics was relatively consistent, here we see that
In contrast, the
These results again reinforce the earlier findings that combining multiple heuristics within a cooperative framework significantly improves performance, even when focusing solely on total energy optimisation.
Overall, the experiments demonstrate that CMF approaches outperform individual heuristic functions when optimising either makespan or total energy. The Friedman test rankings consistently place
This suggests that leveraging multiple heuristics in a cooperative manner enhances the search process in NSGA-II, leading to superior hypervolume values compared to using single-objective heuristics in isolation.

Median hypervolume obtained for
In the next section, we will evaluate whether combining both CMF approaches into a single multi-objective function (MOCMF) results in additional performance improvements, further boosting the quality of solutions across both objectives.
Following the evaluation of the mono-objective CMF functions within the multi-objective context, the focus now shifts to the results obtained by the multi-objective MOCMF approach. In this configuration, each pair of heuristics —
The results of the statistical test are summarised in Table 4, which highlights the performance of MOCMF relative to other fitness functions, both simple and composite. As demonstrated in the table, MOCMF achieves the highest ranking (2.607), surpassing all other evaluated functions. For the sake of comparison, the mono-objective CMF functions achieved the following rankings:
MOCMF achieves the highest HV in four out of the seven problem families: epigenomics, montage, and srasearch. In instances where MOCMF does not attain the maximum HV, the performance disparity remains marginal. Specifically, in the 1000genome problem, the HV of MOCMF (30%) is only 1 percentage point lower than the best result of
These results highlight the ability of MOCMF to closely approximate the highest HV achieved by the most efficient mono-objective CMF approach for each problem. Even when it does not attain the absolute maximum, MOCMF consistently delivers competitive results, maintaining a minimal performance gap relative to the best-performing CMF variant. This demonstrates its robustness in effectively balancing both objectives across different problem families.
In order to provide a more comprehensive evaluation of the robustness and effectiveness of the MOCMF approach, a focus is directed towards the instances of each workflow family that present the greatest challenges. These instances are characterised by the highest number of tasks and scheduled on the maximum number of virtual machines scenario. The results, summarised in Figure 5, demonstrate the performance of MOCMF in comparison to the baseline

Median hypervolume results for the challenging instances of each workflow family using 16 hosts infrastructure. The blue bar chart represents
Although MOCMF achieves the maximum HV in only two problems (1000genome and epigenomics), it demonstrates a strong capacity to remain close to the maximum HV in the other cases, with small gaps. In all but one of the remaining problems, the HV obtained by MOCMF is within a few percentage points of the highest value. This suggests that MOCMF is not only competitive but also exhibits significant robustness in the multi-objective optimisation process, making it a viable solution for various problem families.
To evaluate the robustness and generality of the proposed MOCMF approach, we extended the experimental assessment to include four additional multi-objective evolutionary algorithms: SPEA2 and IBEA, both different genetic approaches, and multi-objective ant colony optimization algorithm (MOACO), and a particle swarm optimization algorithm (MOPSO). All algorithms were implemented using the JMetal v6 framework and configured with identical parameters (for SPEA2 and IBEA), the same evaluation model (DNC), and the cooperative decoding strategy employed in the NSGA-II experiments. The MOACO and MOPSO both use the parameters present in each original works. MOACO is based on ACO-HEFT, 19 with the exception that the construction of the ranking is performed randomly instead of using HEFT ranking. Nevertheless, the HEFT ranking remains relevant due to their inclusion in the makespan support functions. The MOPSO version used is the Speed-constrained Multi-objective PSO (SMPSO). 54 Since the original algorithm operates exclusively on real-valued representations, an adaptation was necessary to support permutation-based encoding. To address this, we employed a Random-Key encoding, 55 modelling the permutation mechanism following the approach described in MOPSO approach. 56
The ranking results obtained for SPEA2, IBEA, MOACO, and SMPSO are presented in Tables 7 to 10, respectively. As in the NSGA-II case, each configuration includes the baseline decoding function (
The results show that the relative performance ordering among fitness strategies remains consistent across all three algorithms. In both all algorithms, the MOCMF strategy consistently achieves the top average rankings across convergence-focused metrics (HV, IGD(+), GD, EP), followed by the mono-objective CMF variants. The baseline
Scalability and computational load
This section analyses the computational demands of the proposed MOCMF approach and assesses its scalability in large-scale workflow scheduling scenarios. We begin by describing the procedural cost of MOCMF in terms of its decoding and evaluation mechanism, which differs from standard evolutionary algorithms due to the cooperative use of multiple heuristic decoders. Then, we present an empirical evaluation using the Cycles workflow family, demonstrating how execution time evolves as the number of tasks increases across several orders of magnitude.
Average ranking of evaluated fitness strategies using the SPEA2 algorithm across six performance metrics: Hypervolume (HV), inverted generational distance (IGD), inverted generational distance plus (IGD+), generational distance (GD), epsilon indicator (EP), and spread. Lower values indicate better performance.
Average ranking of evaluated fitness strategies using the SPEA2 algorithm across six performance metrics: Hypervolume (HV), inverted generational distance (IGD), inverted generational distance plus (IGD+), generational distance (GD), epsilon indicator (EP), and spread. Lower values indicate better performance.
Average ranking of evaluated fitness strategies using the IBEA algorithm across six performance metrics: Hypervolume (HV), inverted generational distance (IGD), inverted generational distance plus (IGD+), generational distance (GD), epsilon indicator (EP), and spread. Lower values indicate better performance.
Average ranking of evaluated fitness strategies using the MOACO algorithm across six performance metrics: Hypervolume (HV), inverted generational distance (IGD), inverted generational distance plus (IGD+), generational distance (GD), epsilon indicator (EP), and spread. Lower values indicate better performance.
Average ranking of evaluated fitness strategies using the MOPSO algorithm across six performance metrics: Hypervolume (HV), inverted generational distance (IGD), inverted generational distance plus (IGD+), generational distance (GD), epsilon indicator (EP), and spread. Lower values indicate better performance.
Unlike traditional multi-objective evolutionary algorithms that apply a single decoding strategy per chromosome, MOCMF introduces a cooperative decoding mechanism that invokes multiple heuristics—one standard decoder and a set of support heuristics associated with each objective.
Given a population of
Therefore, the total number of decoding and evaluation operations per generation is:
This cost grows linearly with the population size and with the total number of heuristic decoders used. In our implementation, all heuristics are lightweight and rule-based, ensuring that the added cost remains tractable and predictable. This section is followed by an empirical evaluation to confirm that the model maintains scalable execution across a wide range of workflow sizes.
To validate the practical scalability of the full MOCMF model, we conducted a dedicated experiment using workflow instances from the Cycles family available in the wfcommons 49 repository. These workflows share structural characteristics but vary significantly in size, ranging from 67 to 6,543 tasks. Each instance was executed using the most demanding infrastructure setting in our study: a 16-host heterogeneous cloud configuration with two distinct VM types.
We applied the complete MOCMF-enhanced NSGA-II algorithm to each instance, incorporating both makespan-oriented and energy-oriented cooperative decoding mechanisms. The goal was to observe how total execution time evolves as the problem size increases.
Table 11 presents the results, including the number of tasks, total runtime, and average computation time per task. The latter metric provides a normalized view of computational cost per unit of workflow size.
Scalability of MOCMF evaluated on the cycles workflow family, under a 16-host heterogeneous configuration. The table reports the number of tasks per workflow, total execution time, and average time per task.
Scalability of MOCMF evaluated on the cycles workflow family, under a 16-host heterogeneous configuration. The table reports the number of tasks per workflow, total execution time, and average time per task.
These results confirm that the total execution time increases with workflow size, as expected, but the computation time per task remains remarkably stable. Across a two-order-of-magnitude increase in problem size, the per-task evaluation time grows modestly from 0.16 to 0.27 seconds.
This near-linear growth indicates that the evaluation strategy employed in MOCMF supports efficient scaling across a wide range of problem sizes. Furthermore, because each heuristic decoding is self-contained and independent, the approach lends itself naturally to parallelisation. A multicore implementation could distribute evaluation workloads across processing units, substantially reducing actual runtime and further reinforcing the model’s applicability to large-scale workflow scheduling.
In this paper, we proposed a novel cooperative multi-fitness decoding strategy,
The study builds upon previous works that separately address makespan and total energy optimisation using mono-objective Cooperative Multi-Fitness approach. These two CMFs were individually analysed for their effectiveness in optimising each objective. Our contribution focuses on exploring the synergies that arise from combining these two mono-objective CMFs within a multi-objective framework. By employing NSGA-II, a standard multi-objective algorithm widely applied across diverse and technically distinct engineering domains—ranging from structural inverse analysis in civil engineering, 57 and post-earthquake recovery scheduling in community resilience planning, 58 to water distribution optimisation in environmental systems, 59 train scheduling and shunting in high-speed railway operations 60 as well as integrated scheduling in multi-vehicle public transport systems 61 — we investigate the potential benefits of combining both objectives in a single optimisation process, guided by dominance principles.
One key observation is the close relationship between makespan and energy. These two objectives are typically opposing; as energy consumption is minimised, makespan often increases. However, this is not always the case in our scenario, due to two main factors: the energy model where virtual machines continue consuming passive energy until the makespan is reached, and the dependency of energy heuristics on makespan, which is treated as a soft constraint.
In our experiments, we observed that the mono-objective CMFs, both in makespan
The primary benefit of the cooperative multi-fitness approach is its high degree of reusability. It is not dependent on any specific number or set of support functions, nor is it constrained to a particular problem or optimisation algorithm. Its only requirement is the availability of heuristic support functions, which can be exploited alongside a standard function, all operating on the same solution representation scheme for the problem at hand.
Finally, the multi-objective version of Cooperative Multi-Fitness (MOCMF) appears to be readily applicable to a wide range of population-based metaheuristic algorithms, given its ability to generate alternative solution variants for each objective to be optimised. This adaptability allows its integration into various optimisation frameworks without requiring fundamental modifications to the underlying algorithm.
Future work
While the proposed MOCMF mechanism has demonstrated competitive performance and generality, several limitations should be acknowledged. First, the approach relies on the availability of high-quality, objective-specific heuristics, which may not be readily available in every application domain. Second, the use of Lamarckian recoding introduces a potential bias in the evolutionary process, as it systematically promotes heuristic-generated schedules over those derived directly from genetic search. Third, the cooperative decoding strategy increases the computational cost of each generation, since multiple decodings per chromosome are performed—one for each objective-specific heuristic package.
Future work will explore adaptive recoding strategies and dynamic heuristic selection to mitigate these effects. We also plan to extend the MOCMF mechanism to additional multi-objective domains beyond workflow scheduling, particularly those where conflicting heuristics can be derived from domain knowledge.
A promising direction is the integration of the MOCMF framework with other nature-inspired metaheuristics62,63 beyond the evolutionary algorithms tested in this study. In particular, techniques such as Spiral Dynamics Algorithm, 64 Bacteria Foraging Algorithm, 65 Particle Swarm Optimization with selective search, 66 and Discrete Spider Monkey Optimization 67 present diverse search dynamics that could benefit from the cooperative evaluation and Lamarckian recoding mechanisms introduced here. Such extensions would further validate the modularity and generality of the proposed strategy across different algorithmic paradigms.
Footnotes
Funding
This research has been supported by the Spanish Government under research Grants TED2021-131938B-I00 and PID2022-141746OB-I00, and by the Principality of Asturias under research Grant GRU-GIC-24-018.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
