Format And Type Fonts


 CCHHEEMMIICCAALL  EENNGGIINNEEEERRIINNGG  TTRRAANNSSAACCTTIIOONNSS  
 

VOL. 45, 2015 

A publication of 

 
The Italian Association 

of Chemical Engineering 

www.aidic.it/cet 
Guest Editors: Petar Sabev Varbanov, Jiří Jaromír Klemeš, Sharifah Rafidah Wan Alwi, Jun Yow Yong, Xia Liu  

Copyright © 2015, AIDIC Servizi S.r.l., 

ISBN 978-88-95608-36-5; ISSN 2283-9216 DOI: 10.3303/CET1545059 

 
Please cite this article as: Sharma I., Hoadley A., Mahajani S.M., Ganesh A., 2015, Methodology for “surrogate-assisted" 

multi-objective optimisation (moo) for computationally expensive process flowsheet analysis, Chemical Engineering 

Transactions, 45, 349-354  DOI:10.3303/CET1545059 

349 

Methodology for “Surrogate-Assisted" Multi-Objective 

Optimisation (MOO) for Computationally Expensive Process 

Flowsheet Analysis 

Ishan Sharma*
a
, Andrew Hoadley

b
, Sanjay M. Mahajani

c
, Anuradda Ganesh

d
 

a 
IITB-Monash Research Academy, Indian Institute of Technology Bombay, Powai, Mumbai, 400076, India 

b 
Department of Chemical Engineering, Monash University, Clayton, VIC-3168, Australia 

c 
Department of Chemical Engineering, Indian Institute of Technology Bombay, Powai, Mumbai, 400076, India 

d 
Department of Energy Science and Engineering, Indian Institute of Technology Bombay, Powai, Mumbai, 400076,  

  India 

114174001@iitb.ac.in 

In recent years, Multi-Objective Optimisation (MOO) has increasingly been applied in chemical engineering 

problems where multiple, often conflicting, objectives need to be optimized simultaneously. Genetic 

Evolutionary Algorithms (EAs) are often used in order to solve MOO problems, especially when the user is 

concerned with the trade-offs involved between the multiple objectives. One particular form of the 

evolutionary algorithm is the Genetic Algorithm (GA), which is a population based algorithm that tries to 

mimic the process of biological natural selection. GA or any other EA is able to yield a set of 

non-dominated solutions in a single run. However, a downside associated with such population based 

algorithms is that a large fraction of the datasets evaluated in order to guide the search, are actually not 

represented in the final solution. This becomes an issue, especially for problems involving computationally 

expensive functional evaluations. For such problems, “surrogate” or “meta” models are often used to 

approximate the exact, but computationally expensive models. This results in a significant saving in terms 

of computation time. However, building a surrogate model, accurate enough in the entire decision variable 

space is a challenge in itself. In the present work, a methodology to perform “surrogate-assisted” MOO has 

been proposed.  The efficiency of the proposed methodology is compared against another similar 

methodology for surrogate-assisted MOO. The two approaches are first tested against two mathematical 

test problems and the most suitable method is then applied to the chemical engineering flowsheet 

optimisation of a coal to ammonia process with Carbon Capture and Sequestration (CCS). The results 

show savings in computation time even in the most conservative case. 

1. Introduction 

MOO is often used to analyse the trade-offs involved between two or more, often conflicting objectives. 

There are numerous methods available to solve MOO problems including; objective weighing method, 

ɛ-constraint method and evolutionary algorithm based methods, among others. GAs are a class of EAs 

that use a population based approach to mimic the process of biological natural selection. GA or any other 

EA has the advantage that they are able to yield a set of non-dominated solutions in a single run. 

However, in order to do so, a lot of candidate solutions need to be evaluated and ranked. Hence, solving a 

MOO problem by using such a population based approach takes a lot of time if the functions being 

evaluated, are computationally intensive. Also, a large fraction of the candidate solutions evaluated during 

the MOO run are not included in the final solution. Hence, there exists an opportunity to save a lot of 

computation time if the functions could be approximated by a computationally simpler function.  

A “surrogate” model refers to an approximation of the exact or “detailed” model. “Surrogate-assisted” MOO 

refers to the use of surrogate models, either partially or completely, during the course of a MOO run with 

an aim to speed up the search. However, building a surrogate model, accurate enough in the entire 


350 

 
decision variable space is a challenge in itself. Wilson et al. (2001) used such an approach of using a 

single global surrogate to carry out MOO. Jin (2005) discusses this approach in detail and concludes that 

this approach can be problematic in cases suffering from high dimensionality, ill distribution and limited 

number of training datasets. It is for this reason that Jin (2005) recommends the use of both the “detailed” 

and the “surrogate” model during the MOO run. 

The next issue to be dealt with is how to decide whether to use the surrogate or the detailed model for an 

evaluation. Jin (2005) described this as “model management” or “Evolution Control” (EC). There are two 

basic strategies used in EC namely, fixed EC and adaptive EC. The salient feature of fixed EC is that the 

frequency of detailed model evaluations is fixed. Fixed EC includes two sub-categories viz., individual and 

generation based EC. Individual based EC involves only some individuals in every generation using the 

detailed models. Generation based EC, on the other hand, involves the candidate solutions being 

evaluated by detailed model only after a fixed interval of evolutionary generations. Adaptive EC involves a 

variable, rather than a fixed, frequency of detailed model evaluations. This frequency depends on the 

accuracy of the surrogate models. Ray et al. (2009) proposed a Surrogate Assisted Evolutionary 

Algorithm (SAEA) which used a mixture of the basic EC strategies discussed above. In SAEA, in addition 

to generation based fixed EC, a form of adaptive individual based EC is also used.  In SAEA, the 

Euclidean distance of every normalized candidate solution is measured from each and every normalized 

training dataset. If the minimum distance is below a particular threshold value (confidence region), then 

only the surrogate model is used for function evaluation. SAEA involves periodic retraining of the 

surrogates by making use of all the detailed model evaluations stored in an “external archive”.  The 

periodic re-training of the surrogate model is advantageous especially in the cases where the dataset, 

used to initially fit the surrogate, is significantly different from the final Pareto-optimal solution.  

There is a wide variety of model approximations available to choose from, ranging from polynomial 

models, multi-layered feed-forward neural networks, kriging approximations, radial basis functions, and 

support vector machines. Santos et al. (2013), for example, compared the performance of multi-layered 

feed-forward neural networks and radial basis functions in detecting and locating leaks in gas 

transportation pipelines. Jin (2005) reports that though there are several studies comparing the 

performance of different approximation methods, there is a lack of clarity as far as the advantages and 

disadvantages of different methods is concerned. This is mainly because performance of individual 

methods is also dependent on the problem being solved. Also, there may be several criteria on which the 

comparison could be made.  

Isaacs et al. (2009) recently proposed another set of algorithms for surrogate-assisted MOO. The two 

algorithms proposed by Isaacs et al. (2009) are Multiple Spatially Distributed Surrogates (MSDS) and 

Multiple Adaptive Spatially Distributed Surrogates (MASDS). There are a lot of similarities between SAEA 

and these two algorithms. For example, like SAEA, MSDS and MASDS also use EC. The basic difference 

between the two approaches is that while SAEA uses single surrogate models in order to predict the 

functions in the entire decision variables space, MSDS and MASDS use multiple, locally accurate 

surrogate models in order to predict functions in different regions. The basic genetic algorithm used in 

MSDS and MASDS is the Non-dominated Sorting Genetic Algorithm (NSGA)-II proposed by 

Deb et al. (2002). Isaacs et al. (2009) demonstrated these algorithms for six mathematical MOO problems 

along with a welded beam design optimisation problem. Isaacs et al. (2009) concluded that MASDS 

performed by far the best for most of the problems. Recently, Zhu et al. (2014) proposed a multi-objective, 

variable-fidelity optimization method, i.e. a surrogate-assisted method, for genetic algorithms. They noted 

that the optimal decision variable space is typically a small subset of the original search. Hence, in their 

approach new points are only included for surrogate model fitting if they are expected to impact the 

domination decisions made by the genetic algorithm and on the basis of error estimates, provided by the 

kriging models. However, the computation complexity in fitting the surrogate model increases significantly 

with an increase in the size of the dataset used to fit the model. This problem is especially acute for 

problems involving three or more objectives. Liu and Collette (2014) proposed an improved version of the 

method proposed by Zhu et al. (2014) to incorporate multiple surrogate fitting, just like what 

Isaacs et al. (2009) had proposed in MSDS and MASDS, to deal with the increase in complexity of 

surrogate fitting. A complimentary way to deal with this issue could have been to limit the search space 

periodically and fitting the surrogate in just the promising subset of the original decision variable space. 

This is particularly important for many real life problems where the initial search space could be drastically 

different than the final solution. The present work aims to address this research gap, not considered in the 

above mentioned works, by proposing an appropriate surrogate-assisted MOO framework.  The proposed 

framework has been compared against MASDS for two mathematical test problems. This is then followed 

by applying the proposed algorithm to a chemical engineering flowsheet optimisation of coal to ammonia 

process with Carbon Capture and Sequestration (CCS) by a non-selective Rectisol
TM

 process.  


351 

2. Proposed methodology 

Ø All the “detailed” evaluations are 

stored in an “external archive”

Ø Aim is to incorporate only the “superior” datasets in surrogate model fitting

Ø Expected to improve the surrogate model predictions in the region of 

interest

Ø A surrogate model is termed as “valid” when the test error is less than the user specified threshold value

Ø Only a fraction, α, of the data in external 

archive is used for training the surrogates 

Ø This is primarily done to keep a check on the search direction

Ø Every candidate solution has to be within a particular euclidean 
distance, µ, from either one of the samples in training set for it 
to be evaluated using surrogate model

Start

Start the GA run using functional 

evaluations being done by using 

“detailed” models for “i” generations 

Partition the external archive, A, containing all the 

detailed model evaluations into “training data” 

and “test data”

“Refine” the decision variable search space on the basis of GA results 

obtained until now
Total number 

of generations = 

nmax

Stop

Ø It gives another opportunity to 

search in the sections that were 

discarded when last time surrogate 

models were built

Run the Genetic Algorithm using functional 

evaluations being done by using detailed 

models for 1 generation but with a “high” 

mutation rate in the entire decision variable 

search space as specified at the start

Train the surrogate models for objectives (m in number) and constraints (p in number) and test their 

generalisation abilities against test data. Let the trained surrogate models be Sj ; where, j=1,2.. (m+p)

Yes

No

For j = 1 to (m+p) do                                          / For every surrogate model
      If Sj is not valid Then                                          / Check wheter the surrogate model is valid or not
           For K = 2 to Kmax                                          / Kmax refers to the maximum number of partitions allowed
              βK  = 0          / βK refers to the number of valid surrogates for K partitions 
              Partition the input data in K clusters ( A1,... AK)
              For l = 1 to K do
                      Sj,l = train(Al)                  
              end for
              For l = 1 to K do
                      test(Sj,l)                                      
              end for
              For l = 1 to K do
                   If Sj,l  is valid Then
                       βK  = βK  + 1          
                   end If
              end for
              ØK = βK/K           / ØK refers to “valid partition ratio”
              If ØK = 1 Then           

    Exit
              end If
              end for
              Kfinal = K with maximum ØK           
      end If
end for

/ Testing the surrogate models for j
th
 independent variable   

and i
th
 partition

/ Increasing the count of valid surrogates by 1 for K 
partitions 

/ Training the surrogate models for j
th
 independent variable 

and i
th
 partition

 / In absence of a particular K for which all surrogates are 
valid, the one with the highest ØK is to be chosen with 
respective surrogate models for different partitions

/ If all the surrogates for K partitions come out to be valid, then 
further partitioning is not done. This is done in order to avoid 
extra computation effort required for large values of K. 

Run the GA using functional evaluations being done by using 

surrogate models for the next “s” generations 

Re-evaluate the intermediate results of GA using functional 

evaluations being done by using detailed models 

 
Figure 1: The proposed methodology for “surrogate assisted” MOO 

The most significant differences between MASDS and the proposed methodology is that whenever the 

surrogate models are retrained after the initial training, the decision variable search space is “truncated” 

according to the final results (DNA) obtained until that point. The idea of periodic truncation helps to 

discard “inferior” datasets in the external archive and prevent them from contributing to the surrogate 

models fitting. It is expected that the accuracy of the surrogate models thus fitted, would be better in the 

“promising” regions.  


352 

 
3. Test Problems 

3.1 Problem definition 
Table 1 lists the two numerical test problems against which the performance of the proposed methodology 

and MASDS are compared. The SCH problem represents a case where the initial search space is 

drastically different from the optimal decision variable space. To facilitate comparison between the two 

algorithms crossover probability (0.85), default mutation probability (0.01), random number generator and 

the seed value for random number generation were kept the same for the two runs. The resolution 

parameter, that is µ, has been kept as constant, that is, 5 % of the length of the solid diagonal of the 

decision variable space. However, it could also have been variable as the search space, for the algorithm 

proposed in this work, keeps on changing. The value of µ should largely depend upon the accuracy of the 

surrogate models being fitted. The surrogate models in this work are fitted by using SUMO toolbox 

developed by Gorissen et al. (2010). The model being fitted are feed-forward neural networks where 

MATLAB’s gads toolbox selects network parameters using a GA. 

Table 1: Numerical test problems used in this study 

Problem Objectives Initial search space Final Solution Parameters 

SCH problem  

(Convex Pareto) Minimise: 
2

1
x(x)f 

 
Minimise: 
2

2
2)(x(x)f 

 
 1000,1000x    0,2x 
 

1i   
3s   

13n
max


 

ZDT2 Problem 

(Concave Pareto) 
Minimise: 11 x(X)f   

Minimise: 
g(X))(X),h(f*g(X)(X)f

12


 
  2
11

gf1g),h(f 
,









 



10

2i

i
x1g(X)

 
 
01,2,3,...1i

,0,1x
i





 
 

102,3,4,....i

0,x

0,1x

i

1







 
1i   
7s   

56n
max


 

3.2 Results 
Figure 2(a) and 2(b) show the intermediate results obtained for SCH and ZDT2 test problems after 13 and 

56 generations. For both the test problems, it can be seen that the proposed framework gave better results 

in terms of both nearness to actual Pareto and the spread of points on the Pareto. However, for ZDT2 

problem, the proposed framework required a greater number of detailed model evaluations. The resolution 

parameter, that is µ, was kept constant as 5 % of the normalised solid diagonal length. Hence, the 

resolution (in absolute values) became smaller, as the search space became narrower. This meant that 

the minimum distance criterion for evaluation via surrogate models was not met in most of the cases. It is 

hence suggested that in such cases, µ could also be treated as variable, depending upon the accuracy of 

the fitted surrogate models.  

 
Figure 2: (a) SCH results after 13 generations; (b) ZDT2 results after 56 generations 

Since, the proposed framework showed promising results, it was then applied to “global” optimisation of 

coal to ammonia process.  


353 

 
Gasification 
High Temperature 

Shift 

Low Temperature 

Shift 

Water removal  

and Drying 

Stripper 

CO
2
 

Compression 

Solvent 

Flashing 

CO
2
 emission 

Source (1) 

Super-critical CO
2
 

@ 100bar 
Absorber 

Methanation 

CO
2
 emission 

Source (2) 

Compression Compression NH3 Synthesis Reactor 

NH
3 

Condensation 
Purge gas burning and heat recovery 

Drier 

4. Rectisol
TM

 Problem 

Figure 3 depicts the coal to ammonia process (with CO2 capture) under investigation.  The gasifier is an 

entrained flow gasifier and chilled methanol is being used to capture the CO2. The aim of this exercise is to 

optimise the “global” coal to ammonia process while using a computationally cheaper surrogate model for 

the CO2 capture process (namely, Rectisol
TM

). The detailed model for Rectisol
TM

 involves a number of 

recycle streams, thereby making the Aspen Plus
TM

 simulation, computationally expensive.  

 
Figure 3: Block diagram of the coal to ammonia process being optimized (the highlighted rectangular 

portion represents the surrogate model’s boundary) 

4.1 Problem definition 
Objectives: Minimise net electricity consumption (kW) and moles of CO2 emitted per unit moles of NH3 

produced (for a minimum NH3 production of 1,200 kmol/h) and a fixed coal feed rate.  

Decision variables (seven in total): Oxygen flow to gasifier (kg/h), HP steam to shift reactor (kg/h), Purge 

fraction for ammonia synthesis loop, along with the four parameters specific to Rectisol
TM

 unit- same as 

specified in (Sharma et al., 2014).  

Constraints: H2 and CO recovery across Rectisol
TM

 unit should both be greater than 99 % and CO2 

mole fraction in the treated gas from Rectisol
TM

 unit should be less than 10 PPM. 

4.2 Results 

Figure 4(a) shows the results obtained, for similar number of detailed model evaluations, with 

“business-as-usual approach” (BAU) (that is, where only detailed Aspen Plus
TM

 evaluations were done) 

and the surrogate-assisted approach, proposed in the present work. 

  
Figure 4: (a) Proposed framework and BAU approach (for similar number of detailed evaluations); 

(b) Proposed framework and BAU approach after 1,337 and 2,850 detailed evaluations 

Water HP Steam 

Slag 

Produced NH
3
  

Air 

N
2
 

O
2
 (99 % pure) 

Coal 

Water  


354 

 
The proposed framework’s performance is somewhat similar (but better in terms of spread of solutions) to 

the BAU. Pairwise comparison of the two Pareto datasets yielded that every point on the proposed 

framework plot was dominated by 0.50 points on the BAU plot. While every point on the BAU plot was 

dominated by 1.86 points on the proposed framework plot. Figure 4(b) shows the comparison of the results 

obtained from the proposed framework and those obtained from the BAU approach for a significantly 

higher number of detailed evaluations. The comparison depicts how close these results are to global 

optimum values, assumed to be depicted by the BAU plot in Figure 4(b). Such an analysis clearly shows 

the relative advantage offered by the proposed approach over the BAU approach, even for a conservative 

value of the resolution parameter (µ), kept as 5 % of the normalised solid diagonal length. 

5. Conclusions 

A surrogate-assisted MOO framework has been proposed in this work. The aim has been to improve upon 

a similar existing framework (MASDS). The proposed framework has been tested on two mathematical 

problems. This analysis has yielded some useful insights related to how to best apply this framework. The 

resolution parameter, µ, in particular could be varied depending upon the accuracy of the fitted surrogate 

models. This framework is then used for the global optimisation of coal to ammonia process, with CCS. 

The results of the optimisation run have been reported along with those obtained with the BAU approach. 

Each point on the non-dominated set generated with the proposed framework dominated, on an average, 

1.86 points in the set generated by BAU approach. While the same metric for BAU approach was just 0.50. 

The results show huge potential of using such a technique for MOO problems in chemical engineering 

which would otherwise be computationally prohibitive  The proposed framework, hence, could allow for 

better integration of computationally complex units into a large-scale plant simulation (i.e. global 

optimisation), rather than treat them as stand-alone units which could result in sub-optimal operation. 

Unlike other optimisation methods, where the solution is a set of optimised decision variables, this method 

provides both the optimised decision variables and an array of surrogate models accurate in different 

regions of the decision variable space, for future predictions. 

Acknowledgement 

The authors would like to thank Orica Ltd for funding the project through the IITB-Monash research 

academy. 

References 

Deb K., Pratap A., Agarwal S., Meyarivan T.A.M.T., 2002, A fast and elitist multiobjective genetic 

algorithm: NSGA-II, IEEE Transactions on Evolutionary Computation, 6(2), 182–197.  

Gorissen D., Couckuyt I., Demeester P., Dhaene T., 2010, A Surrogate Modeling and Adaptive Sampling 

Toolbox for Computer Design, Journal of Machine Learning Research, 11, 2051-2055. 

Isaacs A., Ray T., Smith W., 2009, Multi-objective design optimisation using multiple adaptive spatially 

distributed surrogates, Int. J. Product Development, 9 (1-2-3), 188-217. 

Jin Y., 2005, A comprehensive survey of fitness approximation in evolutionary computation, Soft 

Computing, 9(1), 3-12. 

Liu Y., Collette M., 2014, Improving surrogate-assisted variable fidelity multi-objective optimisation using a 

clustering algorithm, Applied Soft Computing, 24, 482-493. 

Ray T., Isaacs A., Smith W., 2009, Surrogate Assisted Evolutionary Algorithm for Multi-Objective 

Optimization, G.P. Rangaiah (Ed.), Multi-objective Optimization: Techniques and Applications in 

Chemical Engineering, Singapore: World Scientific, 131-151. 

Santos R.B., Rupp M., Bonzi S. J., Fileti A.M.F., 2013, Comparison  between Multilayer Feedforward  

Neural Networks and a Radial Basis Function Network to Detect and Locate Leaks in Pipelines 

Transporting Gases, Chemical Engineering Transactions, 32,  1375-1380. 

Sharma I., Hoadley A., Mahajani S.M., Ganesh A., 2014, Automated optimisation of multi stage 

refrigeration systems within a multi-objective optimisation framework, Chemical Engineering 

Transactions, 39, 25-30. 

Wilson B., Cappelleri D., Simpson T.W., Frecker M., 2001, Efficient Pareto frontier exploration using 

surrogate approximations, Optimization and Engineering, 2(1), 31-50. 

Zhu J., Wang Y.J., Collette M., 2014, A multi-objective variable-fidelity optimisation method for genetic 

algorithms, Engineering Optimisation, 46(4), 521-542.