CHEMICAL ENGINEERING TRANSACTIONS VOL. 52, 2016 A publication of The Italian Association of Chemical Engineering Online at www.aidic.it/cet Guest Editors: Petar Sabev Varbanov, Peng-Yen Liew, Jun-Yow Yong, Jiří Jaromír Klemeš, Hon Loong Lam Copyright © 2016, AIDIC Servizi S.r.l., ISBN 978-88-95608-42-6; ISSN 2283-9216 Forecasting Modeling for Energetic Efficiency in an Industrial Process Simone M. Acostaa, Angelo M. O. Sant’Annab, Osíris Canciglieri Juniorc* aDepartment of Electronics, Federal University of Technology Paraná (UTFPR), Av. Sete de Setembro, 3165, Curitiba, Brazil bMechanical Engineering Department, Federal University of Bahia, Rua Aristides Novis, 2, Salvador, Brazil. cIndustrial and Systems Engineering Graduate Program, Pontifical Catholic University of Parana, Rua Imaculada Conceição, 1155, Curitiba, Brazil. osiris.canciglieri@pucpr.br The quality of products and processes is a permanent challenge for industries, and such challenge is no different in steelmaking processes. One of the main problems affecting the quality of steel products is the existence of contaminants in alloy steel, and phosphorus (P) is a major contamination element interfering with the steelmaking process. The increased P levels can severely affect the physical integrity of steel bonds, thus threatening the quality of the final product. This paper proposes a robust approach to model the phosphorus concentration levels in the steelmaking process. The proposed approach consists in applying the artificial neural networks techniques for improving the energetic efficiency of the industrial process. We used the improved neural network models inspired in the human nervous system for processing the information. The different techniques used for modelling the phosphorus levels investigate the variables that have a significant influence on refining process. Based on the better predictive model, the increase of phosphorus levels in the final product is related to initial levels of carbon, oxygen, magnesium, manganese oxide and calcium oxide. The results illustrate the efficiency of the techniques used in the modelling, with emphasis on the adequacy of the predictive models constraints in the refining process. This study presents a relevant strategy to model characteristics’ of raw material based on forecasting strategy to the efficiency of alloys and steel industry. 1. Introduction Assuring the quality of processes and products remains a constant challenge for companies, and this is increasingly becoming a fundamental requirement for their endurance. This is no different in steelmaking companies. One of the main parameters affecting the excellence of products in this sector is the existence of contaminants in alloy steel. Phosphorus (P) is one of the main contaminants found to interfere in steelmaking processes. The ferromanganese alloys are the major sources of P contamination during the process of steelmaking, which demands a limited use of this kind of alloy along the process (Um et al., 2014). The increased P levels can significantly affect physical aspects of alloy steel, thus severely compromising its quality. P-rich steel compounds usually show: (i) increased hardness; (ii) decreased ductility; (iii) appearance of ghost lines in carbon-rich alloy steels; and (iv) increased frailty of steel bonds under high and low temperatures (Chaudhary et al., 2001). The refining process adopted by the company subject to this study uses high-purity oxygen to reduce carbon levels in High-Carbon Ferromanganese (FeMnHC) originating Medium-Carbon Ferromanganese (FeMnMC) which has higher market value. During this process, there is an alteration in levels of other elements, including phosphorous. Therefore, the present work proposes a model for the FeMnMC refining process in an important Brazilian metallurgic industry with the purpose of predicting P levels at the end of the refining process. The Artificial Neural Network (ANN) method, defined as data processing systems based on the behavior of the human nervous system, was the chosen modeling technique (De Castro, 2007). The main advantages of ANNs are: (i) the capacity of approximating the behavior of non-linear physical phenomena, not requiring DOI: 10.3303/CET1652181 Please cite this article as: Acosta S. M., Sant’Anna A. M. O., Canciglieri O. J., 2016, Forecasting modeling for energetic efficiency in an industrial process, Chemical Engineering Transactions, 52, 1081-1086 DOI:10.3303/CET1652181 1081 profound statistical comprehension of the modeled data and; (ii) the capacity of learning any input/output variables of the continuous form (Haykin, 2008). An ANN is usually called a black-box model, considering that this kind of technique approximates the relative input/output of a certain process. For the process studied here, input variables were FeMnHC composition, dross composition, alkalinity indexes and amounts of solid loads added during the process. The output variable was P concentration at the end of the process. Application of ANN to model different chemical and industrial processes is relevant and several authors have profited from the approximation of processes with complex behavior allowed by this technique in their work, such as Bakar et al. (2013) that performed a study to identify values of key parameters for prediction of heat exchanger of a bio-methanol production plant. Joo et al. (2013) propose to control the temperature of a polymerization reaction of styrene in suspension and to compare with classical controller PID systems. Osuolale and Zhang (2014) evaluated different ANN models for forecasting hourly load of an electricity market. Yong et al. (2015) minimized the heat exchanger for waste heat under varying feed conditions using neural network model, allowing lower time and cost in manufacturing, while Gao and You (2015) developed an approach to optimize the pattern of filter networks, offering practitioners a relevant trade-off between quality and cost. This paper proposes a robust approach to model the phosphorus concentration levels in the steelmaking process. The proposed approach consists in applying the artificial neural networks techniques for improving the industrial process energetic efficiency. 2. Backgroud Artificial Neural Networks are systems that manipulate information through the interaction of basic processing units called artificial neurons (Haykin, 2008). Artificial neurons, in turn, are basic units that receive and process external inputs generating outputs. Du and Swamy (2013) asserted that an artificial neuron comprises the following components: (i) a group of synaptic weights that can balance the importance of input signals; (ii) a threshold term or Bias that enables to increase or decrease the impact of different inputs on neuronal function; (iii) an adder, which can add up values of inputs balanced according to synaptic weights and (iv) an activation function that controls the output signal. The artificial neural are usually organized in layers which can be categorized into output and hidden layers. An ANN may have a variable number of hidden layers, but according to Kolmogorov’s theorem and to the second theorem of universal approximation only one hidden layer is enough to approximate any function of continuous nature (Scarselli and Tsoi, 1998). This allows a reduction in the complexity of the model, but occurrence an increased number of neurons are required. An ANN model with two layers may be mathematically described by the Eq(1), (1) where yt is the estimated network output, wkj and wkj are the synaptic weights of the output and hidden layers, bj represents the threshold terms, xt is the values of input variables and f() symbolizes the exchange between neurons in output and hidden layers. Even though there are several forms of transfer functions, the most commonly studied are the sigmoid logistic and the linear and hyperbolic tangent functions. The choice of the most suitable activation function is a crucial step and a linear function is typically used for the output layer in issues of function approximation, whereas logistic or tangent hyperbolic functions are employed in the hidden layers. Before the learning process starts, the data are standardized according to the activation function in the hidden layer. For logistic function, the data are standardized in the interval [0,1] whereas the interval [-1,1] is used for the hyperbolic tangent function. This standardization process is ruled by the Eq(2), (2) where ts is the standardized variable, tmin and tmax are the minimum and maximum values to ts, respectively, xt is the original variable, xmin and xmax are the minimum and maximum values of xt. These functions are chosen due to easy the derivation and simplification of the network learning process, at the synaptic weight regulation (Engelbracht, 2006). The learning process consists of two basic steps: (i) the prediction stage, where synaptic weights are adjusted based on a group of training data, and (ii) the test stage, that determines whether the prediction model is able to generalize. The synaptic weights are set through an optimization algorithm called learning algorithm, which for multiple layer ANNs is usually the error Backpropagation algorithm. It consists on the minimization of the mean quadratic error (Eq(4)) of output variables generated by the ANN relative to real measured values. There are several variations of the error Backpropagation algorithm, being the Levenberg-Marquadt algorithm amongst the most efficient. This algorithm, however, tends to cause overfitting, where the model specializes in 1082 the training data. For that reason, a Bayesian regularization term can be employed in the ANN cost function. The ANN cost function was then modified as Eq(3), (3) where et = (yi – yt) is the estimated error, yi is the actual value measured at the system, yt is the output estimated by the ANN, n is the number of samples,  and  are terms estimated by the learning algorithm with a cost function, wkj and wji are the synaptic weights of output and hidden layers, respectively. To evaluate the generalization capacity of an ANN error minimization strategies are used, such as the root mean square error (RMSE) and the mean absolute percentage error (MAPE), theses can be determined according to Eq(4) and 5, (4) (5) An important way for evaluation of an ANN model is to estimate the number of parameters or synaptic weights (W) and the amount of data used for model estimation (N), where corrected Akaike Information Criteria (AICc) is suitable (Sant’Anna, 2015), see Eq(6), and the number of synaptic weight in an ANN with two layers can be defined by the Eq(6). (6) where W = [(kj) + j] + [(ji) + i], k represents the number of input variables, j is the number of neurons in the hidden layer and i the number of neurons in the output layer. (Vining and Kuwalski, 2011) assert that, an ANN model must have N > W based on the Occam theorem that represents a system employing a minimum number of parameters and synaptic weights. 3. Case Study Several kinds of steel are affected by high levels of phosphorus or manganese, which can compromise their quality. The refining process of ferromanganese consists of carbothermic reaction that involves the reduction and control of the percentage of carbon in the liquid metal by injection of oxygen in the bath. The chamber for the refining process of ferromanganese on the oxygen converter has an internal layer covered of magnesium oxide (MgO), and two pipes (N2-cooled) and vertical stream (water-cooled) blow oxygen. Inside the converter, the FeMnHC alloy has an average carbon level of 6.5 %, which increases the efficacy of the oxidant dephosphorylation. The Lime is also used during the dephosphorylation step, causing significant loss of manganese with no substantial effect on P levels. When lime is dissolved in the molten bath it increases the alkalinity of dross, which prevents the loss of manganese into the dross. Before the oxygen blow begins, approximately 60 kg of CaO per ton of FeMnHC dissolved in the bath are added, in a process divided into 3 stages. On the first stage, dephosphorylation and decarburization start to occur and significant amounts of manganese are lost to the dross through volatilization, because of the excess of oxygen. During the second stage, called decarburization, bath temperature rises, reducing manganese oxidation and increasing the rate of decarburization as well as Mn volatilization. Rephosphorylation occurs as a consequence of such temperature variations because P2O5 is highly unstable at high temperatures. As the temperature continues to rise, rephosphorylation can only be halted if the activity of phosphor pentoxide is reduced, which can be achieved by dissolving P2O5 into the dross in the presence of Lime. At the third stage, there is the addition of a solid deoxidizing load containing 110 kg of HCFeMn or MCFeMn, 150 kg of FeSiMn and 40 kg of CaO per ton of liquid HCFeMn in the bath, which helps to cool the mixture and contributes to reducing manganese volatilization. At this point, a blow of inert CO2 gas at 200 Nm3/h begins with the only purpose of stirring the bath. 3.1 Experimental design The 21 initial input variables defined as relevant for modeling the dephosphorization process were grouped as: (i) composition of metal alloys (x1, …, x6); (ii) composition of slag (x7, …, x15); (iii) composition of solid loads 1083 (x16, x17, x18); and (iv) levels of alkalinity: binary; quaternary e optical basicity (x19, x20, x21), see Table 1. The data selected covered a sample of 248 observations. Table 1: Description of characteristics of sustainable products Group Specifications Und Alloys composition Si*, P*, Fe*, Ti*, C*, O2 % Slag composition MgO, MnO, CaO, SiO2, Al2O3, BaO, K2O, TiO2, FeO wt-% Loads composition Initial, Liquid, Dross Nm3 Alkalinity levels BB, BQ, BO Nm3 Considering the large number of analyzed variables, a correlation analysis was needed in order to select those which are actually linked to the P concentration levels at the final process. The Pearson correlation test is a statistical tool which indicates the correlation between two variables (to value p<0.01). After the correlation test was applied it became clear that from the 21 initial variables only 4 (four) variables were significantly correlated with P level output variables at the final refining process, as the initial phosphorus level (P*), the initial carbon level (C*), the manganese oxide level (MnO) and the loads composition (liquid). The other variables present in the dephosphorization process haven’t high influence to predict P concentration level at the final process. Table 2 presents the statistical analysis of outcome elements and P concentration level (output variable) and the Figure 1 illustrates the Phosphorus level data at the final refining process. Figure 1: Phosphorus concentration level at the final process Table 2: Summary of the variables selected Group Variable Mean Std. dev. Max Min Correlation Alloys P* 0.216 0.025 0.280 0.155 0.687 Alloys C* 6.759 0.140 7.070 6.077 -0.291 Slag MnO 39.056 5.848 51.993 24.202 0.289 Loads Liquid 9.889 1.543 12.700 5.510 -0.295 Output P 0.260 0.026 0.331 0.204 1.000 The design of the proposed approach was performed using MATLAB©. The chosen output variable to the neural network was the P concentration level at the final Ferromanganese refining process. The relevant input variables selected were: initial phosphorus level (P*), initial carbon level (C*), manganese oxide level (MnO) and loads composition (Liquid). After the pre-processing phase, it was randomly chosen 198 (80 %) observations of the 248 total observations performed for composing the Training group and 50 (20 %) were chosen for the Test group. It was used one hidden layer to projects in the ANN architecture, the Kolmogorov theorem, and the Levenberg-Marquadt algorithm. The Kolmogorov theorem found search-space for our study is 3.0 neurons which are in agreement with the established restriction: N>W and T=198 observations. The modeling process consisted of estimating ANN models with logistic and hyperbolic activation function in the hidden layer to assess which was the most efficient model, using 4 (four) input variables, one output variables, crossover rate 0.8, mutation rate 1.6, 1084 learning rate 0.05, momentum rate 0.1 and population 198 obs. The appendices show the realized performance study for logistic and hyperbolic activation function (see Table 3). The ANN model which better represented the process studied is with 11 neurons in the hidden layer with the logistic activation function, considering the error minimizing statistics and the complexity analysis of the model (see Tables 3). For 11 neurons in the hidden layer, the number of adjustable parameters is W = 67, keeping compliance to N > W. The Table 3 illustrates that as the number of neurons increases towards the top limit (jmax = 14) and the estimates of root mean square error (RMSE) and mean absolute percentage error (MAPE), highlights that the power of estimation from model decreases and its complexity increases. The residuals errors generated by prediction model show a random distribution varying between - 0.04 and 0.05, without seasonal variation (see Figure 2). In general, the fitted ANN model showed a mean percentage error of 0.09 %, and it is possible to consider this ANN model as trustworthy for P concentration level at the final refining process. Table 3: Performance of ANN with logistic activation function Nodes 10 11 12 13 14 RMSE 0.0159 0.0159 0.0159 0.0159 0.0159 MAPE 0.000958 0.000949 0.000956 0.000955 0.000949 AICc 53.25 59.17 65.24 71.28 77.21 Figure 2: Residuals errors from the better ANN model 4. Conclusions The process of extraction of metals, such as zinc, aluminum, iron and manganese for the production of alloys is very important to the industry, detaching the interest for reducing costs, environmental impacts and improvement performance. We propose to determine a prediction model for Ferromanganese refining process in a steelmaker industry that allows estimating the phosphorus concentration levels. The Ferromanganese refining process performed in a steelmaker industry to allow the prediction of P concentration levels. The development of the ANN model with the variables from Ferromanganese refining process was used as input and selected from correlation test. All the variables were tested with a correlation test to verify how each of them is related to the percentage of phosphorus in the final product. These results show that four variables were correlated and those were used to compose the input set of the ANN model. The architecture of neural network was composed by the Kolmogorov theorem, the Levenberg-Marquardt algorithm, and diagnostic criteria to improve the parameters estimates, increase the quality of prediction and decrease the statistical errors. The samples were divided into groups: one was used to the training of the ANN model and a second was used to test the ANN model, those groups were composed of 198 and 50 samples, respectively. The great number of neurons in the hidden layers made the network structure very heavy in a computational way which demands a little more time to achieve the results, and the diagnostic analysis suggests 11 neurons, hyperbolic function to better performance of ANN model. The estimation model showed excellent results with an average percentage error of 0.09 %, supporting the generalization capacity and reliability of the ANN model. Based on this created estimation model it is possible to estimate the impact of certain P concentration levels in FeMnMC beforehand, with a considerable amount of reliability. 1085 The present study has established that dephosphorization process of ferromanganese with alloys composition correlated variables: initial phosphorus (< 0.28 %), initial carbon (< 7.070 %) can be carried out to significant influence to estimate P concentration level. As soon, the manganese oxide level (24.20 – 51.993 wt.%) and loads liquid (5.51 – 12.7 Nm3) for slag and loads composition for dephosphorisation of high-carbon ferromanganese. To future research is possible to investigate non-linear or chaotic behavior between input and output variables, as an interesting point to be studied to make forecasting models. Acknowledgments The authors are grateful to the National Council of Scientific and Technological Development (CNPq) under grants 486707/2013-0 (Universal) and scholarship for the financial support of this work. Reference Bakar S.H.A., Hamid M.K.A., Wan Alwi S.R., Manan Z.A., 2013, Flexible and Operable Heat Exchanger Networks, Chemical Engineering Transactions, 32, 1297–1302. Chaudhary P.N., Goel R.P., Roy G.G., 2001, Dephosphorisation of high carbon ferromanganese using BaCO3 based fluxes, Ironmaking and Steelmaking, 28(5), 396–403. De Castro L.N., 2007, Fundamentals of natural computing: an overview, Physics of Life Reviews, 4, 1–36. Du K-L, Swamy M.N.S., 2013, Neural networks and statistical learning. Heidelberg: Springer. Engelbrecht A.P., 2006, Computational Intelligence: An Introduction. 2nd ed, West Sussex: John Wiley & Sons Ltd. Haykin S., 2008, Neural networks and learning machine. 3rd ed, New York: Prentice Hall. Joo S-W., Hong S-H., Lee G-H, You B-D., 2013, Evaporation behavior of low carbon ferromanganese alloy melt at reduced pressure, Metal and Materials International, 19(3), 585–590. Gao J., You F., 2015, Milfp Model and Algorithms for Network Design and Long-term Planning of Water Management System for Shale Gas Production, Chemical Engineering Transactions, 43, 1423–1428. Osuolale F.N., Zhang J., 2014, Energy Efficient Control and Optimisation of Distillation Column Using Artificial Neural Network, Chemical Engineering Transactions, 39, 37–42. Sant’Anna A.M.O., 2015, Framework of decision in data modeling for quality improvement, The TQM Journal, 27(1), 135–149. Scarselli F., Tsoi A.C., 1998, Universal approximation using feed-forward neural networks: a survey of some existing methods and some new results, Neural Networks, 11(1), 15–37. Um H., Lee K., Kim K-Y., Shin G., Chung Y., 2014, Effect of carbon content of ferromanganese alloy on corrosion behaviour of MgO–C refractory, Ironmaking and Steelmaking, 41(1), 31–27. Vining G., Kuwalski S., 2011, Statistical methods for Engineers, 3rd ed, Boston: Cengage Learning. Yong J.Y., Nemet A., Varbanov P.S., Klemes J.J., Cucek L., Kravanja Z., Mantelli V., 2015, Heat Exchanger Network Modification for Waste Heat Utilisation Under Varying Feed Conditions, Chemical Engineering Transactions, 43, 1279–1284. 1086