Format And Type Fonts CCHHEEMMIICCAALL EENNGGIINNEEEERRIINNGG TTRRAANNSSAACCTTIIOONNSS VOL. 45, 2015 A publication of The Italian Association of Chemical Engineering www.aidic.it/cet Guest Editors: Petar Sabev Varbanov, Jiří Jaromír Klemeš, Sharifah Rafidah Wan Alwi, Jun Yow Yong, Xia Liu Copyright © 2015, AIDIC Servizi S.r.l., ISBN 978-88-95608-36-5; ISSN 2283-9216 DOI: 10.3303/CET1545208 Please cite this article as: Jacobs I.D., Chirwa E.M.N., 2015, Estimation of reaction parameters for phenol biodegradation using trainable artificial neural networks, Chemical Engineering Transactions, 45, 1243-1248 DOI:10.3303/CET1545208 1243 Estimation of Reaction Parameters for Phenol Biodegradation Using Trainable Artificial Neural Networks Irene D. Jacobs, Evans M.N. Chirwa* Department of Chemical Engineering, University of Pretoria, Pretoria 0002, South Africa emnchirwa@gmail.com Biological systems typically respond non-linearly to the external stimuli such as food availability or toxic exposure. Analytical models based on empirical and semi-empirical representations only simulate a narrow range of conditions. Simulation of the developed kinetic laws on wider scale normally fails. A black box approach is normally applied to simulate responses outside of the studied range. One such system that could be used without worrying about the internal mechanisms is the Trainable Artificial Neural Network (TANN). This system offers the capability to predict the next steps in the behaviour of the system using data from the past. In this study, a multi-layer feed forward neural network was capable of simulating phenol biodegradation without a kinetic model. The network consisted of an input layer with three neurons, a single hidden layer with ten neurons and an output layer with one neuron. Network testing achieved a Mean Squared Error (MSE) of 0.001 with the regression coefficient (R 2 ) of 0.984. The predicted trend was validated by substrate inhibited Monod-type kinetic using actual experimental data from a batch system using a laboratory enrichment of an environmental sample containing Pseudomonas aeruginosa. Both the model and the TANN was further tested against literature data from Chirwa and Wang (2000), where the researchers achieved simultaneous Cr(VI) reduction and phenol degradation using an anaerobic consortium of bacteria containing Escherichia coli ATCC 33456. In all cases tested, using the TANN algorithm ahead of the kinetic model generated a dataset that reduced convergence time during parameter search in the highly non-linear reaction kinetics. 1. Introduction Phenol is a commonly found pollutant in industrial waste effluents from factories of iron-steel, coke, petroleum, pesticide, paint solvent, pharmaceutical, wood processing chemicals, pulp and paper (Kavuri, 2011). The Environmental Protection Agency (EPA) has proposed a water purification standard for phenol in surface water due to the documented effect of phenol on aquatic life. Aromatic compounds and halogenated aromatics are water-soluble and highly mobile, resulting in resistance to degradation (Collins and Daugulis, 1997). For more complex phenolic compounds, analytical solutions for the complex kinetic models arising from complex metabolic pathways are usually impossible to solve. The degradation of the compounds is typically highly inhibited and sometimes interlinked with electron transport in the conversion of inorganic substances such as metals (Chirwa and Smit, 2010). An example of a more simplified version is presented by Chirwa and Wang (2000, 2001) and later articles by the same researchers. I the previous studies, a Monod-type kinetic for phenol degradation was suggested in which phenol degradation transitioned from first-order kinetics at low concentration to zero-order kinetics at high concentrations with substrate level inhibited Monod kinetics at very high phenol concentrations as shown in Eq(1) below: X KSSK Sk dt dS Is ms 2 (1) where kms = maximum substrate (phenol) utilisation rate coefficient (h -1 ), Ks = half velocity concentration (mg.L -1 ) and KI = inhibition coefficient (mg.L -1 ). Simulation of the developed kinetic laws on a wider scale, normally fails when the system is operated under conditions outside the range in which the model was 1244 tested and optimised. Neural network and fuzzy logic systems came into being in an effort to answer the limitation of the mechanistic simulations (Hajmeer and Basheer, 2002). 1.1 Application of Trainable Artificial Neural Networks (TANN) TANNs are nonlinear computational systems used in approximating the behaviour of systems associated with explicit modelling difficulties without assuming the type of relationship and degree of nonlinearity between various independent and dependent variables. TANNs have a wide range of application and the ability to handle problems in noisy and highly complex nonlinear data (Balan et al., 1999). Conventional kinetic modelling entails certain assumptions about kinetic equations and requires experimental estimation of kinetic constants. The assumptions made usually increase calculation errors and model feasibility is limited to reaction conditions in which the model and associated constants were determined. Parallel, real time processing is achieved with TANNs whereas traditional modelling is sequential, logical and restricted by rules/algorithms. TANNs are known as universal function approximators, capable of adaptive learning and self-organization (Anjum et al., 1997). Their current limitation is that they are most interpolative in nature, thus they cannot be relayed upon to estimate points outside the current training dataset. Figure 1: Schematics of a multi-layer artificial neural network. Artificial neurons accept different signals from neighbouring neurons and process them in a predefined simple way. The neuron either fires an output signal or not, depending on the outcome of this processing. The weights in the artificial neuron decide what proportion of the incoming signal is transmitted into the neuron body. A negative weight reflects an inhibitory connection, while positive values designate excitatory connections. The output signal can be either zero or one, or have any real value between zero and one, for binary or real valued artificial neurons. The input signals are normalised between zero and one, and are regarded as non-active layers of neurons serving only to distribute the signals to the first layer of active neurons. The function which calculates the output from the input vector is composed of two parts. The first part evaluates the net input and the first function is a linear combination of the input variables multiplied with the weight coefficients. The second part transfers the net input in a non-linear manner to the output value. The net input is calculated using Eq(2) and the output value is determined using Eq(3). 1 1 m i ijij xwNet (2) jNeti e y 1 1 (3) where Netj = net input, wji = weight coefficient, xi = input variable, and yi = output variable. The subscripts and superscripts m, i, and j are iteration numbers. Bias Bias Input layer Output layer 1245 2. Error Back-Propagation Training Artificial neurons try to mimic the adaption of synapse strength by iterative adaption of weights in neurons, according to the difference between the actual obtained outputs and the targets. Neurons in error back- propagation learning try to yield quantitatively an answer as close as possible to the target, with the order of weight correction enabling accurate determination of the error in each output node. Weight corrections are done according to Eq(4), after each input-target pair has produced an output vector. previousi ji i i i j i ji wyw ΔΔ 1 (4) The assumption is made that the errors have been evenly distributed when the signals were passing from the last hidden layer to the outer layer. The δj l term, calculated using Eq(5) represents the error that occurs on a specific hidden layer. The output layer error, δj last , is calculated according to Eq(6). i j i j k k i j i j i j yyw 1 1 11 for l = 1,..., last – 1 (5) last j last j last jj last j yyyt 1 (6) where k = number of output units, tj = target value, η = learning rate, μ = momentum, and δj = error term. Learning by error back-propagation is carried out in epochs, where one epoch is a period in which all input-target pairs are presented once to the network. After each epoch, the MSE is reported according to Eq(7) (Engelbrecht, 2007). kp yt ErrorSquareMean T p p k k kpk T p 1 1 2 (7) in which p = individual patterns, pT = total number of patterns. The accuracy of the predictions of the neural network is quantified by the MSE difference between the measured and the predicted phenol biodegradation (Arce-Medina and Paz-Paredes, 2009). The MSE is used as the objective function and supervised training aims to reach the smallest possible MSE in the shortest possible time. 3. Artificial Neural Network Setup A multi-layer feed-forward neural network with error back-propagation was used to develop the model. The network was designed with the Neural Network Toolbox, MATLAB (Beale et al., 2013). Published data from earlier studies (Chirwa and Wang, 2000) and later (Chirwa and Wang, 2001) was used in the evaluation of degradation parameters. The inputs of the neural network were time, chromium (VI) (Cr(VI)) concentration and initial phenol loading concentration, and the network output was the percentile of phenol biodegraded. The network’s predicted phenol biodegradation was compared to the actual phenol biodegradation achieved during the laboratory study. It is important to note that the neural network feasibility greatly depended on the range of data used during the design process. The test set data was utilized to evaluate the prediction accuracy of the network using data outside the range covered in the training and control data sets. A log-sigmoid transfer function was used in the hidden layer of the multi-layer network and a linear transfer function was used as function approximator in the final layer of the network. During the learning procedure the experimental data was divided into three sets namely a training set, a control set and a final test set. The training set contained 70 % of the data and the control and final test sets each contained 15 % of the data. In this study, training was performed to minimize the MSE by adjusting the weights and biases. The control set was used to find the best neural network configuration and training parameters. 4. Results and Discussion The neural network was design as a multi-layer feed forward neural network, with an input layer, a single hidden layer and an output layer (Figure 2). Good correlation was achieved during training, shown in Figure 3, with a minimized MSE of 0.001. The neural network design was optimized using the control data set and the optimum network configuration is depicted in Figure 3. The input layer consisted of three neurons, with ten neurons in the hidden layer and one neuron in the output layer. 1246 The control data set achieved high correlation, shown in Figure 4, for the optimized network configuration. The initial phenol loading concentrations utilized during testing exceeded to maximum training and control set data range by 20 %. The correlation between the estimated and experimental biodegradation for the test set data is shown in Figure 5. The neural network exhibited high correlation between experimental and estimated biodegradation for input values exceeding the initial data range. Excellent prediction accuracy with a high regression coefficient of determination (R 2 = 0.984) was obtained, demonstrating the reliability, accuracy and feasibility of the network for future phenol biodegradation estimations. The optimum parameters determined in this study were in agreement within the acceptable range of biological parameters for phenol degradation in the presence of Cr(VI) with less than 5 % error difference in the maximum phenol degradation rate coefficient, kms = 0.033 L.h -1 and Ks = 833 mg.L -1 this study versus kms 0.035 L.h -1 and Ks = 940 mg.L -1 from Nkhalambayausi-Chirwa and Wang (2005). Figure 2: Neural network design architecture. Figure 3: Correlation between estimated and experimental biodegradation for the training set data. Fraction phenol biodegraded Initial phenol concentration Initial Cr(VI) concentration Time P re d ic te d p h e n o l b io d e g ra d a ti o n Experimental phenol biodegradation 0.997 1247 Figure 4: Correlation between estimated and experimental biodegradation for the control set data. Figure 5: Correlation between estimated and experimental biodegradation for the test set data. P re d ic te d p h e n o l b io d e g ra d a ti o n Experimental phenol biodegradation P re d ic te d p h e n o l b io d e g ra d a ti o n Experimental phenol biodegradation 0.997 0.984 1248 5. Conclusions A multi-layer feed forward neural network consisting of an input layer with three neurons, a single hidden layer with ten neurons and an output layer with one neuron, provided good correlations between experimental and estimated biodegradation for the training and control data sets used. The model accuracy extended towards data up to 20 % outside of the training range while maintaining a MSE of 0.001. The regression coefficient of determination (R 2 = 0,984) obtained demonstrated the reliability, accuracy and feasibility of the network for future phenol biodegradation predictions. The neural network only considers the effect of initial phenol loading and inhibitor concentrations on phenol biodegradation, but previous studies have shown that temperature, pH, aeration and agitation also influence phenol biodegradation. It is recommended that the neural network be expanded to incorporate these parameters, allowing computation of optimal control inputs. Acknowledgements The research was funded through the National Research Foundation (NRF) of South Africa through the Focus Areas Programme Grant No. FA2006031900007 and the Incentive Funding for Rated Researchers Grant No. IFR2010042900080 awarded to Evans M.N. Chirwa of the University of Pretoria. References Anjum M.F., Tasadduq I., Al-Sultan K., 1997, Response surface methodology: A neural network approach, Eur. J. Oper. Res., 101, 65-73. Arce-Medina E., Paz-Paredes J.I., 2009, “Artificial neural network modeling techniques applied to the hydrodesulfurization process, Math. Comput. Model., 49, 207-214. Balan M.S., Annadurai G., Sheja R.Y., Srinivasamoorthy V.R., Murugesan T., 1999, Modeling of phenol degradation system using artificial neural networks, Bioprocess Eng., 21, 129-134. Beale M.H., Demuth H.B., Hagan M.T., 2013, Matlab Neural Network Toolbox User’s Guide. MathWorks, Natick, Massachusetts, USA. Collins L.D., Daugulis A.J., 1997, Biodegradation of phenol at high initial concentrations in two-phase partitioning batch and fed-batch bioreactors”, Biotechnol. Bioeng., 55, 155-162. Chirwa E.M.N., Wang Y.-T., 2000, Simultaneous Cr(VI) reduction and phenol degradation in an anaerobic consortium of bacteria, Water Res., 34, 2376-2384. Chirwa E.M.N., Wang Y.-T., 2001, Simultaneous chromium(VI) reduction and phenol degradation in a fixed-film coculture bioreactor: reactor performance, Water Res., 35, 1921-1932. Chirwa E.M.N., Smit E.H.R., 2010, Simultaneous Cr(VI) reduction and phenol degradation in a trickle bed bioreactor: shock loading response, Chemical Engineering Transactions, 20, 55-60, DOI: 10.3303/CET1020010. Engelbrecht A.P., 2007, Computational Intelligence: An Introduction. John Wiley & Sons, Ltd., London, UK. Hajmeer M., Basheer I., 2002, A probabilistic neural network approach for modeling and classification of bacterial growth/no-growth data, J. Microbiol. Meth., 51, 217-226. Kavuri N.C., 2011, Data-Based Modeling: Application in Process Identification, Monitoring and Fault Tetection. M.Tech (Research) Degree, Department of Chemical Engineering, National Institute of Technology, Rourkela, India. Nkhalambayausi-Chirwa E.M., Wang Y.T. 2005, Modeling Cr(VI) reduction and phenol degradation in a coculture biofilm reactor, J. Environ. Eng., 131, 1495-1506. Santos R.R.C., Santos B.F., Fileti A.M.F., da Silva F.V., Zemp R.J., 2013, Application of artificial neural networks in an experimental batch reactor of styrene polymerization for predictive model development, Chemical Engineering Transactions, 32, 1399-1404.