JOURNAL OF THEORETICAL AND APPLIED MECHANICS 42, 3, pp. 503-517, Warsaw 2004 FUZZY REACTIVE CONTROL OF WHEELED MOBILE ROBOT Zenon Hendzel Department of Applied Mechanics and Robotics, Rzeszów University of Technology e-mail: zenhen@prz.rzeszow.pl Thispaperproposesa sensorbasednavigationmethodwitha fuzzy combiner for navigationof amobile robot in uncertain environments.Wediscuss a fuz- zy approach to path design and control of simple individual behaviours of a wheeled mobile robot in an unknown 2D environment with static obstacles. A strategy of reactive navigation is developed including three main beha- viours: reaching the middle of a collision-free space behaviour, goal-seeking behaviour andwall-following behaviour. These simple individual behaviours are achieved by means of fuzzy inference systems. It is assumed that each low-level behaviour has been well developed at the design stage and then fused by the fuzzy combiner of behaviours to determine proper actions ac- ting on the environment at the running stage. The fuzzy combiner can fuse low-level behaviours so that the mobile robot can go for the goal position without colliding with obstacles. The fuzzy combiner is a soft switch that chooses more than one low-level action to be active with different degrees through fuzzy combination at each time step. The output of the navigation level is fed into a fuzzy tracking controller that takes into account the dy- namics of the mobile robot. A computer simulation have been conducted to illustrate the performance of the proposed fuzzy combiner of behavio- urs by a series of experiments on an emulator of the wheeled mobile robot Pioneer-2DX. Key words: behaviour control, fuzzy logic, wheeledmobile robots 1. Introduction The development of techniques for autonomous navigation constitutes one of the major trends in the current research on robotics. Many researchers have pointed out the interest of fuzzy logic for mobile robot control. In ge- neral, fuzzy logic has been used to consider uncertainties due to limitations 504 Z.Hendzel in a perception system, unstructured environment, and disturbances in robot dynamics. The most classical type of behaviour used in mobile robots is the trajectory tracking. The trajectory tracking control implies an a priori tra- jectory design to be given for the entire robot operation. The kinematics and dynamics of mobile robots is complex and non-linear, and is hard to model in general. These characteristics led the authors of this paper to use different control approaches: robust control, adaptive control, neural networks and fuz- zy control for the trajectory tracking. All of them have been tested on the real mobile robot Pioneer 2Dx with good results (Giergiel et al., 2000, 2002; Hendzel and Żylski, 1997). Expansion of the range of robot tasks and robot autonomy created a need to generate trajectories on-line. For example, robotsneed to adjust trajectories on-line to avoid collisions with obstacles in the workspace while approaching a given goal point. There are a lot of studies on the trajectory generation for robots using various approaches e.g. Arkin (1998), Berenstain and Koren (1989), Verbruggen et al. (1999), Zalzala andMorris (1996). The artificial po- tential field method is a popular tool for on-line trajectory generation with inherent collision avoidance (Arkin, 1998; Berenstain andKoren, 1989; Żylski, 2002). A comprehensive overview of the reactive navigation, robot behaviour and behaviour-based control field can be found in the books by Arkin (1998), Driankov andSaffiotti (2001),Verbruggen et al. (1999). Several neural network models, e.g. Berenstain andKoren (1989), Giergiel et al. (2000), Hendzel and Żylski (1997), Hendzel (2003), Zalzala and Morris (1996), were proposed to generate real-time trajectories. Although many solutions have already been reported in the literature, the continuing development of new proposals sug- gests that this field has not settled down yet. Several researchers have already argued the importance of looking at amobile robot as a set of elementary be- haviours (Arkin, 1998; Driankov and Saffiotti, 2001; Hendzel, 2003; Latombe, 1991). Elementary behaviours are important components of reactive control in which themobile robotmust continuously interactwith its environment.Reac- tive control means that all decisions are based on currently perceived sensory information (Berenstain and Koren, 1989; Benreguieg et al., 1998; Giergiel et al., 2000; Hendzel, 2003). In this paper, a fuzzy inference system approach is proposed for the collision-free trajectory generation in an unknown environment. A strategy of reactive navigation is developed including three main behaviours: reaching themiddle of a collision-free space behaviour, goal-seeking behaviour andwall- following behaviour. The fundamental idea of the behavioural control is to view mobile robot missions as a simultaneous and temporal execution of a Fuzzy reactive control of wheeled mobile robot 505 set of elementary behaviours. Numerous behaviour co-ordination mechanisms have been proposed. For a detailed overview, discussion and comparison of be- haviour co-ordination mechanisms see the following papers by Driankov and Saffiotti (2001), Lin and Chung (1999), Zalzala and Morris (1996). The be- haviour co-ordination mechanisms can be divided into two main classes: ar- bitration and command fusion (Driankov and Saffiotti, 2001). The command fusionmechanisms provide for a co-ordination scheme that allows all behavio- urs to simultaneously contribute to the control of the system in a co-operative manner. The command fusion approach with a fuzzy mechanism, which allows for weighted combination of behaviours (Driankov and Saffiotti, 2001; Lin and Chung, 1999), is used in this work to solve the task of reactive navigation of amobile robot in uncertain environments. The proposed navigator consists of two main behaviours: reaching the middle of a collision-free space behaviour and goal-seeking behaviour. It is assumed that each low-level behaviour has been well set forth at the design stage and then fused by the fuzzy combiner of behaviours to determine proper actions acting on the environment at the running stage. The fuzzy combiner can fuse low-level behaviours so that the mobile robot can go for the goal positionwithout collidingwith obstacles. The output of the navigation level is fed into a fuzzy tracking controller that takes into account the dynamics of the mobile robot. The structure of the fuzzy controller for a nonholonomic mobile robot is derived using the filtered error approach (Giergiel et al., 2002). This work is an extension of former work (Hendzel, 2004). 2. Fuzzy logic for behaviour design Themechanical structure of the mobile robot, like Pioneer-2DX, is shown in Fig.1. The presented robot has two degrees of freedom. Its posture is defined as [xA,yA,β]>, where (xA,yA) is the position of the point A, and β is the heading angle of the robotwith respect to the absolute coordinates (x,y). The kinematics of the mobile robot is defined by    ẋA ẏA β̇    =    VAmcosβ 0 VAm sinβ 0 0 ωm    [ uv uβ ] (2.1) 506 Z.Hendzel Fig. 1. Mobile robot where themaximum linear speed is denoted by VAm, the angular one by ωm, uv is themultiplying coefficient of themaximum linear velocity of the point A and uβ is the multiplying coefficient of the maximum angular velocity of the frame. 2.1. An overview of the navigator The describedmobile robot is equipped with eight ultrasonic sensor rings as depicted in Fig.1. The position of the sensor si is Li. The sensors are divided into three groups. One group is composed of three, two and three neighbouring sensors. This gives a distance to the obstacle dLi, dFi, dRi in its field of view, respectively, where dmin ¬ d(·) ¬ dmax. Each sensor covers an angular view which is oriented by angles αLi, αFi, αRi, respectively. To solve the trajectory tracking problem for a non-holonomicmobile robot with regard to the vehicle dynamics (Giergiel et al., 2000, 2002, Hendzel and Żylski, 1997; Hendzel, 2003), it is assumed that the current configuration of the mobile robot xd = [xA,yA,β] > (desired kinematics) is generated at each time step by the fuzzy navigator which generates the vector of multiplying coefficients uB = [uv,uβ] > based on the environment information d(·). In this work, three navigation tasks are discussed: reaching the middle of a collision- free space behaviour, goal-seeking behaviour and wall-following behaviour. A diagram of the navigator and controller architecture is shown in Fig.2. 2.2. Reaching the middle of a collision-free space behaviour Let the input variables of the fuzzy navigator are respectively the nor- malized distances measured on the right dnR = dR/(dR + dL), on the left Fuzzy reactive control of wheeled mobile robot 507 Fig. 2. Navigator and controller Fig. 3. Angular (a) and linear (b) velocity coefficient rules, respectivly dnL = dL/(dR + dL) and in front of the robot d n F = dF/η. In the above the following denote: dL =min(s2,s3), dF =min(s4,s5), dR =min(s6,s7) and η is the distance beyondwhich the obstacle is not taken into account. The used navigator is build with fuzzy inference systems based on a set of rules such as: if dnR is Ai and d n L is Bi then uβ is Ci, and if d n F is Di then uv is Ei, where Ai, Bi, Ci, Di, Ei are linguistic labels of the inputs dnR, d n L, d n F and the outputs uβ, uv. The shape of the membership functions is triangular and the 508 Z.Hendzel universes of discourse are normalized between −1 and 1 for uβ andbetween 0 and 1 for uv. The whole rule-base is presented in Fig.3 in two decision tables (Benreguieg et al., 1998). The Max-Prod inference algorithm is used to evaluate the rules, and the center averagemethod is used for the defuzzyfication. To illustrate the perfor- mance of theproposedalgorithms for pathplanningandcontrol, a simulator of themobile robot and its workspace was built in theMatlab/Simulink environ- ment (Hendzel, 2003). An example of the resulting navigation ”reaching the middle of a collision-free space” behaviour and control is shown in Fig.4a,b. The impacts of the sensors is depicted in Fig.4c. Fig. 4. Reaching the middle of a collision-free space behaviour 2.3. Goal seeking behaviour The task of goal seeking behaviour consists in on leading themobile robot to the desired point G as shown in Fig.1. It means minimalization of the distance ‖A,G‖ and the angle ψG which is the angular deviation needed to reach the goal. In this task, thenavigator is builtwitha fuzzy inference system. The whole control rule-base, deducted from a human intuitive experience, is represented by twelve rules and fuzzy sets shown in two decision tables in Fig.5. Figure 6a presents a numerical example of the navigation. The action uβ, uv generated by the fuzzy navigator for the point G(9,9) is shown Fig.6b. The driving torques M1 and M2 generated by the adaptive fuzzy controller are presented in Fig.6c. 2.4. Wall-following behaviour In environments without concave obstacles the combination of analysed behaviours is sufficient for the navigation. In the case of concave obstacles it Fuzzy reactive control of wheeled mobile robot 509 Fig. 5. Angular (a) and linear (b) velocity coefficient rules Fig. 6. Goal seeking behaviour Fig. 7. Membership functions 510 Z.Hendzel is necessary to analyse the strategy of wall-following. The input variables of the fuzzy inference system are error defined ones e =min(dR,dF)−d0, where d0 is a desired distance to the wall (e.g. follow the right wall) and dF gives the distance to the wall. The navigator is the built with the fuzzy inference systembased on a set of rules such as: if e is Ai then uβ is Ci, and if dF is Di then uv is Ei, where Ai, Ci, Di, Ei are linguistic labels of the input signals and the outputs uβ, uv. The shape of themembership functions is triangular. The fuzzy sets are depicted in Fig.7. An example of the resulting navigation with the impacts of the sensors is shown in Fig.8 (follow left wall) and Fig.10b (follow right wall). Fig. 8.Wall-following behaviour The action uβ, uv generated by the fuzzy navigator is presented in Fig.8c. 3. Fuzzy combiner This section introduces structures and functions of the proposed fuzzy na- vigator and its key component, i.e. fuzzy combiner of two behaviours: obstacle avoidance and goal seeking. It is assumed that each low-level behaviour has beenwell formulated at the design stage and then fusedby the fuzzy combiner of behaviours to determine proper actions in the environment at the running stage, as shown in Fig.9. When the mobile robot encounters an obstacle which obstructs the goal, these two behaviours are in conflict. In this paper, we adopt the concept of gatingarchitecture (Driankov andSaffiotti, 2001; LinandChung,1999; Zalzala andMorris, 1996), shown in Fig.10, to solve this conflict. In the proposedmultiobjective fuzzy navigator, we use the fuzzy combiner (FC) to combine low-level modules which are denoted as the obstacle avoider Fuzzy reactive control of wheeled mobile robot 511 Fig. 9. Architecture of the navigator and controller Fig. 10. Basic structure of the fuzzy navigator (OA) and goal seeker (GS), see Fig.10. Eachmodule receives distances sensed by the ultrasonic sensors d(·) andproduces output signals. TheOAdetermines the action uB2 = [uvOA,uβOA] > for the behaviour of obstacle avoidance, while the GS determines the action, uB1 = [uvGS,uβGS] > for the behaviour of goal seeking. These two behaviouralmoduleswork independently and their actions are fused by the FC to produce the action uB = [uv,uβ] > for the navigation. It is assumed that each low-level module has been well designed to serve a particular objective of the requiredmultiple objectives.Many techniques have 512 Z.Hendzel been development to design a single objective. These techniques include fuzzy modelling, neural network, etc. As can be seen in Fig.10, the proposed FC is composed of two elements, a fuzzy context unit and an integration unit. The fuzzy context selects a set of weights ai for the two low-level module actions according to the two control status signals Ji (the ”context”) generated by the environment feedback signal at each time step. The control status signals indicate the status of the control objectives and are defined by Ji = degree(distance of goali) i =1,2 (3.1) where any distance measure can be used. The control status signal Ji indica- tes the degree that the ith control objective is achieved at the current time step. Such information can be obtained by simply checking the status of each control objective independently.Theweights ai producedby the fuzzy context determine the degree of the low-level control action uBi. With these weigh- ting values, the integration unit will carry out linearly weighted summation to combine the two low-level actions into the final action uB as the output of the FC. Due to the powerful capaabilities of the fuzzy modelling, we employ the fuzzy techniques to realise the FC in this paper. 4. Design fuzzy combiner In this section, the proposed FC is applied to two behaviours: obstacle avoidance and goal seeking – to show its performance and applicability. To design the proposed FC, first we need to define two control status signals J1 and J2. Let the input variables of the fuzzy navigator are the measured distances on the right dR =max(s6,s7,s8), on the left dL =max(s1,s2,s3), and at the front dF =min(s4,s5), where d is the distance between the point A and G. Let us define the errors: eR = dR −d, eL = dL−d, eF = dF −d, and ψG – the angular deviation needed to reach the goal G. Control status signals are defined according to the two control objectives J1 = [eL,eR,eF ] > J2 = ψG (4.1) They are normalised to be heldwithin emin ¬ e(·) ¬ emax and −π ¬ ψG ¬ π, respectively. The used FC is built with fuzzy inference systems based on a set of rules such as: if e(·) is Ai and ψG is Bi then a1 is Ci, and if e(·) is Ai and ψG is Bi Fuzzy reactive control of wheeled mobile robot 513 then a2 is ¬Ci, where Ai,Bi,Ci are linguistic labels of the inputs e(·),ψG and of the outputs a1, a2. The shape of the membership functions is triangular. Thewhole rule-base is shown in Fig.11 in six decision tables. The set of terms for J1 and J2 is N-negative, P-positive and N-negative, Z-zero, P-positive, respectively, and the set of terms for a1 and a2 is S-small, L-large. Fig. 11. Rule table for a1 (a) and for a2 (b) In the FC, the inputs are the control status signals J1 and J2, and the output are the weighting values of low-level modules 1 and 2, a1, a2. From the rule tables we observe that module 1 will be activated, i.e. a1 = L, when (J11 > 0 and J2 < 0) or (J12 > 0 and J2 > 0) or (J13 > 0 and J2 = 0). At the same time module 2 is suppressed, i.e. a2 = S. We also observe that module 2 is activated, i.e. a2 = L only when the first goal is suppressed, i.e. a1 = S. The design method for the FC is straightforward. It depends heavily on expert’s knowledge and precise analysis of the discussions problem. 5. Simulation To illustrate the performance of the proposed fuzzy combiner for path planning and control, a simulator of the mobile robot and its workspace was built in the Matlab/Simulink environment first (Hendzel and Żylski, 1997). We carried out simulations in which the rule-base tables, shown in Fig.11, were put into theMax-Min inference algorithm to evaluate the rules, and the centre average method was used for the defuzzyfication. The shape of the membership functions is triangular, see Fig.12. 514 Z.Hendzel Fig. 12.Membership functions of corresponding to the fuzzy terms presented in Fig.13 An example of the resulting fuzzy combiner is shown in Fig.13. The mo- bile robot received a mission to reach the given goal position Gi from the given start position with reaching the middle of the collision-free space. The environment was considered as fixed and completely unknown. Fig. 13. Trajectories of the point A of the mobile robot in the assumed environment The successive activation of different behaviours canbeobserved inFig.13. As shown in Fig.14, for the given goal positions G2 and G5, more than one behaviour is active at the same time, which allows for soft transition. Fuzzy reactive control of wheeled mobile robot 515 Fig. 14. Functions that modulate the weights a1, a2 6. Conclusion A fuzzy navigator, based on expert’s knowledge, well operating in complex and unknown environments is presented in the paper. The principle of the navigator is built on the fusion of obstacle avoidance and goal seeking beha- viours.The output of thenavigation level is fed into a fuzzy tracking controller that takes into account the dynamics of themobile robot.Thenumerical expe- rimental results obtained from an emulator of the mobile robot Pioneer-2DX also confirmed the effectiveness of the proposed path planning and control strategy. Future research will concentrate on applying the proposed solution as a priori knowledge used in the reinforcement learning approach. References 1. Arkin R.C., 1998,Behaviour-Based Robotics, TheMITPress 2. Benreguieg M., Maaref H., Barret C., 1998, Navigation of an auto- nomous mobile robot by coordination of behaviors, Proceedings of 3rd IFAC Symposium on Intelligent Autonomous Vehicles, Madrid, Spain, 589-594 3. Berenstain J., Koren J., 1989,Real-time obstacle avoidance for fastmobile robots, IEEE Transaction on Systems, Man, and Cybernetics, 19, 5, 1179-1186 4. Driankov D., Saffiotti A., Edit., 2001,Fuzzy Logic Techniques for Auto- nomous Vehicle Navigation, Springer-Verlag 5. Giergiel J., Hendzel Z., Żylski W., 2000, Adaptive fuzzy controller of wheeled mobile robot,Proc. of the 5-th Conference Neural Networks and their Applications, Zakopane, 395-400 6. Giergiel J., Hendzel Z., Żylski W., 2002,Modelling and Control of Whe- eled Mobile Robots, WNT,Warsaw (in Polish) 516 Z.Hendzel 7. Hendzel Z., 2003, Neural network reactive navigation and control of wheeled mobile robot, In: Advances in Soft Computing, Rutkowski, Kacprzyk, Eds., Phisica-Verlag 8. Hendzel Z., 2004,Fuzzy combiner of behaviors for reactive control of wheeled mobile robot, In: Advances in Soft Computing, Rutkowski, Kacprzyk, Eds., Phisica-Verlag, to be appear 9. Hendzel Z., Żylski W., 1997, Neural net mobile robot controller, CAE- techniques, Proceedings of 3rd International Scientific Colloquium, Rzeszów, 361-368 10. Latombe J.C., 1991,Robot Motion Planning, Kluwer Academic Publishers 11. Lin C.T., Chung I.F., 1999, A reinforcement neuro-fuzzy combiner for mul- tiobjective control, IEEE Transaction on Systems, Man. and Cybernetics, 29, 6, 726-744 12. Verbruggen H.B., Zimmermann H.J., Babuska R., Edit., 1999, Fuzzy Algorithms for Control, Kluwer Academic Publishers 13. Zalzala A.M.S., Morris A.S., 1996,Neural Networks for Robotic Control, Ellis Horwood 14. Żylski W., 2002, Kinematics of behaviour of wheeled mobile robot, Proce- edings of the XVIII-th Conference on The Theory of Machines and Mechani- sms, Scientific Papers if the Institute of Machine Design and Operation of the Wroclaw University of Technology, 85, Wroclaw, 417-422 (in Polish) Rozmyte sterowanie odruchowe mobilnego robota kołowego Streszczenie W pracy analizuje się zagadnienie nawigacji odruchowej mobilnego robota ko- łowego w nieznanym otoczeniu. Przedstawiono rozmyte podejście do projektowania trajektorii ruchu i sterowania prostych indywidualnych zachowań mobilnego robota kołowego w nieznanym środowisku dwuwymiarowym ze statycznymi przeszkodami. Strategię odruchowej nawigacji zastosowanode realizacji trzech elementarnych zacho- wań: osiągnij środek wolnej przestrzeni, idź do celu, podążaj przy ścianie. Powyższe elementarne zachowania zrealizowano stosując układy z logiką rozmytą. Zakłada się że każde z elementarnych zachowań realizowane przez niższy poziom sterowania jest poprawnie zaprojektowane, a ich koordynacja odbywa się na wyższym poziomie hie- rarchii sterowaniaprzez rozmytyukładkoordynacyjny,któregozadaniem jestwygene- rowaniewłaściwych sterowań umożliwiających realizację zadania nawigacji. Rozmyty układ koordynacji elementarnych zachowań łączy te zachowania generując zadaną trajektorię ruchu wybranego punktu mobilnego robota kołowego. Trajektoria ta jest Fuzzy reactive control of wheeled mobile robot 517 realizowana przez układ niskiego poziomu sterowania ruchem nadążnym mobilnego robota kołowego uwzględniającym dynamikę obiektu sterowania.W celu weryfikacji zaproponowanego rozwiązania przeprowadzono symulacje komputerowe na emulato- rze mobilnego robota Pioneer-2DX. Manuscript received December 15, 2003; accepted for print April 2, 2004