JOURNAL OF THEORETICAL AND APPLIED MECHANICS 43, 3, pp. 575-591, Warsaw 2005 ONLINE OPTIMIZATION OF A PREVIEW CONTROLLER – STRUCTURE AND ALGORITHMS 1 Joachim Lückel Eckehard Münch Henner Vöcking Thorsten Hestermeyer University of Paderborn e-mail: Joachim.Lueckel@mlap.de; muench@mlap.de; Henner.Voecking@mlap.de; Thorsten.Hestermeyer@mlap.de Active suspension systems are used to increase ride comfort and safety of vehicles. Optimal results can be achieved if disturbances from the track are known in advance. Usually, this causes a problem as track excitations cannot bemeasured until they take effect on the vehicle. Here, we present an appro- ach to disturbance compensation for railway vehicles coupled in a network. Information on stationary arising track disturbances are gathered by vehicles and stored locally at track sections.By repeated runs over a respective section this information is iteratively optimized and can be used for the disturbance compensation in subsequent vehicles. Thedevelopedoptimizationalgorithm isdescribed.Design criteria arederived fromdigital control theory.Theprocedurewas implemented ona testbed for a semi-vehiclewith threedegreesof freedom,where its usabilitywasproved.The resultsarediscussedand,finally, someaspectsofgeneralizationareconsidered. Keyword:mechatronics, learning,distributedoptimization, active suspension, railway systems 1. Introduction Today, active suspension systems are well established in theory and prac- tice. This holds especially true for automotive applications. When looking at 1 This workwas developed in the course of the ”Collaborative ResearchCenter 614 – Self- Optimizing Concepts and Structures inMechanical Engineering” –University of Paderborn, and was published on its behalf and funded by the Deutsche Forschungsgemeinschaft. 576 J.Lückel et al. the railway industry, vehicles with active suspensions are available, but these systems usually focus on the tilt and centering of the coach body rather than on ride comfort. However – even if rare – there has also been some work on active damping in industry (Streiter et al., 2001) and public research. Thiswork uses, as an application example, the system setup of the railway system ”Neue Bahntechnik Paderborn”, which is described in more detail in Section 2. Most of the vast literature on the control of active suspension systems focuses on single vehicles. Collaborative vehicle networks, however, offer a promising way to improve ride comfort even further. This article shows that it is possible to reduce bodymotion by a great extent by using the experience gainedbyother vehicles. In order to do so, two things are necessary: Firstly, an algorithm is required that determines information about the track excitation and uses this in the control algorithm of the active suspension. Secondly, a collaborative network with communication infrastructure has to be set up. Here, we focus on the first step. The article is structured as follows: Section 2 gives a brief overview of an examplary suspension system and its underlying control structure. Section 3 presents the basic idea for the overall system setup including the optimization algorithm in the collaborative vehicle network. With this setup in mind, Section 4 develops the learning algorithm. In order to show the applicability of the algorithm and its benefits, the system was implemented on a suspension test bed described in Section 5, where the results arediscussedaswell.Thearticle concludeswithanoutlook inSection6. 2. Active suspension control The railway system ”Neue Bahntechnik Paderborn” (NBP) (Hestermeyer, 2003) features small autonomous railway vehicles of van sizewitha fully-active suspension system (Fig.1). The suspension system is used for both tilting of the car body and for the adjustment of the spring and damper characteristics. The car body and bogie are connected by air springs without any passive dampers. This is for enhancing the comfort as passive dampers harden if their excitation contains high frequencies. The function of passive dampers is taken on by an active system of hy- draulic cylinders that create damping forces by displacing spring bases. The displacement vector xactive yields necessary cylinder displacements lcyl,i by computing the inverse kinematics of the cylinder arrangement (Fig.2). As the Online optimization of a preview controller... 577 Fig. 1. Structure of active suspension input for the controller, relative positions with respect to velocities between the car bodyand the bogie are used,which are gained fromthemeasured leng- ths of hydraulic cylinders and position sensors in parallel to the air springs. In this way, dynamics of the entire system can be customized arbitrarily by adju- sting the parameters of the controller, and therefore the rules for calculating the additional active forces. Fig. 2. Control structure for relative damping 3. Preview systems for active suspensions Whendesigning an active suspension, it is important to put special care on the employed sensor concept and the control strategy as bothplay amajor role in the success of the system. One important aspect became clear alreadywith first realizations of active suspension systems. The disturbance compensation using information about the ground excitation can improve the ride comfort considerably. Jäker (1990) e.g. useda disturbance compensator as apart of the control law for the active suspension in an off-road truck with great success2. 2The information about wheel excitation was derived from an observer based on signals from accelerometersmounted on the axles. 578 J.Lückel et al. The way in which the ground excitation is determined has significant in- fluence on the compensation result. Due to actuator dynamics, it is vital to know the ground excitation as early as possible. In the optimal case, the exci- tation is known before it actually hits the wheel. This is known as ”preview”. Preview information for rear wheels can be gained by using information from front wheels (so called ”internal preview”). This is quite a convenient way for disturbance compensation in trains, where the locomotive can collect track information and transfer it to the carriages. In short vehicles like cars howe- ver, the influence of the front wheels on comfort is very high and the internal preview provides only small benefit (Rutz, 1987). The preview information for the front wheels would therefore help to improve ride comfort even more. Unfortunately, looking at a single vehicle, collecting the preview information for the front wheels is an arduous and costly business. (Donahue (2001) e.g. describes a military external preview system with an expensive radar and optical sensors.) A much simpler way to obtain the desired information can be found for vehicles integrated in a network. Ioannou (1998) proposed such an infrastructure-supported network for highway vehicles. In this work, we concentrate on the railway system ”Neue Bahntechnik Paderborn” (NBP) (Hestermeyer, 2003), which supplies perfect infrastructure for the new preview system presented here. The shuttles are propelled by a double-fed asynchronous linear motor. For the implementation of the motor, the track is divided into sectors which are equipped with their own frequency converters and computer hardware. The creation of propelling forces requires fast communication between the shuttles and the track. Figure 3 shows the information and communication structure of the NBP-system (Zanella et al., 2002). The available computation power and communication infrastructure can be used to set up a preview system for the active suspension (see also Hestermeyer et al., 2004; Münch et al., 2004). Fig. 3. Communication structure for the railway-system ”Neue Bahntechnik Paderborn” (Zanella et al., 2002) Online optimization of a preview controller... 579 The system structure shown in Fig.3 suggests the following set-up for the determination of the track excitation. In the first step, the track is logically divided into different sections, and the agent network is allocated to the track. One track agent is allocated to each section (Fig.4)3. Fig. 4. Determination of preview information bymulti-agent optimization When a shuttle wants to enter a special section, it contacts the track agent and receives in return an estimation of the track excitation it can use for disturbance compensation4. After completing the section, the shuttle answers with a performance rating which is used by the track agent to optimize the trajectory. In the case of a communication error, thedisturbance compensation is simply turned off. This results in less comfort but is otherwise uncritical. Apart from improving the ride comfort by optimal disturbance compen- sation, this method offers an excellent way of monitoring the track quality, as the track information is continously updated with each shuttle. Special measurement runs can be reduced or even excluded at all. 3Comparing Fig.3 and Fig.4, it seems obvious to select sections according to the motor sectorsanddownload the trackagent softwareon the available sectorhardware. However, this is not a prerequisite. The multi-agent software can also be run on centralised hardware. 4Dynamics of the respective shuttle has to be considered when using the preview information.Otherwise, the optimizationof thepreview information in the trackagent might yet converge, but is now valid only for shuttles with similar dynamics. 580 J.Lückel et al. 3.1. Extended control structure As alreadymentioned in the introduction, the work presented here focuses on the realization of the disturbance compensation and the trajectory opti- mization disregarding communication issues and questions arising from the multi-agent implementation. Figure 5 shows the structure of the self-learning control system including the learning algorithm. Fig. 5. Control structure featuring learning algorithm The basis of the active suspension control is a simple feedback law (block ”controller”) assuring sufficient damping of the car body as described in Sec- tion 2. In order tominimize the absolutemovement of the car body, an additional relative displacement signal f is introduced, which includes the reference and disturbance information in function of the shuttle position s (Hestermeyer et al., 2004). The table f =(si,fi) determines f from s by interpolation. Based on the system response evaluated by a block objective generation, the superposed learning algorithm computes a trajectory that reduces the influence of disturbances in the track by adding the signal f to the relative displacement between body and bogie. So far, this concept suggested the usage of the track excitation as di- sturbance compensation. This requires knowledge of the vehicle and actuator dynamics when using the excitation trajectory in the controller. In the first step, this dynamics was not explicitly considered, so that the car body and actuator dynamics was reflected in determined trajectories. Online optimization of a preview controller... 581 4. Learning algorithm During the runover a track section, different disturbances affect the chassis of a shuttle. These disturbances can be distinguished into stochastic distur- bances and stationary disturbances,which recur at the sameplace of the track section. The learning algorithmpresentedhere identifies and compensates the- se stationary disturbances on the chassis. The objective is to keep the car body of the shuttle as still as possible in order to improve the comfort of passengers. Asdescribed inSection 3, the learningalgorithmdetermines a trajectory as a sequence of numbers fki , where k indicates the step number of the learning process and thus the number of shuttles that have crossed the section. The shuttle measures movements of the car body during the passage over the track section. Afterwards the data is given back to the learning algorithm, it determines the new sequence fk+1i . 4.1. Learning algorithm As the learning algorithm, a computation instruction of the form f k+1 i = f k i −Kay k j (4.1) with j = i+h (4.2) was chosen. The value Ka gives the learning factor of the algorithm and y k j the deviation of the car body position. The value h reflects dynamics of the car body and indicates a shift of the fki signal with respect to the associated measuring point. This shift is chosen according to the cut-off frequency of the car body dynamics T and the travel speed of the shuttle v. For the passage over the regarded track section, a constant speed v(t)= const (4.3) is assumed. 4.2. Convergence analysis The learning algorithm inEq. (4.1) is verymuch similar to the description of discrete controllers. They differ in the meaning of the counting variable k, whichdescribes the progres of timewith thediscrete controllers. In thepresen- ted learning algorithm, the variable k indicates a new run over the respective track section. 582 J.Lückel et al. It is obvious to analyze the convergence characteristics of the learning algorithm by means of well-established digital control-engineering methods (Hanselmann, 1984). Therfore, it is necessary to describe shuttle-dynamics by a mathematical model. In order to treat the supporting points independently of each other, some simplificationsmust bemade for the convergence analysis of the learning algorithm. For the feed-forward signal, a simple step-function in place of the interpolation function is used. Furthermore, an ideal reference reaction of the car body is assumed. The system response of the shuttle can be described with the simple model y k j =Kp(f k i +u k i ) (4.4) in which ui describes the track disturbance at the supporting point i. Inserting it into learning algorithm equation (4.1) yields f k+1 i = f k i −KpKa(f k i +u k i ) (4.5) Theuseof theZ-transformationwithEq. (4.5) results ina transfer function G(z)= fi(z) ui(z) = KpKa 1−KpKa−z (4.6) In order to analyze the stability of the system, poles of (4.6) can be used z=1−KpKa (4.7) For stable system behaviour, the pole stays within the unit circle 0