Stochastic Graph Transformation with Regions Electronic Communications of the EASST Volume 29 (2010) Proceedings of the Ninth International Workshop on Graph Transformation and Visual Modeling Techniques (GT-VMT 2010) Stochastic Graph Transformation with Regions Paolo Torrini, Reiko Heckel, István Ráth and Gábor Bergmann 15 pages Guest Editors: Jochen Küster, Emilio Tuosto Managing Editors: Tiziana Margaria, Julia Padberg, Gabriele Taentzer ECEASST Home Page: http://www.easst.org/eceasst/ ISSN 1863-2122 http://www.easst.org/eceasst/ ECEASST Stochastic Graph Transformation with Regions Paolo Torrini1, Reiko Heckel2, István Ráth3 and Gábor Bergmann4 1 pt95@mcs.le.ac.uk 2 reiko@mcs.le.ac.uk University of Leicester 3 rath@mit.bme.hu 4 bergmann@mit.bme.hu Budapest University of Technology and Economics Abstract: Graph transformation can be used to implement stochastic simulation of dynamic systems based on semi-Markov processes, extending the standard approach based on Markov chains. The result is a discrete event system, where states are graphs, and events are rule matches associated to general distributions, rather than just exponential ones. We present an extension of this model, by introducing a hierarchical notion of event location, allowing for stochastic dependence of higher- level events on lower-level ones. Keywords: Graph Transformation, Stochastic Simulation, Topology 1 Introduction Graph transformation combines the idea of graphs as a universal modelling paradigm with a rule-based approach to specify the evolution of systems [KK96]. Behaviour can be modelled in terms of labelled transition systems, where states are graphs and rule applications represent transitions. A discrete event system can be generally obtained by interpreting rule matches as events. Hierarchical graphs can be used to keep into account the spatial structure of graphs in terms of topological grouping, with advantages that have been underlined from the point of view of modelling and verification [BL09]. Stochastic graph transformation is applicable to probabilistic analysis and stochastic valida- tion of graph-based modelling. Stochastic simulation can be particularly useful as validation technique when systems are too complex to be model-checked. It can be implemented relying on a discrete event system approach [CL08]. Transitions are labelled by scheduling times, randomly chosen according to given probability distributions — thus replacing stochastic determinism for indeterminism in the models. A simple form of stochastic graph transformation can be obtained by associating rule names with exponential distributions [HLM06]. The associated Markov-chain analysis has been applied to integrated modelling of architectural reconfiguration and non-functional aspects of network models [Hec05]. However, this approach has some limits. Exponential distributions can express well the relative speed of processes, but are less than suited to describe phenomena characterised by mean and deviation. Generalised stochastic graph transformation can answer this problem, allowing for general distributions to be associated with rule names [KL07] and more generally 1 / 15 Volume 29 (2010) mailto:pt95@mcs.le.ac.uk mailto:reiko@mcs.le.ac.uk mailto:rath@mit.bme.hu mailto:bergmann@mit.bme.hu SGT with regions with rule matches [HT10, KTH09]. In the latter case, assignment of probability distributions to events may depend e.g. on attributes of match elements. Generalised semi-Markov processes provide the discrete event semantics for such systems. In reality, events can often be described at different levels of spatial and causal detail. The ex- pression of causal dependency in graph transformation depends solely on rules and their match- ing. On the other hand, in the approaches that we have considered up to now, each event is treated as stochastically independent from any other with respect to the assignment of proba- bility distributions. Stochastic dependency on global variables and derived attributes has been considered [HT10]. Even allowing that, it is not generally possible to express in a direct way e.g. that the probability of a certain event depends on other events. This can make it particularly hard to model aspects that involve correlation between different levels of description, as in the case of geographic and biochemical systems, where information is usually found at different levels of spatial granularity [TSB02]. In this paper we propose an approach based on hierarchical graphs in order to introduce local- isation and granularity of events, we define a notion of structured stochastic simulation allowing us to express stochastic dependency of higher-level events on lower-level ones, and we provide a semantics for it in terms of discrete event systems. Regardless of the specific approach, a major stumbling block in the implementation of stochastic simulation based on graph transformation is the need to compute all the matches at each step. This is hard in principle — the subgraph homomorphism problem is known to be NP-complete, though feasible in many cases of interest. However, the cost of recomputing can be prohibitive. For this reason, we rely on incremental pattern matching based on a RETE-style algorithm as implemented in VIATRA [BHRV08] (a model transformation plugin of Eclipse). In [THR10] we presented GRASS, a tool that extends VIATRA with a stochastic simulation control based on the SSJ libraries [LMV02]. By using a decoupled notion of graph hierarchy, it should be possible to implement hierarchical stochastic simulation in VIATRA/GRASS. 1.1 Hierarchical extensions Hierarchy in graph models can be used to introduce a notion of topological grouping on model elements. Grouping information can be represented as a hierarchy graph, as distinct from the un- derlying graph, relying on a decoupled approach [BKK05]. In the case of bigraphs, the approach is to pair place graphs and link graphs, together with a specific notion of matching [Mil08]. Here we use topology to localise events, rather than elements, relying on a generic notion of rule matching. A model consists of an underlying graph coupled with a place graph, where the latter is a directed acyclic graph (dag) from which the hierarchy arises, as partial order (≤). Topo- logical grouping arises from rule matching through the hierarchy. Nodes in the dag are places and edges represent containment between places (hierarchical containment). The two graphs are linked together by containment edges (coupling containment) that map underlying graph nodes to places. Regions are defined as downward-closed sets with respect to the hierarchy, i.e. closed sets in the corresponding order topology [TSB02]. From the stochastic point of view, we use hierarchy to let lower-level events affect the assign- ment of probability distributions to higher-level ones. In particular, we allow for the distribution assigned to an event to depend (1) on the enabling of other events, and more generally on the Proc. GT-VMT 2010 2 / 15 ECEASST number of enabled matches of a certain type — what we call a density measure; (2) on the scheduling of other events — what we call an activity measure. In this way, we expect to be able to perform more sensitive stochastic analysis without resorting to making models and reachabil- ity analysis more complex. In fact, density measures boil down to counting matches, therefore they could be handled by introducing additional attributes in a flat model — though this would mean making the model more complex. Activity measures are trickier, as in principle they might lead to circular depen- dencies. This is avoided by requiring that stochastic dependency between events as well as event time scheduling comply with the hierarchy. Therefore, no event may depend on higher level ones, and at each step of the simulation, the time scheduling of lower-level events takes place before that of higher-level ones. 2 Example We model a power grid as example in which higher-level events may depend stochastically on large numbers of lower-level ones. Each power source serves a number of distribution points, by allocating power quotas in a reconfigurable way. Appliances can be added to and removed from a distribution area, and they can be connected to and disconnected from distribution points, determining the level of consumption, which must remain within a tolerance of the quota. A power failure may occur when the quota is overstepped. A failure determines the disruption of the distribution point, with consequent loss of data, and it forces the intervention of a recovery unit. Actual reconfiguration is carried out following optimisation criteria that can be reflected stochastically in the application of the rule. The model is based on the SPO approach, and uses typed graphs with attributes. A power station is connected to each of the distribution points by power lines denoted by multi-edges, i.e. sets of parallel edges represented as a single edge with an integer value. A station can reconfig- ure the capacity of each power line depending on the available power and the distribution area consumption — this takes place by changing the number of line edges, also updating residual power and local quota. The spatial structure of the model is quite simple — there are three types of places: the network area, a supply area for each station, and a distribution area for each service point. Each place is represented as a rounded box. The hierarchy order ≤ is represented as containment (larger boxes are places higher up in the order, therefore associated with higher-level elements). The coupling order is also represented as containment in an obvious way — each underlying graph node (a square box) being coupled only with the smallest place box it is contained in. In this example, the place graph is a tree. The notation could be easily extended to the dag case, by associating places to intersections. The symbols Dec, Inc, Tol, Add, F, G, H, P in the figures stand for given functions. The distinction between higher-level and lower-level events here is comparatively straight- forward. The former ones are those associated with reconfigurations, failures and recoveries, and located in regions generated by supply areas. The latter ones are associated with adding, removing, switching on and off appliances, and are located in regions generated by distribution areas. From the stochastic point of view, actions that depend heavily on external aspects, such 3 / 15 Volume 29 (2010) SGT with regions SwitchOff D:DPoint A:App D:DPoint A:App C2=Dec(C1,X) C1:ACons X:CWeight C2:ACons D:DPoint A:App C2=Inc(C1,X) D:DPoint A:App SwitchOn C1:ACons L:Quota C2:ACons X:CWeight C2L a b c a b c b:DA c:SA Recovery Unit b:DA c:SAa:NA a:NA (a) Failure, Recovery Figure 3: Failure, Recovery as adding appliances, switching on and failure, may be assigned exponential distributions. Ac- tions more plausibly associated with mean values, such as switching off and recovery, may be more naturally associated with normal distributions. Crucially, reconfiguration can dependent stochastically on the context. In the example, there are two possible matches for the reconfiguration rule — one with a1 and the other with a2 as distribution areas. In order to model stochastically a “smart” reconfiguration strategy, one could make the probability of application inversely dependant on the difference between quota and area consumption (a derived attribute associated with distribution points and denoted by D in the picture). However, if that is to be the only criteria, here there is little chance of modelling a high quality of service without changing the model. Given a high rate of switching on against a low one of appliance addition, the area a1 is more at risk than a2, in spite of the higher D value. This risk is essentially associated with the number of matches for switch on in a1, and further than that — with their scheduling times. Of course it would be possible to retain information about the number of appliances in an area explicitly, by adding an attribute — however, apart from the need to extend the model, this way of capturing the density measure would not be the most natural in this case, as it would not belong to the service point. Moreover, it is difficult to think of a similar way to capture an activity measure. On the other hand, the knowledge embedded in the reconfiguration strategy might be based on estimates rather than precise data. Therefore, modelling it in terms of implicit stochastic dependence seems realistic. 3 Stochastic Graph Transformation Stochastic graph transformation for semi-Markov process modelling requires us to track matches through transformation. Incremental pattern matching is indeed based on tracking partial 5 / 15 Volume 29 (2010) SGT with regions y:App s:PS W=1 a1:DA x:App W=1W=1 w:App a2:DA d2:DP D=1 d1:DP D=2 z:App (a) Example Figure 4: Example matches. Here we provide a general definition of typed graph transformation that supports track- ing with respect to a generic approach (although the running example is based on SPO), allowing for node type inheritance and negative application conditions. We then extend the notion, by en- dowing graphs with hierarchy and derived topological structure. 3.1 Graph transformation In existing axiomatic descriptions of graph transformation [KK96], a graph transformation ap- proach is given by a class of graphs G , a class of rules R, and a family of binary relations ⇒r⊆ G ×G representing transformations by rules r ∈ R. Here we assume that each rule r is associated to a left hand-side graph Lr and a set of negative application conditions Nr. This notion can be further refined by introducing a definition of rule match depending on a given ap- proach (including SPO and DPO). In the following, we will sometimes use a syntax for function definitions with dependent types [Bar92] in a comparatively informal way, in order to specify functions that we assume to be implementable, by writing Πx ∈ α.β rather than α → β when x ∈ α and β depends on x. Basically, a graph is a triple G = 〈NodesG, EdgesG, asgG〉, where for x ∈ NodesG, asgG(x) = 〈y, z〉 with y, z ∈ NodesG. At an abstract level, a partial match of a rule r in a graph G can be represented as a triple m = 〈r, g, c〉, where r = rule(m), g = SG(m) is a partial graph morphism from Lr into G, and c = AC(m) ⊆ Nr is the set of application conditions that are satisfied. We denote the graph elements of the match, i.e. the image of m, by EL(m). We say that a match m is valid when SG(m) is total and AC(m) = Nr. We denote by Mr,G the set of the partial matches of r in G. Mr,G is ordered by a relation vr,G, component-wise defined as subgraph and subset relation on the SG and AC components of the matches, respectively. We define the set of all partial matches in a graph G as MG = ⋃ r∈R Mr,G, and by M v G ⊆ MG the set of all those that are valid. Def. 1 We define a function ⇒: ΠG ∈ G .MvG → G . We write G ⇒m H for ⇒ (G)(m) = H, and we say that this is the transformation step determined by the application of rule rule(m) to match SG(m) in graph G. Proc. GT-VMT 2010 6 / 15 ECEASST Notice that transformation steps correspond to a function that is partially defined with respect to the set of all matches. The functional requirement captures the idea that rule application is well-defined and deterministic once a valid match m is found in G. This is needed, in order to guarantee that matches form a proper set, unlike in more abstract presentations [EEPT06]. Also notice that, however, we implicitly allow for a global renaming action associated with a transformation. Def. 2 A graph transformation system (GTS) G = 〈R, G0〉 consists of a set R ⊆ R of rules and an initial graph G0 ∈ G . A transformation in G is a sequence of rule applications G0 ⇒m1 G1 ⇒m2 ···⇒mn Gn using rules in G with all graphs Gi ∈ G . Assuming finite graphs and an enumerable set of rules, the graphs that are reachable from G0 by a finite sequence of transformation steps form a set, denoted by LG, and so do the partial matches over the reachable graphs, denoted by MG. Even without fixing the approach, we can say that in general, a transformation step t = G ⇒m H is associated with a set Delt of all the graph elements that are deleted or modified by t, and with a set Cret of all the graph elements that are created by t. Correspondingly, the partial matches that are destroyed form a set DG,t ⊆ MG, those that are created form a set CH,t ⊆ MH , and those that are preserved are M|t = MG\DG,t . From the deterministic assumption, MH = σt (M|t )∪CG,t follows, where σt is the renaming induced by t (assuming the name spaces are disjoint). We are interested in defining a notion of persistent match, by identifying matches that are preserved through transformation. In particular, from the point of view of stochastic models, given m1 ∈ σt (DG,t ), m2 ∈ DG,t and m2 = σt (m1), we want to identify m1 and m2 when they are both valid with respect to the same rule r. This can be generalised to partial matches. In [KTH09, KL07] we relied on a conservative naming policy — here we adopt a looser one altogether, though still defining a persistent match as equivalence class. In order to abstract from renaming, we define, for n1 ∈ MR,G′, n2 ∈ MR,G′′, the symmetric relation n1 =a n2 that holds whenever for all transformation steps t, if t = G′⇒m G′′ and n1 ∈M|t , then n2 = σt (n1), and if t = G′′ ⇒m G′ and n2 ∈ M|t then n1 = σt (n2). We can then define the transitive closure using the least fixpoint operator (µ ) n1 ≡ n2 =d f µ E.(n1 = n2)∨(∃n3.E(n1, n3)∧n3 =a n2) It is a matter of routine to show that ≡ is indeed an equivalence relation. The persistent matches over the set of the reachable graphs in G can therefore be defined as the quotient class MG = MG/ ≡. We call event a persistent valid match, and we denote with EG the corresponding set of events. We define MG = {[m]|m ∈ MG}, and EG = {[m]|m ∈ MvG}. This in general will allow us to lift definitions from valid matches to events. Lr n1 // = n2 88G t / H Figure 5: persistence of matches 7 / 15 Volume 29 (2010) SGT with regions 3.2 Hierarchical structure We use hierarchical graphs in the decoupled sense [BKK05], i.e. we define a hierarchical graph as graph that includes the underlying graph as well as a directed acyclic graph representing the hierarchical structure. Def. 3 A hierarchical graph is a graph G in which there is a distinguished dag PG ⊂ G called the place graph, where the nodes of PG are the places; (a) the edges that link nodes in G\PG to (non-empty) places, called C-edges, express cou- pling containment and form a distinguished set CEdgesG ⊂ EdgesG; (b) the edges connecting places together (H-edges) express hierarchical containment. Moreover, the following conditions must be satisfied: (1) 〈PG,≤G〉 is a partial order (hierarchy of G), where ≤G is the reflexive-transitive closure of hierarchical containment. (2)