Electronic Communications of the EASST Volume 53 (2012) Proceedings of the 12th International Workshop on Automated Verification of Critical Systems (AVoCS 2012) Proving Linearizability of Multiset with Local Proof Obligations Oleg Travkin, Heike Wehrheim and Gerhard Schellhorn 15 pages Guest Editors: Gerald Lüttgen, Stephan Merz Managing Editors: Tiziana Margaria, Julia Padberg, Gabriele Taentzer ECEASST Home Page: http://www.easst.org/eceasst/ ISSN 1863-2122 ECEASST Proving Linearizability of Multiset with Local Proof Obligations Oleg Travkin1, Heike Wehrheim1 and Gerhard Schellhorn2 1Universität Paderborn, Institut für Informatik, 33098 Paderborn, Germany [oleg82|wehrheim]@uni-paderborn.de 2Universität Augsburg, Institut für Informatik, 86135 Augsburg, Germany [schellhorn]@informatik.uni-augsburg.de Abstract: Linearizability is a key correctness criterion for concurrent software. In our previous work, we introduced local proof obligations, which, by showing a refinement between an abstract specification and its implementation, imply lineariz- ability of the implementation. The refinement is shown via a thread local backward simulation, which reduces the complexity of a backward simulation to an execution of two symbolic threads. In this paper, we present a correctness proof by applying those proof obligations to a lock-based implementation of a multiset. It is interest- ing for two reasons: First, one of its operations inserts two elements non-atomically. To show that it linearizes, we have to find one point, where the multiset is changed instantaneously, which is a counter-intuitive task. Second, another operation has non-fixed linearization points, i.e. the linearization points cannot be statically fixed, because the operation’s linearization may depend on other processes’ execution. This is a typical case to use backward simulation, where we could apply our thread local variant of it. All proofs were mechanized in the theorem prover KIV. Keywords: linearizability, refinement, multiset, concurrency, verification, KIV 1 Introduction With an increasing number of cores per CPU, data structures like lists or sets can be used more and more concurrently. To avoid bottlenecks in a system, such data structures must be designed for maximizing throughput. This is done by applying fine-grained synchronisation schemes or avoiding the use of locks at all and using hardware primitives for synchronisation instead. Due to the fine granularity of synchronisation and interleaving of instructions, verifying such data structures to be correct is hard. Reasoning about correctness of concurrent data structures also means reasoning about their linearizability [HW90]. Linearizability is a safety property. Data structure implementations are said to be linearizable, if for each concurrent execution there exists a sequential execution producing the same result. Operations of linearizable data structures seem to take effect instanta- neously at some point in time between their invocation and response. This point is referred to as the linearization point (LP) or sometimes also as the commit point. The LP does not necessarily have to be a fixed location. Sometimes, the LP is even outside the performed operation, i.e. an operation is linearized by another concurrently running operation. 1 / 15 Volume 53 (2012) Proving Linearizability of Multiset with Local Proof Obligations Proving a data structure to be linearizable usually involves showing a refinement relation be- tween an abstract specification of a data structure and its actual implementation. An abstract specification contains atomic operations only. To show that an implementation is linearizable w.r.t. its abstract specification, one has to prove that the implemented operations non-atomically refine the operations from the abstract specification. Several different approaches exist in order to prove linearizability, such as shape abstrac- tion [ARR+07], rely-guarantee reasoning and separation logic combined [VP07] and our own simulation-based approach [DSW07]. In our approach, we formally defined a general theory relating refinement theory and linearizability following the notion of Herlihy and Wing [HW90]. From our general theory, we could derive proof obligations that are thread local and, once proven for a concrete data structure implementation, imply linearizability of the implementation. In our ongoing work, we aim to determine the class of algorithms and data structures, to which these local proof obligations can be applied. As a case study for this paper, we selected a multiset implementation proposed by Elmas et al., which they verified linearizable [EQS+10]. In contrast to Elmas et al., our aim is to understand why it is linearizable. It is an interesting case study for several reasons: (1) Its operation Insert- Pair adds two elements to the multiset non-atomically. Thus, finding an abstraction function for a refinement proof may be challenging for such an operation as Elmas et al. already pointed out. It seems to be counter-intuitive in thinking of it to have exactly one location for the LP, since the point of element insertion is implemented by two instructions. (2) It includes an operation with an LP that is not statically fixed. In such cases usually a full backward simulation is required. We could avoid this by using our theory extension for potential linearization points [DSW11b]. (3) Another point to mention is that the multiset is blocking, i.e. critical variables are protected by locks. However, the implementation contains a writing instruction to a potentially locked variable. This property forced us to slightly modify our local proof obligation theory. We used the interactive theorem prover KIV [RSSB98] to formalize the implementation, the abstract specification as well as to mechanize all our linearizability proofs. In Section 2, we introduce the multiset implementation, the encoding of the implementation and its abstract spec- ification. Section 3 presents the abstraction function that we used for the refinement proofs. We give a brief overview of our local proof obligations in Section 4, followed by an explanation of the invariants used for the proofs in Section 5. In Section 6, we discuss why we modified our local proof obligations. Finally, Section 7 concludes. 2 Multiset as a Case Study As a case study for our local theory, we chose an implementation of a multiset data structure of integers from Elmas et al. [EQS+10]. The parts of the implementation we considered in our proof are only the three operations presented in Figure 1 as Elmas et al. published it. A delete operation or an insert operation for single elements were not considered to limit the proof effort on the one hand and to gain results that can be compared on the other. The multiset is implemented as an array containing elements of type Slot. A slot encodes its value by the integer attribute elt. A second attribute stt of type Status encodes the state of insertion. A slot may be in empty, reserved or full state. An integer i is supposed to be in the set, Proc. AVoCS 2012 2 / 15 ECEASST enum Status = { empty,reserved,full }; InsertPair(x:int, y:int) returns {r:bool} record Slot { elt: int, stt: Status }; var i,j: int; var M: array[0..N-1] of Slot I1 i := FindSlot(x); I2 if (i == N) { LookUp(x:int) returns(r:bool) I3 r := false; var i: int; I4 return;} L1 for (i := 0; i < N; i++) { I5 j := FindSlot(y); L2 lock(M[i]); I6 if (j == N) { L3 if (M[i].elt==x && M[i].stt==full) { I7 M[i].stt := empty; L4 unlock(M[i]); I8 r := false; L5 r := true; return; I9 return; } L6 } else unlock(M[i]); L7 } //i++ evaluation I10 M[i].elt := x; L8 r := false; return; I11 M[j].elt := y; atomic FindSlot(x:int) returns (r:int) I12 lock(M[i]); 1 if (forall 0<=i