INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL ISSN 1841-9836, 9(4):497-509, August, 2014. Optimized Branch and Bound for Path-wise Test Data Generation Y.W. Wang, Y. Xing, Y.Z. Gong, X.Z. Zhang Ya-Wen Wang (1,2), Ying Xing* (1,3), Yun-Zhan Gong (1), Xu-Zhou Zhang (1) (1) State Key Laboratory of Networking and Switching Technology Beijing University of Posts and Telecommunications, Beijing, China 10 Xitucheng Road, Beijing, China E-mail: wangyawen@bupt.edu.cn, gongyz@bupt.edu.cn, laomao22311@126.com (2) State Key Laboratory of Computer Architecture Institute of Computing Technology, Chinese Academy of Sciences (3) School of Electronic and Information Engineering Liaoning Technical University *Corresponding author: lovelyjamie@yeah.net Abstract: The increasing complexity of real-world programs necessitates the au- tomation of software testing. As a basic problem in software testing, the automation of path-wise test data generation is especially important, which is in essence a con- straint satisfaction problem solved by search strategies. In this paper, the search algorithm branch and bound is introduced and optimized to tackle the problem of path-wise test data generation. The optimized branching operation is fulfilled by a dynamic variable ordering algorithm with a heuristic rule to break ties. The optimized bounding operation is accomplished by analyzing the results of interval arithmetic. In order to facilitate the search methods, the solution space is represented as state space. Experimental results prove the effectiveness of the optimized branching and bounding operations, and show that the proposed method outperformed some other methods used in test data generation. The results also demonstrate that the proposed method is applicable in engineering. Keywords: test data generation, constraint satisfaction problem, branch and bound, state space search. 1 Introduction With the surge of increasingly complex real-world software, software testing plays a more and more important role in the process of software development [1]. In 2002, National Institute of Standards and Technology (NIST) found that over one third of the cost of software failure could be eliminated by an improved testing infrastructure [2]. But manual testing is time-consuming and error-prone, and is even impracticable for real-world programs. So the automation of testing is of crucial concern [3]. Furthermore, as a basic problem in software testing, path-wise test data generation (denoted as Q) is of particular importance because many problems in software testing can be transformed into Q. The methods of solving Q can be categorized as dynamic and static. The dynamic methods require the actual execution of the program under test (PUT), and the metaheuristic (MHS) [4] methods are very popular. Recently, the MHS method particle swarm optimization (PSO) [5] has become a hot research topic due to its convenient implementation and faster convergence speed. But dynamic methods often consume a large number of iterations, and the definition of objective function is also a big problem. The static methods utilize techniques including symbolic execution [6] and interval arithmetic [7] to analyze the PUT without executing it. The process of generating test data is definite with relatively less cost. They abstract the constraints to be satisfied, and propagate and solve these constraints to obtain the test data. Due to their precision in generating test data and the ability to prove that some paths are infeasible, the static methods Copyright © 2006-2014 by CCC Publications 498 Y.W. Wang, Y. Xing, Y.Z. Gong, X.Z. Zhang have been widely studied by many researchers. Demillo and Offutt [8] proposed a fault-based technique that used algebraic constraints to describe test data designed to find particular types of faults. Gotlieb et al. [9] introduced static single assignment into a constraint system and solved the system. Cadar et al. from Stanford University proposed a symbolic execution tool named KLEE [10] and employed a variety of constraint solving optimizations. In 2013, Wang et al. [11] proposed an interval analysis algorithm using forward dataflow analysis. No matter what techniques are adopted, static methods require a strong constraint solver. In this paper, considering the drawbacks of the dynamic methods and the demand for static methods, we propose a new static test data generation method based on Code Test System (CTS)(http://ctstesting.com), which is a practical tool to test codes written in C programming language. Our contribution is threefold. First, path-wise test data generation is defined as a con- straint satisfaction problem (CSP). Two techniques (state space search and branch and bound) in artificial intelligence are integrated to tackle the CSP. Second, the branching operation is optimized with a heuristic variable ordering algorithm. Third, the bounding operation is opti- mized in different stages of the search to reduce the search space greatly. Through experimental results, we try to evaluate the performance of our method, especially the optimized branching and bounding operations. We also make comparison experiments to find whether our method outperforms some currently existing test data generation methods in terms of coverage. 2 Problem definition and solving strategies 2.1 Problem definition A control flow graph (CFG) for a program P is a directed graph G=(N,E,i,o), where N is a set of nodes, E is a set of edges, and i and o are respective unique entry and exit nodes to the graph. Each node n∈N is a statement in the program, with each edge e=(nr , nt)∈E representing a transfer of control from node nr to node nt . Nodes corresponding to decision statements such as if statements are branching nodes. Outgoing edges from these nodes are referred to as branches. A path through a CFG is a sequence p=(n1 , n2 , . . ., nq ), such that for all r, 1≤rx2) 2: if(x1+x2==100) 3: if(x1>=20) 4: if(x1-x2==30) 5: printf("Solved! "); } Figure 1: Program test and its corresponding CFG 504 Y.W. Wang, Y. Xing, Y.Z. Gong, X.Z. Zhang Table 2: DVO process for x 1 and x 2 Ordering rule Condition of each variable Tie encountered? Ordering result Domain size |D1|=69, |D2|=69 Yes x 1→x 2 Rank1 Rank1(x 1)=1, Rank1(x 2)=1 Yes Rank2 Rank1(x 1)=2, Rank1(x 2)=2 Yes Rank2 Rank2(x 1)= 3, Rank2(x 2)=∞ No 2={ 1:[100,100]-[1,99]=[1,99] ; 2:[100,100]-[2,100]=[0,98]}D x x 1={ 1:[1,100] ; 2:[1,100]}xD x x1-x2=30 4={ 1:[1,98]+[30,30]=[31,128] ; 2:[20,99]-[30,30]=[-10,69]}D x x 5={ 1:[20,99] [31,128] [31,99] ; 2:[1,98] [-10,69] [1,69]}xD x x1>x2 1={ 1:[2,100] ; 2:[1,99]}x xD 2 ={ 1:[1,100] [2,100] [2,100] ; 2:[1,100] [1,99]=[1,99]}xD x 3={ 1:[20,98]}D x 4 ={ 1:[2,99] [20,98] [20,98] ; 2:[1,98]}x xD x1 20 x1+x2=100 3={ 1:[2,100] [1,99] [2,99] ; 2:[1,99] [0,98]=[1,98]}xD x Figure 2: The IDR process 4 Experimental Analyses and Empirical Evaluations To observe the effectiveness of BFS-BB, we carried out a large number of experiments in CTS. Within the CTS framework, the PUT is automatically analyzed, and its basic information is abstracted to generate its CFG. According to the specified coverage criteria, the paths to be traversed are generated and provided for BFS-BB as input. The experiments were performed in the environment of MS Windows 7 with 32-bits, Pentium 4 with 2.8 GHz and 2 GB memory. The algorithms were implemented in Java and run on the platform of eclipse. 4.1 Performance evaluation The number of variables is an important factor that affects the performance of test data generation methods [17]. Hence, in this part, experiments were carried out to evaluate the effectiveness of the optimized branching and bounding operations for varying numbers of input variables. Testing the optimized branching operation This part presents the comparison between our branching algorithm DVO (A) which is also BFS-BB and the method which orders variables only by remaining domain sizes (B). The other operations (the bounding operations) in the two methods were both accomplished by IDR and HC. The comparison was accomplished by repeatedly running the two methods on generated test programs having input variables x1 , x2 ,. . ., xn where n varied from 2 to 50. Adopting statement coverage, in each test the program contained n if statements (equivalent to n branching conditions or n expressions along the path) and there was only one path to be traversed of fixed length, which was the one consisting of entirely true branches. In each test, the expression of Optimized Branch and Bound for Path-wise Test Data Generation 505 Table 3: The hill climbing process for x 1 D1 V1 F (Vi) |F (Vi)| Peak reached? [31, 99] 80 30 30 No [50, 79] 60 -10 10 No [61, 70] 65 0 0 Yes the ith (1≤i≤n) if statement was in the form of [a1, a2, . . . , an][x1, x2, . . . , xn]′rel − op const[i] (3) In formula(3), a1, a2 ,. . ., ai were randomly generated numbers either positive or negative, rel -op∈{>,≥,<,≤,=, ̸=}, and const[j ] (j ∈[1, i]) was an array of randomly generated constants within [0, 1000]. The randomly generated aj and const[i] should be selected to make the path feasible. This arrangement constructed the tightest linear relation between the variables. The programs for various values of n ranging from 2 to 50 were each tested 50 times, and the average time required to generate the data for each test was recorded. The results are presented in Figure 3. Figure 3: Test result of DVO In Figure 3, (a) shows that A had a better performance than B, but it was not very obvious when the number of variables (expressions) was not very large, because there was no requirement for an optimized ordering algorithm, since remaining domain size was enough to determine the next variable to be instantiated. So the more variables, the better DVO works. For BFS-BB (A), it is clear that the relation between average generation time and the number of variables can be represented as a quadratic curve very well as shown in (b) and the quadratic correlation relationship is significant at 95% confidence level with p-value far less than 0.05. Besides, average generation time increases at a uniformly accelerative speed as the increase of the number of variables. The differentiation of average generation time indicates that its increase rate rises by y=9.5294x -120.53 as the number of variables increases. We can roughly draw the conclusion that generation time using DVO is very close for n ranging from 1 to 25, while it begins to increase when n is larger than 25. So DVO will be very useful for PUTs with more variables, especially the large-scale real-world programs. Testing the optimized bounding operation This part presents the comparison between our bounding algorithm HC (A) which is also BFS-BB and the method without HC (B). The other operations (DVO and IDR) in the two 506 Y.W. Wang, Y. Xing, Y.Z. Gong, X.Z. Zhang methods were totally the same. The comparison was accomplished by repeatedly running the two methods on generated test programs having input variables x1 , x2 ,. . ., xn where n varied from 1 to 50. Adopting statement coverage, in each test the program contained 50 if statements (equivalent to 50 branching conditions or 50 expressions along the path) and there was only one path to be traversed of fixed length, which was the one consisting of entirely true branches. The expression of each if statement was in the form of [a1, a2, . . . , an][x1, x2, . . . , xn]′rel − op const[c] (4) In formula(4), a1, a2 ,. . ., ai were randomly generated numbers either positive or negative, rel -op∈{>,≥,<,≤,=, ̸=}, and const[c] (c∈[1, 50]) was an array of randomly generated constants within [0, 1000]. The randomly generated ai (1≤i≤n) and const[c] should be selected to make the path feasible. This arrangement constructed the tightest linear relation between the variables. In addition, we ensured that there was at least one “=” in each program to test the equation solving capability of the methods. The programs for various values of n ranging from 1 to 50 were each tested 50 times, and the average time required to generate the data for each test was recorded. The comparison result is presented in Figure 4. Figure 4: Test result of HC We exponentiated the axis representing average generation time as shown in (a). It can be seen that the average generation time of A is far less than B. For BFS-BB (A), it is clear that the relation between average generation time and the number of variables can be represented as a quadratic curve very well as shown in (b) and the quadratic correlation relationship is significant at 95% confidence level with p-value far less than 0.05. Besides, average generation time increases at a uniformly accelerative speed as the increase of the number of variables. The differentiation of average generation time indicates that its increase rate rises by y=1.06x -8.6817 as the number of variables increases. We can roughly draw the conclusion that generation time using HC is very close for n ranging from 1 to 8, while it begins to increase when n is larger than 8. 4.2 Coverage evaluation To evaluate the capability of BFS-BB to generate test data in terms of coverage, we used some real-world programs to compare BFS-BB with both static and dynamic methods adopted in test data generation. Comparison with a static method This part presents the results from an empirical comparison of BFS-BB with the static method [11] (denoted as "method 1" to avoid verbose description), which was implemented in CTS prior Optimized Branch and Bound for Path-wise Test Data Generation 507 Table 4: The details of comparison with method 1 Project program Function AC by method 1 AC by BFS-BB qlib sin.c radian 55% 100% floor.c ceil 66% 100% dell8i-2 asinl.c acosl 55% 100% tanl.c cotl 66% 100% Table 5: Parameter setting for PSO Parameter Value Population size 30 Max generations 100 Inertia weight w Ranging from 0.2 to 1 Acceleration constants c1 and c2 c1 =c2 =2 Maximum velocityVmax Set according to the input space of the tested program to BFS-BB. The test beds were from two engineering projects at http://www.moshier.net/. The comparison adopted statement coverage as the adequacy criterion. For each test bed, the experiments were carried out 100 times, and average coverage (AC) was used for comparison. The details of the comparison are shown in Table 4. From Table 4, it can be seen that BFS-BB reached higher coverage than method 1 for all the test beds as shown in bold. That is largely due to the optimization methods utilized in BFS-BB. The applicability in engineering remains one of our primary goals in the future. Comparison with PSO This part presents results from an empirical comparison of BFS-BB with PSO, which is mentioned in Section 1 as a popular MHS method with relatively fast convergence speed. Table 5 is a brief introduction to some parameters used in PSO. We used three real-world programs, which are well-known benchmark programs and have been widely adopted by other researchers [18]- [20]. Branch coverage was taken as the adequacy criterion. For each test bed, the experiments were carried out 100 times, and AC was used for comparison. Table 6 shows the details of the test beds and the comparison results. Obviously BFS-BB achieved 100% coverage as shown in bold on all the three benchmark programs, which are rather simple programs for BFS-BB, and it outperformed the algorithm in comparison. The better performance of BFS-BB is due to two factors. The first is that the initial values of variables are selected by heuristics on the path, so BFS-BB reaches a relatively high coverage for the first round of the search. The second is that the optimized bounding operation is conducted not only in the state space search stage but in the initialization stage as well, which reduces the domains of the variables to ensure a relatively small search space that follows. Table 6: The details of comparison with PSO Program LOC Branches Variables AC by PSO AC by BFS-BB triangleType 31 3 5 99.88% 100% cal 53 18 5 96.85% 100% calDay 72 11 3 97.35% 100% 508 Y.W. Wang, Y. Xing, Y.Z. Gong, X.Z. Zhang 5 Conclusions and Future Works The increasing demand of testing large-scale real-world programs makes the automation of the testing process necessary. In this paper, path-wise test data generation (Q) which is a basic problem in software testing is defined as a constraint satisfaction problem (CSP), and the algorithm best-first-search branch and bound (BFS-BB) is presented to solve it, combining two techniques in artificial intelligence which are state space search and branch and bound (BB). The branching and bounding operations in BFS-BB are both optimized. For the branching operation, dynamic variable ordering (DVO) is proposed to permutate variables with a heuristic rule to break ties. The bounding operation is optimized in both stages of BFS-BB. Initial domain reduction (IDR) functions in the initialization stage to reduce the search space as well as detect infeasible paths. In the state space search stage, the process of determining a fixed value for a specified variable resembles climbing a hill, the peak of which is the value judged by interval arithmetic that does not cause a conflict. To facilitate the search procedure, the solution space is represented as state space. Empirical experiments show that the optimized branching operation is especially useful for large-scale programs, while the advantage of the optimized bounding operation hill climbing (HC) is very obvious. The results also show that BFS-BB outperforms some current static and dynamic methods in terms of coverage. Our future research will involve how to generate test data to reach high coverage. The effectiveness of the generation approach continues to be our primary work. Acknowledgment This work was supported by the National Grand Fundamental Research 863 Program of China (No. 2012AA011201), the National Natural Science Foundation of China (No. 61202080), the Major Program of the National Natural Science Foundation of China (No. 91318301), and the Open Funding of State Key Laboratory of Computer Architecture (No. CARCH201201). Bibliography [1] Michael R. Lyu; Sampath Rangarajan; Ada P. A. van Moorse. (2002); Optimal allocation of test resources for software reliability growth modeling in software development, IEEE Transactions on Reliability, ISSN 1841-9836, 51(2): 183-192. [2] Tassey Gregory.(2002);The economic impacts of inadequate infrastructure for software test- ing, National Institute of Standards and Technology, RTI Project 7007.011. [3] Weyuker Elaine J.(1999); Evaluation techniques for improving the quality of very large soft- ware systems in a cost-effective way, Journal of Systems and Software, ISSN 0164-1212, 47(2): 97-103. [4] Shaukat Ali, Lionel C. Briand; Hadi Hemmati; Rajwinder K. Panesar-Walawege. (2010); A systematic review of the application and empirical investigation of search-based test case generation, IEEE Transactions on Software Engineering, ISSN 0098-5589, 36(6): 742-762. [5] Mao Chengying; Yu Xinxin; Chen Jifu.(2012); Swarm Intelligence-Based Test Data Genera- tion for Structural Testing, Proceedings of 11th International Conference on Computer and Information Science (ICIS 12),623-628. [6] Suzette Person; Guowei Yang; Neha Rungta; Sarfraz Khurshid. (2012); Directed incremental symbolic execution, IACM SIGPLAN Notices, ISSN 0362-1340, 46(6): 504-515. Optimized Branch and Bound for Path-wise Test Data Generation 509 [7] Moore Ramon Edgar; R. Baker Kearfott; Michael J. Cloud.(2009);Introduction to interval analysis, Society for Industrial and Applied Mathematics, Philadelphia, PA, USA. [8] Richard A. DeMillo; A. Jefferson Offutt. (1991); Constraint-based automatic test data gen- eration, IEEE Transactions on Software Engineering, ISSN 0098-5589, 17(9): 900-910. [9] Arnaud Gotlieb; Bernard Botella; Michel Rueher.(1998);Automatic test data generation using constraint solving techniques, Proceedings of the 1998 ACM SIGSOFT International Sympo- sium on Software Testing and Analysis, 23(2):53-62. [10] Cristian Cadar; Daniel Dunbar; Dawson Engler.(2008);KLEE: Unassisted and automatic generation of high-coverage tests for complex systems programs, Proceedings of USENIX Symposium on Operating Systems Design and Implementation (OSDI 2008), 209-224. [11] Wang Yawen; Gong Yunzhan; Xiao Qing. (2013); A Method of Test Case Generation Based on Necessary Interval Set, Journal of Computer-Aided Design & Computer Graphics, ISSN 1003-9775, 25(4): 550-556. [12] A.E. Eiben; Zs Ruttkay.(1997);Constraint satisfaction problems, pp. C5.7:1-8, New York, NY, USA: IOP Publishing Ltd and Oxford University Press. [13] Ling-Ling Wang; Wen-Hsiang Tsai. (1988); Optimal assignment of task modules with prece- dence for distributed processing by graph matching and state-space search, BIT Numerical Mathematics, ISSN 0003-3835, 28(1): 54-68. [14] Lianbo Gao; Shashi K. Mishra; Jianming Shi. (2012); An extension of branch-and-bound algorithm for solving sum-of-nonlinear-ratios problem, Optimization Letters, ISSN 1862-4472, 6(2): 221-230. [15] Ying Xing; Junfei Huang; Yunzhan Gong; Yawen Wang; Xuzhou Zhang. (2014); An Intelli- gent Method Based on State Space Search for Automatic Test Case Generation, Journal of Software, ISSN 1796-217X, 9(2): 358-364. [16] Ying Xing; Junfei Huang; Yunzhan Gong; Yawen Wang; Xuzhou Zhang. (2014); Path-wise Test Data Generation Based on Heuristic Look-ahead Methods, Mathematical Problems in Engineering, ISSN 1024-123X,volume 2014, Article ID 642630. [17] Matthew J Gallagher; V. Lakshmi Narasimhan. (2012); Adtest: A test data generation suite for Ada software systems, IEEE Transactions on Software Engineering, ISSN 0098- 5589, 23(8): 473-484. [18] Mao Chengying; Yu Xinxin; Chen Jifu.(2012); Generating Test case for Structural Testing Based on Ant Colony Optimization, Proceedings of the 12th International Conference on Quality Software (QSIC12), 98-101. [19] Ammann Paul; Jeff Offutt.(2008); Introduction to Software Testing, Cambridge University Press, New York, NY, USA. [20] E. Alba; F. Chicano. (2008); Observation in Using Parallel and Sequential Evolutionary Algorithms for Automatic Software Testing, Computers & Operators Research, ISSN 0305- 0548, 35(10): 3161-3183.